
Presentation Transcript

Slide 1

Annealing Paths for the Evaluation of Topic Models

James Foulds, Padhraic Smyth
Department of Computer Science, University of California, Irvine*

*James Foulds has recently moved to the University of California, Santa Cruz

Slides 2-4

Motivation

Topic model extensions
Structure, prior knowledge and constraints: sparse, nonparametric, correlated, tree-structured, time series, supervised, focused, determinantal, ...
Special-purpose models: authorship, scientific impact, political affiliation, conversational influence, networks, machine translation, ...
General-purpose models: Dirichlet multinomial regression (DMR), sparse additive generative (SAGE), structural topic model (STM), ...

Slides 5-7

Motivation

Inference algorithms for topic models
Optimization: EM, variational inference, collapsed variational inference, ...
Sampling: collapsed Gibbs sampling, Langevin dynamics, ...
Scaling up to "big data": stochastic algorithms, distributed algorithms, MapReduce, sparse data structures, ...

Slide 8

Motivation

Which existing techniques should we use?
Is my new model/algorithm better than previous methods?

Slides 9-12

Evaluating Topic Models

[Diagram: fit a topic model on the training set, then evaluate it by predicting the held-out test set and computing log Pr(test documents | trained topic model).]

Slide 13

Evaluating Topic Models

Fitting these models took only a few hours on a single-core machine.
Creating this plot required a cluster.

(Foulds et al., 2013)

Slide 14

Why is this Difficult?

For every held-out document d, we need to estimate its probability under the trained model.
We need to approximate possibly tens of thousands of intractable sums/integrals!
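For concreteness, in standard LDA notation (my notation, not necessarily the slide's: topic-word distributions \Phi, Dirichlet prior \alpha), the per-document quantity being estimated is the marginal likelihood

    p(\mathbf{w}_d \mid \Phi, \alpha)
      = \int p(\theta_d \mid \alpha) \prod_{n=1}^{N_d} \sum_{k=1}^{K} \theta_{dk} \, \phi_{k, w_{dn}} \; d\theta_d ,

where the integral over \theta_d (equivalently, the sum over all topic assignments) has no closed form.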

Slide 15

Annealed Importance Sampling(Neal, 2001)

Scales up importance sampling to high-dimensional data, using MCMC.
Corrects for MCMC convergence failures using importance weights.

Slides 16-19

Annealed Importance Sampling (Neal, 2001)

[Figure: a sequence of intermediate distributions interpolating between a low "temperature" distribution and a high "temperature" distribution.]

Slide 20

Annealed Importance Sampling (Neal, 2001)

Importance samples from the target
An estimate of the ratio of partition functions
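A minimal generic sketch of the AIS estimator, to make these two outputs concrete. This is illustrative code of my own, not the authors' implementation; the function names (log_f, sample_p0, mcmc_step) and the schedule are assumptions.

    import numpy as np

    def ais_log_ratio(log_f, betas, sample_p0, mcmc_step, n_chains=100):
        """Annealed importance sampling (Neal, 2001), as a generic sketch.

        log_f(x, beta)  : log unnormalized density at inverse temperature beta
                          (beta = 0 is the tractable start, beta = 1 is the target)
        betas           : increasing schedule, e.g. np.linspace(0.0, 1.0, 1000)
        sample_p0()     : exact sample from the normalized beta = 0 distribution
        mcmc_step(x, b) : one MCMC transition leaving the beta = b distribution invariant
        Returns a log-space estimate of Z_target / Z_start.
        """
        log_w = np.zeros(n_chains)
        for i in range(n_chains):
            x = sample_p0()
            for b_prev, b in zip(betas[:-1], betas[1:]):
                # accumulate the importance weight, then move the chain
                log_w[i] += log_f(x, b) - log_f(x, b_prev)
                x = mcmc_step(x, b)
        # log of the average weight: log( (1/N) * sum_i exp(log_w[i]) )
        return np.logaddexp.reduce(log_w) - np.log(n_chains)

The final chain states, weighted by exp(log_w), are the importance samples from the target (not returned in this sketch); the returned value is the log of the estimated ratio of partition functions.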

Slides 21-24

AIS for Evaluating Topic Models (Wallach et al., 2009)

Draw from the prior; anneal towards the posterior.
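Spelled out in my notation (the slide itself gives only the picture), the intermediate distributions for a held-out document d are the tempered posteriors over its topic assignments z_d:

    f_{\beta_t}(\mathbf{z}_d) \;=\; p(\mathbf{w}_d \mid \mathbf{z}_d, \Phi)^{\beta_t} \, p(\mathbf{z}_d \mid \alpha),
    \qquad 0 = \beta_0 < \beta_1 < \dots < \beta_T = 1 .

At beta = 0 this is the prior over z_d, which can be sampled exactly; at beta = 1 it is the unnormalized posterior, whose normalizing constant is the held-out likelihood p(w_d | \Phi, \alpha), so the averaged AIS weights estimate exactly the quantity needed for evaluation.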

Slide 25

Insights

We are mainly interested in the relative performance of topic models.
AIS can provide estimates of the ratio of partition functions of any two distributions that we can anneal between.

Slide 26

[Figure: annealing between a low "temperature" distribution and a high "temperature" distribution; a standard application of Annealed Importance Sampling (Neal, 2001).]

Slides 27-29

The Proposed Method: Ratio-AIS

Draw from Topic Model 2; anneal towards Topic Model 1.

[Figure: the annealing path runs directly between the two topic models, both at a medium "temperature".]
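As I read these slides, the two endpoints are the unnormalized posteriors over z_d under the two models, so a single AIS run estimates the per-document likelihood ratio directly (notation mine):

    f_0(\mathbf{z}_d) = p(\mathbf{w}_d, \mathbf{z}_d \mid \Phi^{(2)}, \alpha^{(2)}),
    \qquad
    f_T(\mathbf{z}_d) = p(\mathbf{w}_d, \mathbf{z}_d \mid \Phi^{(1)}, \alpha^{(1)}),
    \qquad
    \frac{Z_T}{Z_0} \;=\; \frac{p(\mathbf{w}_d \mid \Phi^{(1)}, \alpha^{(1)})}{p(\mathbf{w}_d \mid \Phi^{(2)}, \alpha^{(2)})} .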

Slide 30

Advantages of Ratio-AIS

Ratio-AIS avoids several sources of Monte Carlo error for comparing two models. The standard method:
estimates the denominator of a ratio even though it is a constant (= 1),
uses different z's for both models,
and is run twice, introducing Monte Carlo noise each time.

An easy convergence check: anneal in the reverse direction to compute the reciprocal.
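Concretely (my phrasing): if the forward run returns an estimate of p_1(w_d) / p_2(w_d) and the reverse run returns an estimate of p_2(w_d) / p_1(w_d), then

    \log \hat r_{\text{fwd}} + \log \hat r_{\text{rev}} \;\approx\; 0

should hold when the annealing has converged; a large gap flags a failure.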

Slide 31

Annealing Paths Between Topic Models

Geometric average of the two distributions
Convex combination of the parameters
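Written out in generic notation (mine; the slide's exact formulas are not reproduced here), one natural form of the two paths between models (\Phi^{(1)}, \alpha^{(1)}) and (\Phi^{(2)}, \alpha^{(2)}), for \beta_t running from 0 to 1, is

    % geometric average of the two (unnormalized) distributions
    f_{\beta_t}(\mathbf{z}_d) \;=\;
      p(\mathbf{w}_d, \mathbf{z}_d \mid \Phi^{(1)}, \alpha^{(1)})^{\beta_t} \,
      p(\mathbf{w}_d, \mathbf{z}_d \mid \Phi^{(2)}, \alpha^{(2)})^{1-\beta_t}

    % convex combination of the parameters
    \Phi_t = \beta_t \, \Phi^{(1)} + (1-\beta_t) \, \Phi^{(2)},
    \qquad
    f_{\beta_t}(\mathbf{z}_d) \;=\; p(\mathbf{w}_d, \mathbf{z}_d \mid \Phi_t, \alpha_t),

with \alpha_t handled analogously when the hyperparameters differ.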

Slide 32

Efficiently Plotting Performance Per Iteration of the Learning Algorithm

(Foulds et al., 2013)

Slide 33

Insights

We can select the AIS intermediate distributions to be distributions of interest.
The sequence of models we reach during training is typically amenable to annealing:
The early models are often low temperature.
Each successive model is similar to the previous one.

Slides 34-35

Iteration-AIS

Re-uses all previous computation
Warm starts
More annealing temperatures, for free
Importance weights can be computed recursively

[Diagram: a chain of annealing runs. The first anneals from the prior to the topic model at iteration 1 (as in Wallach et al.); each subsequent Ratio-AIS run anneals from the topic model at one iteration to the topic model at the next (iteration 2, ..., iteration N).]
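My reading of the diagram, in generic notation with \theta_i denoting the model after training iteration i: the held-out log-likelihood at iteration N is assembled from one standard AIS run plus a chain of Ratio-AIS estimates,

    \log \hat p_{\theta_N}(\mathbf{w}_d)
      \;=\; \log \hat p_{\theta_1}(\mathbf{w}_d)
      \;+\; \sum_{i=2}^{N} \log \widehat{\left( \frac{p_{\theta_i}(\mathbf{w}_d)}{p_{\theta_{i-1}}(\mathbf{w}_d)} \right)} ,

so each new training iteration only requires annealing from the previous model to the current one, which is what makes the warm starts and recursive importance weights possible.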

Slide 36

Comparing Very Similar Topic Models (ACL Corpus)

Slide 37

Comparing Very Similar Topic Models (ACL and NIPS)

[Figure: "% Accuracy" results.]

Slides 38-40

Symmetric vs Asymmetric Priors (NIPS, 1000 temperatures or equiv.)

[Figures: correlation with a longer left-to-right run; variance of the estimate of relative log-likelihood.]

Slides 41-42

Per-Iteration Evaluation, ACL Dataset

Slide 43

Conclusions

Use Ratio-AIS for detailed document-level analysis.
Run the annealing in both directions to check for convergence failures.
Use Left to Right for corpus-level analysis.
Use Iteration-AIS to evaluate training algorithms.

Slide 44

Future Directions

The Ratio-AIS and Iteration-AIS ideas can potentially be applied to other models with intractable likelihoods or partition functions (e.g. RBMs, ERGMs).
Other annealing paths may be possible.
Evaluating topic models remains an important, computationally challenging problem.

Slide 45

Thank You!

Questions?