Topic models describe the frequency of topics in documents and text. A "topic" is a group of words which tend to occur together.
A topic model is a type of statistical model for discovering the abstract "topics" that occur in a collection of documents. Intuitively, given that a document is about a particular topic, one would expect particular words to appear in the document more or less frequently: "dog" and "bone" will appear more often in documents about dogs, "cat" and "meow" will appear in documents about cats (source: wikipedia)
Generative models (i.e. the statistical models used for topic modelling)
- Latent Dirichlet Allocation (LDA)
- Hierarchical Dirichlet process (HDP)
Software / Libraries
- Mallet (Java)
- Stanford Topic Modeling Toolbox (software)
- Gensim – Topic Modelling for Humans
Related Tags :