Assert invariants.
Computes the total cross-entropy of the terms in the second half of the document based on an estimate of theta from the terms in the first half of the document. Returns (sum crossEntropy, numTerms). This is used as the basis of computePerplexity.
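A minimal sketch of this split-document evaluation, assuming base-2 cross-entropy and a dense beta; the class and parameter names are illustrative, not the actual API:

```java
// Sketch: given theta estimated from the first half of a document, sum the
// cross-entropy (in bits, an assumption) of the terms in the second half
// under the topic mixture.
public class HeldOutCrossEntropy {

    // theta[k]   : probability of topic k for this document.
    // beta[k][w] : probability of term w under topic k.
    public static double crossEntropyOfSecondHalf(int[] secondHalfTerms,
                                                  double[] theta,
                                                  double[][] beta) {
        double total = 0.0;
        for (int w : secondHalfTerms) {
            double p = 0.0;
            for (int k = 0; k < theta.length; k++) {
                p += theta[k] * beta[k][w];
            }
            total -= Math.log(p) / Math.log(2.0); // -log2 p(w), in bits
        }
        return total; // pair with secondHalfTerms.length for (sum, numTerms)
    }
}
```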
Computes the log probability for the current document. This measure treats the assignment to theta and the model counts as observed. Returns sum_i log P(w_i | theta*, beta*). Beta maps from (topic, term) to probability.
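A hedged sketch of the quantity described, sum_i log P(w_i | theta*, beta*), with theta and beta treated as fixed; names and the dense-array layout are assumptions:

```java
// Sketch: document log probability under an observed theta* and beta*.
public class DocLogProbability {

    // theta[k]   : probability of topic k for this document (theta*).
    // beta[k][w] : probability of term w under topic k (beta*).
    public static double logProbability(int[] terms, double[] theta, double[][] beta) {
        double logProb = 0.0;
        for (int w : terms) {
            double p = 0.0;
            for (int k = 0; k < theta.length; k++) {
                p += theta[k] * beta[k][w]; // P(w | theta*, beta*)
            }
            logProb += Math.log(p);
        }
        return logProb;
    }
}
```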
Computes the average per-word perplexity of the given dataset.
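A one-line sketch of how the (sum crossEntropy, numTerms) pair described above would yield per-word perplexity, assuming the cross-entropy is measured in bits (the base is an assumption; with natural log it would be Math.exp):

```java
// Sketch: average per-word perplexity from summed cross-entropy over a dataset.
public class Perplexity {
    public static double perplexity(double totalCrossEntropyBits, long numTerms) {
        return Math.pow(2.0, totalCrossEntropyBits / numTerms);
    }
}
```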
How many times each topic is seen overall.
How many times each term is seen in each topic.
Creates a document from the given document parameters.
Returns the distribution over terms for the given topic. The return value of this method is assumed to have already incorporated the corresponding getTermSmoothing to the appropriate extent.
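A sketch of the smoothed estimate this describes, folding the per-term prior counts (eta) into the topic's term counts; the names and the choice to normalize over (count + eta) are illustrative assumptions:

```java
// Sketch: topic-term distribution with add-k (eta) smoothing already applied.
public class TermDistribution {

    // termTopicCounts[w] : times term w was assigned to this topic.
    // eta[w]             : add-k prior count for term w.
    public static double[] termDistribution(int[] termTopicCounts, double[] eta) {
        double norm = 0.0;
        for (int w = 0; w < termTopicCounts.length; w++) {
            norm += termTopicCounts[w] + eta[w];
        }
        double[] dist = new double[termTopicCounts.length];
        for (int w = 0; w < termTopicCounts.length; w++) {
            dist[w] = (termTopicCounts[w] + eta[w]) / norm;
        }
        return dist;
    }
}
```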
Does inference on the given document until convergence.
Gets a thread-local inference sampler.
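One way such a thread-local sampler could be held, matching the class description's note that each thread gets its own random number generator to avoid synchronization overhead; the class and field names here are assumptions:

```java
import java.util.Random;

// Sketch: each thread lazily receives and reuses its own RNG/sampler.
public class Samplers {
    private static final ThreadLocal<Random> SAMPLER =
        ThreadLocal.withInitial(Random::new);

    public static Random get() {
        return SAMPLER.get(); // same instance for repeat calls on one thread
    }
}
```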
Where log messages go. Defaults to System.err.println.
The number of terms in the model.
The number of topics in the model.
Returns the probability of the given term in the given topic.
The parameters used to create this model.
Registers a function as a checker of invariants.
Resets to the default state.
Gets the current state of this object.
Sets the current state of this object.
Returns a human-readable summary of the current topic model.
The term index describing which terms are in the model.
Add-k prior counts for each term (eta in the model formulation).
Tokenizes the given input string using our stored tokenizer and term index, if available. Otherwise, throws an IllegalArgumentException.
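An illustrative sketch of the documented behavior: run the stored tokenizer, map tokens through the term index, and throw IllegalArgumentException when no tokenizer is available. The field names, the functional-interface tokenizer, and the choice to silently drop out-of-vocabulary tokens are all assumptions:

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.function.Function;

// Sketch: string -> term ids via a stored tokenizer and term index.
public class TokenizeSketch {
    private final Function<String, List<String>> tokenizer; // may be null
    private final Map<String, Integer> termIndex;

    public TokenizeSketch(Function<String, List<String>> tokenizer,
                          Map<String, Integer> termIndex) {
        this.tokenizer = tokenizer;
        this.termIndex = termIndex;
    }

    public List<Integer> tokenize(String input) {
        if (tokenizer == null) {
            throw new IllegalArgumentException("no tokenizer available");
        }
        List<Integer> termIds = new ArrayList<>();
        for (String token : tokenizer.apply(input)) {
            Integer id = termIndex.get(token);
            if (id != null) { // unknown terms are dropped in this sketch
                termIds.add(id);
            }
        }
        return termIds;
    }
}
```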
The tokenizer used to break input documents into terms.
The term index describing which terms are in the model.
Gets the name for this topic.
Prior counts for each topic (alpha in the model formulation).
Collapsed Gibbs sampler for LDA learning and inference. This class is not thread-safe for learning. It is thread-safe for inference, but no guarantees are provided about repeatability in a threaded environment if the number of threads differs between runs. This is because each thread is given its own random number generator to avoid synchronization overhead, so the sequence of random numbers seen on a particular document may be a function of the number of threads.
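A hedged sketch of one collapsed Gibbs sweep over a document's tokens, using the alpha and eta priors described above with symmetric scalar values for brevity; all names and array layouts are illustrative, not the actual implementation:

```java
import java.util.Random;

// Sketch of the standard collapsed Gibbs update for LDA:
//   p(z_i = k) proportional to (n_dk + alpha) * (n_kw + eta) / (n_k + V * eta)
// topicCounts[k]        : total assignments to topic k across the corpus (n_k)
// termTopicCounts[k][w] : assignments of term w to topic k (n_kw)
// docTopicCounts[k]     : assignments to topic k within this document (n_dk)
public class GibbsSweep {
    public static void sweep(int[] terms, int[] assignments,
                             int[] topicCounts, int[][] termTopicCounts,
                             int[] docTopicCounts,
                             double alpha, double eta, Random rng) {
        int numTopics = topicCounts.length;
        int numTerms = termTopicCounts[0].length; // V
        double[] p = new double[numTopics];
        for (int i = 0; i < terms.length; i++) {
            int w = terms[i];
            int old = assignments[i];
            // Remove the current assignment from all counts.
            topicCounts[old]--;
            termTopicCounts[old][w]--;
            docTopicCounts[old]--;
            // Unnormalized conditional over topics.
            double total = 0.0;
            for (int k = 0; k < numTopics; k++) {
                total += p[k] = (docTopicCounts[k] + alpha)
                        * (termTopicCounts[k][w] + eta)
                        / (topicCounts[k] + numTerms * eta);
            }
            // Sample the new topic and restore the counts.
            double u = rng.nextDouble() * total;
            int k = 0;
            for (; k < numTopics - 1 && u > p[k]; k++) u -= p[k];
            assignments[i] = k;
            topicCounts[k]++;
            termTopicCounts[k][w]++;
            docTopicCounts[k]++;
        }
    }
}
```

Because the sampled topic depends on the RNG, this is where the class's repeatability caveat comes from: per-thread generators mean the draw sequence for a document can change with the thread count.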