public class LogConditionalObjectiveFunction<L,F> extends AbstractStochasticCachingDiffUpdateFunction
Authors include: Angel Chang (support for in-place SGD - extends AbstractStochasticCachingDiffUpdateFunction), Christopher Manning (cleaned out the cruft and sped it up in 2014), and Keenon Werling (added some multithreading to the batch evaluations). The class was also templatized so that an Iterable<Datum<L, F>> can be passed in instead of a GeneralDataset<L, F>.

Nested classes/interfaces inherited from class AbstractStochasticCachingDiffFunction: AbstractStochasticCachingDiffFunction.SamplingMethod
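A minimal usage sketch follows. It assumes the companion classes Dataset, BasicDatum, and QNMinimizer from the same library (none of them documented on this page), assumes the single-dataset constructor produces a differentiable objective suitable for direct minimization, and uses toy data invented purely for illustration.

```java
import java.util.Arrays;

import edu.stanford.nlp.classify.Dataset;
import edu.stanford.nlp.classify.LogConditionalObjectiveFunction;
import edu.stanford.nlp.ling.BasicDatum;
import edu.stanford.nlp.optimization.QNMinimizer;

public class LogCondObjSketch {
  public static void main(String[] args) {
    // Toy two-class dataset with string labels and string (binary) features.
    Dataset<String, String> dataset = new Dataset<>();
    dataset.add(new BasicDatum<String, String>(Arrays.asList("rainy", "cold"), "stay-in"));
    dataset.add(new BasicDatum<String, String>(Arrays.asList("sunny", "warm"), "go-out"));

    // Conditional likelihood objective over the dataset (with whatever default
    // LogPrior the single-argument constructor installs).
    LogConditionalObjectiveFunction<String, String> objective =
        new LogConditionalObjectiveFunction<>(dataset);

    // Batch training: minimize the objective with a quasi-Newton optimizer,
    // starting from the all-zeros weight vector.
    QNMinimizer minimizer = new QNMinimizer();
    double[] weights =
        minimizer.minimize(objective, 1e-4, new double[objective.domainDimension()]);
    System.out.println("Learned " + weights.length + " weights (flat f(x,y) vector).");
  }
}
```

Stochastic training would instead go through the calculateStochasticUpdate / calculateStochasticGradient entry points summarized below.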
Modifier and Type | Field and Description |
---|---|
protected int[][] | data - Normally, this contains the data. |
protected java.lang.Iterable<Datum<L,F>> | dataIterable - Alternatively, the data may be available from an Iterable in not yet indexed form. |
protected float[] | dataWeights |
protected double[] | derivativeNumerator - This is used to cache the numerator in batch methods. |
protected Index<F> | featureIndex |
protected Index<L> | labelIndex |
protected int[] | labels - The label of each data index. |
protected int | numClasses |
protected int | numFeatures |
protected boolean | parallelGradientCalculation - The flag to tell the gradient computations to multithread over the data. |
protected LogPrior | prior |
protected double[] | priorDerivative - The only reason this is around is because the Prior Functions don't handle stochastic calculations yet. |
protected int | threads - Multithreading gradient calculations is a bit cheaper if you reuse the threads. |
protected boolean | useSummedConditionalLikelihood |
protected double[][] | values - Same size as data if the features have values; null if the features are binary. |
Fields inherited from class AbstractStochasticCachingDiffUpdateFunction: skipValCalc
Fields inherited from class AbstractStochasticCachingDiffFunction: allIndices, curElement, finiteDifferenceStepSize, gradPerturbed, hasNewVals, HdotV, lastBatch, lastBatchSize, lastElement, lastVBatch, lastXBatch, method, randGenerator, recalculatePrevBatch, returnPreviousValues, sampleMethod, scaleUp, thisBatch, xPerturbed
Fields inherited from class AbstractCachingDiffFunction: derivative, value
Constructor and Description |
---|
LogConditionalObjectiveFunction(GeneralDataset<L,F> dataset) |
LogConditionalObjectiveFunction(GeneralDataset<L,F> dataset, float[] dataWeights, LogPrior prior) |
LogConditionalObjectiveFunction(GeneralDataset<L,F> dataset, LogPrior prior) |
LogConditionalObjectiveFunction(GeneralDataset<L,F> dataset, LogPrior prior, boolean useSumCondObjFun) |
LogConditionalObjectiveFunction(GeneralDataset<L,F> dataset, LogPrior prior, boolean useSumCondObjFun, float[] dataWeights) - Version passing in a GeneralDataset, which may have binary or real-valued features. |
LogConditionalObjectiveFunction(int numFeatures, int numClasses, int[][] data, double[][] values, int[] labels, int intPrior, double sigma, double epsilon) - For real-valued features. |
LogConditionalObjectiveFunction(int numFeatures, int numClasses, int[][] data, int[] labels) |
LogConditionalObjectiveFunction(int numFeatures, int numClasses, int[][] data, int[] labels, boolean useSumCondObjFun) |
LogConditionalObjectiveFunction(int numFeatures, int numClasses, int[][] data, int[] labels, float[] dataWeights) |
LogConditionalObjectiveFunction(int numFeatures, int numClasses, int[][] data, int[] labels, float[] dataWeights, LogPrior prior) |
LogConditionalObjectiveFunction(int numFeatures, int numClasses, int[][] data, int[] labels, float[] dataWeights, LogPrior prior, boolean useSummedConditionalLikelihood) |
LogConditionalObjectiveFunction(int numFeatures, int numClasses, int[][] data, int[] labels, int intPrior, double sigma, double epsilon) |
LogConditionalObjectiveFunction(int numFeatures, int numClasses, int[][] data, int[] labels, LogPrior prior) |
LogConditionalObjectiveFunction(java.lang.Iterable<Datum<L,F>> dataIterable, LogPrior logPrior, Index<F> featureIndex, Index<L> labelIndex) - Version where an Iterable is passed in for the data. |
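As a hedged illustration of the pre-indexed constructors above (the toy numbers are invented for this sketch, and the reading that each row of data holds the indices of the binary features that fire for one datum is inferred from the data/values field descriptions, not quoted from this page):

```java
import edu.stanford.nlp.classify.LogConditionalObjectiveFunction;

public class RawArrayConstructorSketch {
  public static LogConditionalObjectiveFunction<String, String> build() {
    // A tiny, hand-indexed problem: 3 binary features, 2 classes, 3 data points.
    int numFeatures = 3;
    int numClasses = 2;
    int[][] data = {
        {0, 1},   // datum 0: features 0 and 1 fire
        {1, 2},   // datum 1: features 1 and 2 fire
        {0, 2}    // datum 2: features 0 and 2 fire
    };
    int[] labels = {0, 1, 1};  // gold class index for each datum

    return new LogConditionalObjectiveFunction<String, String>(
        numFeatures, numClasses, data, labels);
  }
}
```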
Modifier and Type | Method and Description |
---|---|
protected void | calculate(double[] x) - Calculate the conditional likelihood. |
void | calculateStochastic(double[] x, double[] v, int[] batch) - This function is used to come up with an estimate of the value/gradient based on only a small portion of the data (referred to as the batch size, for lack of a better term). |
protected void | calculateStochasticAlgorithmicDifferentiation(double[] x, double[] v, int[] batch) |
void | calculateStochasticFiniteDifference(double[] x, double[] v, double h, int[] batch) |
void | calculateStochasticGradient(double[] x, int[] batch) - Performs stochastic gradient calculation based on samples indexed by batch; does not apply regularization. |
void | calculateStochasticGradientLocal(double[] x, int[] batch) |
double | calculateStochasticUpdate(double[] x, double xscale, int[] batch, double gain) - Performs stochastic update of weights x (scaled by xscale) based on samples indexed by batch. |
int | dataDimension() - Data dimension must return the size of the data used by the function. |
int | domainDimension() - Returns the number of dimensions in the function's domain. |
protected int | indexOf(int f, int c) - Converts a Phi feature number and class index into an f(x,y) feature index. |
protected void | rvfcalculate(double[] x) - Calculate conditional likelihood for datasets with real-valued features. |
double[][] | to2D(double[] x) |
double | valueAt(double[] x, double xscale, int[] batch) - Computes the value of the function for the specified value of x (scaled by xscale), only over samples indexed by batch. |
Methods inherited from class AbstractStochasticCachingDiffUpdateFunction: calculateStochasticGradient, calculateStochasticUpdate, getSample, valueAt
Methods inherited from class AbstractStochasticCachingDiffFunction: clearCache, decrementBatch, derivativeAt, derivativeAt, getBatch, HdotVAt, HdotVAt, HdotVAt, incrementBatch, incrementRandom, initial, lastDerivative, lastValue, scaleUp, valueAt, valueAt
Methods inherited from class AbstractCachingDiffFunction: copy, derivativeAt, ensure, getDerivative, gradientCheck, gradientCheck, randomInitial, valueAt
protected final LogPrior prior
protected final int numFeatures
protected final int numClasses
protected final int[][] data
protected final java.lang.Iterable<Datum<L,F>> dataIterable
protected final double[][] values
protected final int[] labels
protected final float[] dataWeights
protected final boolean useSummedConditionalLikelihood
protected double[] derivativeNumerator
protected double[] priorDerivative
protected boolean parallelGradientCalculation
protected int threads
public LogConditionalObjectiveFunction(GeneralDataset<L,F> dataset)
public LogConditionalObjectiveFunction(GeneralDataset<L,F> dataset, LogPrior prior)
public LogConditionalObjectiveFunction(GeneralDataset<L,F> dataset, float[] dataWeights, LogPrior prior)
public LogConditionalObjectiveFunction(GeneralDataset<L,F> dataset, LogPrior prior, boolean useSumCondObjFun)
public LogConditionalObjectiveFunction(GeneralDataset<L,F> dataset, LogPrior prior, boolean useSumCondObjFun, float[] dataWeights)
public LogConditionalObjectiveFunction(java.lang.Iterable<Datum<L,F>> dataIterable, LogPrior logPrior, Index<F> featureIndex, Index<L> labelIndex)
public LogConditionalObjectiveFunction(int numFeatures, int numClasses, int[][] data, int[] labels, boolean useSumCondObjFun)
public LogConditionalObjectiveFunction(int numFeatures, int numClasses, int[][] data, int[] labels)
public LogConditionalObjectiveFunction(int numFeatures, int numClasses, int[][] data, int[] labels, LogPrior prior)
public LogConditionalObjectiveFunction(int numFeatures, int numClasses, int[][] data, int[] labels, float[] dataWeights)
public LogConditionalObjectiveFunction(int numFeatures, int numClasses, int[][] data, int[] labels, float[] dataWeights, LogPrior prior)
public LogConditionalObjectiveFunction(int numFeatures, int numClasses, int[][] data, int[] labels, float[] dataWeights, LogPrior prior, boolean useSummedConditionalLikelihood)
public LogConditionalObjectiveFunction(int numFeatures, int numClasses, int[][] data, int[] labels, int intPrior, double sigma, double epsilon)
public LogConditionalObjectiveFunction(int numFeatures, int numClasses, int[][] data, double[][] values, int[] labels, int intPrior, double sigma, double epsilon)
public int domainDimension()
Returns the number of dimensions in the function's domain.
Specified by: domainDimension in interface Function

public int dataDimension()
Data dimension must return the size of the data used by the function.
Specified by: dataDimension in class AbstractStochasticCachingDiffFunction
protected int indexOf(int f, int c)
public double[][] to2D(double[] x)
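The relationship between indexOf(int, int) and to2D(double[]) can be pictured as below. The row-major layout (index = f * numClasses + c) is an assumption made for this sketch based on the f(x,y) description in the method summary; check the source if you rely on it.

```java
// Illustration only: the layout below is an assumption, not quoted from this page.
public class IndexingSketch {
  static double weightFor(double[] flat, int numClasses, int f, int c) {
    // Assumed flat f(x,y) layout: (feature f, class c) -> f * numClasses + c.
    // to2D(flat) would then expose the same weights as a [feature][class] matrix.
    return flat[f * numClasses + c];
  }
}
```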
protected void calculate(double[] x)
Calculate the conditional likelihood. If useSummedConditionalLikelihood is false (the default), this calculates standard (product) CL; otherwise it calculates summed CL. What's the difference? See Klein and Manning's 2002 EMNLP paper.
Specified by: calculate in class AbstractCachingDiffFunction
Parameters: x - The point at which to calculate the function
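For reference, the distinction can be written out as follows (my paraphrase of the standard/summed split; see Klein and Manning 2002 for the original discussion):

```latex
% Standard (product) conditional likelihood, optimized as a sum of logs:
\mathcal{L}_{\mathrm{CL}}(\theta)  = \sum_{i} \log P_{\theta}(y_i \mid x_i)
% Summed conditional likelihood:
\mathcal{L}_{\mathrm{SCL}}(\theta) = \sum_{i} P_{\theta}(y_i \mid x_i)
```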
public void calculateStochastic(double[] x, double[] v, int[] batch)
This function is used to come up with an estimate of the value/gradient based on only a small portion of the data (referred to as the batch size, for lack of a better term).
Specified by: calculateStochastic in class AbstractStochasticCachingDiffFunction
Parameters:
x - Value to evaluate at
v - The vector for the Hessian vector product H.v
batch - An array containing the indices of the data to use in the calculation; this array is computed internally by the abstract class, so the implementation only needs to consume it, not generate it.

public void calculateStochasticFiniteDifference(double[] x, double[] v, double h, int[] batch)
public void calculateStochasticGradientLocal(double[] x, int[] batch)
public double valueAt(double[] x, double xscale, int[] batch)
Computes the value of the function for the specified value of x (scaled by xscale), only over samples indexed by batch.
Specified by: valueAt in class AbstractStochasticCachingDiffUpdateFunction
Parameters:
x - unscaled weights
xscale - how much to scale x by when performing calculations
batch - indices of which samples to compute the function over

public double calculateStochasticUpdate(double[] x, double xscale, int[] batch, double gain)
Performs stochastic update of weights x (scaled by xscale) based on samples indexed by batch.
Specified by: calculateStochasticUpdate in class AbstractStochasticCachingDiffUpdateFunction
Parameters:
x - unscaled weights
xscale - how much to scale x by when performing calculations
batch - indices of which samples to compute the function over
gain - how much to scale adjustments to x

public void calculateStochasticGradient(double[] x, int[] batch)
Performs stochastic gradient calculation based on samples indexed by batch; does not apply regularization.
Specified by: calculateStochasticGradient in class AbstractStochasticCachingDiffUpdateFunction
Parameters:
x - Unscaled weights
batch - Indices of which samples to compute the function over

protected void calculateStochasticAlgorithmicDifferentiation(double[] x, double[] v, int[] batch)
protected void rvfcalculate(double[] x)
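To make the stochastic-update entry points above concrete, here is a bare-bones SGD-style loop driven directly by calculateStochasticUpdate. It is a sketch under several assumptions: batch indices range over [0, dataDimension()), xscale = 1.0 means the weights are used as-is, and a small positive gain acts as the learning rate; the library's own stochastic minimizers are the intended drivers of this API.

```java
import java.util.Random;

import edu.stanford.nlp.classify.LogConditionalObjectiveFunction;

public class SgdLoopSketch {
  /** Updates and returns a weight vector by repeatedly calling calculateStochasticUpdate. */
  public static double[] run(LogConditionalObjectiveFunction<String, String> objective,
                             int epochs, int batchSize, double learningRate) {
    double[] x = new double[objective.domainDimension()];   // weights, updated in place
    int n = objective.dataDimension();                       // number of data points
    Random rng = new Random(42);
    for (int epoch = 0; epoch < epochs; epoch++) {
      for (int step = 0; step < Math.max(1, n / batchSize); step++) {
        int[] batch = new int[batchSize];
        for (int i = 0; i < batchSize; i++) {
          batch[i] = rng.nextInt(n);                         // sample data indices with replacement
        }
        // xscale = 1.0: x is not pre-scaled; gain scales the adjustment applied to x.
        objective.calculateStochasticUpdate(x, 1.0, batch, learningRate);
      }
    }
    return x;
  }
}
```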