edu.stanford.nlp.trees.international.arabic
Class ATBTreeUtils

java.lang.Object
  extended by edu.stanford.nlp.trees.international.arabic.ATBTreeUtils

public class ATBTreeUtils
extends java.lang.Object

Various static convenience methods for processing Arabic parse trees.

Author:
Spence Green

Field Summary
static java.lang.String morphBoundary
           
static java.lang.String puncTag
           
static java.util.Set<java.lang.String> reservedWords
           
static java.lang.String segMarker
           
 
Method Summary
static java.lang.String escape(java.lang.String s)
          Escapes tokens in a flat string that are reserved for use in the Arabic Treebank (ATB).
static java.lang.String flattenTree(Tree t)
          Returns the string associated with the input parse tree.
static void main(java.lang.String[] args)
           
static java.lang.String taggedStringFromTree(Tree t, boolean removeEscaping, java.lang.String separator)
          Converts a parse tree into a string of tokens.
static java.lang.String unEscape(java.lang.String s)
          Reverts escaping from a flat string.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

segMarker

public static java.lang.String segMarker

morphBoundary

public static final java.lang.String morphBoundary
See Also:
Constant Field Values

puncTag

public static final java.lang.String puncTag
See Also:
Constant Field Values

reservedWords

public static final java.util.Set<java.lang.String> reservedWords

Method Detail

escape

public static java.lang.String escape(java.lang.String s)
Escapes tokens in a flat string that are reserved for use in the Arabic Treebank (ATB).

Parameters:
s - An Arabic string
Returns:
A string with all reserved words replaced by the appropriate tokens

unEscape

public static java.lang.String unEscape(java.lang.String s)
Reverts escaping from a flat string.

Parameters:
s - An Arabic string
Returns:
A string with all escape tokens replaced by the reserved words they stand for
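
The round trip between escape and unEscape can be sketched as follows. This is a minimal, self-contained illustration and not the ATBTreeUtils implementation: the class name EscapeSketch and its token table are assumptions. Only the pairing of ")" with "-RRB-" is stated on this page; "-LRB-" for "(" follows the usual Penn Treebank convention, and "ktAb" is just a placeholder token.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Illustrative stand-in for an escape/unEscape pair; NOT the ATBTreeUtils source.
class EscapeSketch {
    // Assumed table: reserved token -> escaped form.
    private static final Map<String, String> ESCAPES = new LinkedHashMap<>();
    // Reverse table: escaped form -> reserved token.
    private static final Map<String, String> UNESCAPES = new LinkedHashMap<>();
    static {
        ESCAPES.put("(", "-LRB-");
        ESCAPES.put(")", "-RRB-");
        for (Map.Entry<String, String> e : ESCAPES.entrySet()) {
            UNESCAPES.put(e.getValue(), e.getKey());
        }
    }

    // Replace every reserved token with its escaped form.
    static String escape(String s) {
        return mapTokens(s, ESCAPES);
    }

    // Replace every escaped form with the reserved token it stands for.
    static String unEscape(String s) {
        return mapTokens(s, UNESCAPES);
    }

    // Split on whitespace, look each token up in the table, and rejoin with single spaces.
    private static String mapTokens(String s, Map<String, String> table) {
        StringBuilder sb = new StringBuilder();
        for (String tok : s.trim().split("\\s+")) {
            if (sb.length() > 0) {
                sb.append(' ');
            }
            sb.append(table.getOrDefault(tok, tok));
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        String escaped = escape("( ktAb )");
        System.out.println(escaped);           // -LRB- ktAb -RRB-
        System.out.println(unEscape(escaped)); // ( ktAb )
    }
}
```

The sketch treats the input as whitespace-delimited tokens, so a parenthesis attached to a word is left untouched. Whether the real methods tokenize the same way is not specified here.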

flattenTree

public static java.lang.String flattenTree(Tree t)
Returns the string associated with the input parse tree. Traces and ATB-specific escape sequences (e.g., "-RRB-" for ")") are removed.

Parameters:
t - A parse tree
Returns:
The yield of the input parse tree
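
The effect described above (dropping traces and undoing escape sequences while reading off the yield) can be sketched without the Tree class. This is a hedged illustration, not the library code: FlattenSketch is a hypothetical name, the input is simplified to a list of {tag, word} pairs in left-to-right order, "-NONE-" as the tag of trace elements is the Penn Treebank convention and is assumed to apply here, and the tags "PUNC" and "NN" are placeholders.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Illustrative only; NOT the ATBTreeUtils.flattenTree source.
class FlattenSketch {
    // Each element is {tag, word}, listed in left-to-right yield order.
    static String flatten(List<String[]> preterminals) {
        List<String> words = new ArrayList<>();
        for (String[] p : preterminals) {
            if (p[0].equals("-NONE-")) {
                continue; // assumed trace marker: contributes nothing to the yield
            }
            String w = p[1];
            if (w.equals("-LRB-")) {
                w = "(";
            } else if (w.equals("-RRB-")) {
                w = ")";
            }
            words.add(w);
        }
        // Joining with single spaces is an assumption about the output format.
        return String.join(" ", words);
    }

    public static void main(String[] args) {
        List<String[]> leaves = Arrays.asList(
                new String[] {"-NONE-", "*T*"},
                new String[] {"PUNC", "-LRB-"},
                new String[] {"NN", "ktAb"},
                new String[] {"PUNC", "-RRB-"});
        System.out.println(flatten(leaves)); // ( ktAb )
    }
}
```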

taggedStringFromTree

public static java.lang.String taggedStringFromTree(Tree t,
                                                    boolean removeEscaping,
                                                    java.lang.String separator)
Converts a parse tree into a string of tokens. Each token is a word and its POS tag, separated by the delimiter specified by separator.

Parameters:
t - A parse tree
removeEscaping - If true, remove LDC escape characters; otherwise, leave them.
separator - Word/tag separator
Returns:
A string of tagged words
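
The token format can be illustrated with a small self-contained sketch. It does not use the Stanford Tree class: TaggedStringSketch and its Node type are hypothetical, the word-before-tag order and the single space between tokens are assumptions read off the description above, and the removeEscaping option is left out for brevity.

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative only; NOT the ATBTreeUtils.taggedStringFromTree source.
class TaggedStringSketch {
    // Minimal tree node: a leaf holds a word, its parent (a preterminal) holds the POS tag.
    static final class Node {
        final String label;
        final List<Node> children = new ArrayList<>();

        Node(String label, Node... kids) {
            this.label = label;
            for (Node k : kids) {
                children.add(k);
            }
        }

        boolean isLeaf() {
            return children.isEmpty();
        }

        boolean isPreTerminal() {
            return children.size() == 1 && children.get(0).isLeaf();
        }
    }

    // Walk the tree left to right and emit word + separator + tag for each preterminal.
    static String taggedString(Node t, String separator) {
        StringBuilder sb = new StringBuilder();
        collect(t, separator, sb);
        return sb.toString();
    }

    private static void collect(Node t, String separator, StringBuilder sb) {
        if (t.isPreTerminal()) {
            if (sb.length() > 0) {
                sb.append(' ');
            }
            sb.append(t.children.get(0).label).append(separator).append(t.label);
            return;
        }
        for (Node c : t.children) {
            collect(c, separator, sb);
        }
    }

    public static void main(String[] args) {
        // (S (NP (NN ktAb) (JJ jdyd))) with placeholder words and tags
        Node tree = new Node("S",
                new Node("NP",
                        new Node("NN", new Node("ktAb")),
                        new Node("JJ", new Node("jdyd"))));
        System.out.println(taggedString(tree, "/")); // ktAb/NN jdyd/JJ
    }
}
```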

main

public static void main(java.lang.String[] args)


Stanford NLP Group