edu.stanford.nlp.international.arabic
Class Buckwalter
java.lang.Object
edu.stanford.nlp.international.arabic.Buckwalter
- All Implemented Interfaces:
- SerializableFunction<java.lang.String,java.lang.String>, Function<java.lang.String,java.lang.String>, java.io.Serializable
public class Buckwalter
- extends java.lang.Object
- implements SerializableFunction<java.lang.String,java.lang.String>
Converts Arabic text between Unicode and the Buckwalter transliteration,
an ASCII encoding of Arabic script. Conversion is supported in both directions.
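As a rough illustration of what a conversion between the two encodings involves, the following is a minimal sketch of a character-for-character mapping. It is not the Stanford implementation: the class name BuckwalterSketch, its method names, and the six-letter table are assumptions made for this example, and the real Buckwalter table covers the full Arabic letter and diacritic set.

```java
import java.util.HashMap;
import java.util.Map;

// Illustrative sketch only; NOT the edu.stanford.nlp Buckwalter class.
final class BuckwalterSketch {
    // A small, hypothetical subset of the Buckwalter table:
    // {Buckwalter ASCII symbol, Unicode Arabic letter}.
    private static final char[][] PAIRS = {
        {'A', '\u0627'}, // alif
        {'b', '\u0628'}, // ba
        {'t', '\u062A'}, // ta
        {'k', '\u0643'}, // kaf
        {'l', '\u0644'}, // lam
        {'m', '\u0645'}, // mim
    };

    private static final Map<Character, Character> TO_UNICODE = new HashMap<>();
    private static final Map<Character, Character> TO_BUCKWALTER = new HashMap<>();

    static {
        // Build both lookup directions from the single table above.
        for (char[] pair : PAIRS) {
            TO_UNICODE.put(pair[0], pair[1]);
            TO_BUCKWALTER.put(pair[1], pair[0]);
        }
    }

    // Maps each character through the table; unmapped characters pass through unchanged.
    private static String convert(String in, Map<Character, Character> table) {
        StringBuilder out = new StringBuilder(in.length());
        for (int i = 0; i < in.length(); i++) {
            char c = in.charAt(i);
            char mapped = table.getOrDefault(c, c);
            out.append(mapped);
        }
        return out.toString();
    }

    static String toUnicode(String buckwalter) {
        return convert(buckwalter, TO_UNICODE);
    }

    static String toBuckwalter(String unicode) {
        return convert(unicode, TO_BUCKWALTER);
    }

    public static void main(String[] args) {
        // Round trip: Buckwalter -> Unicode -> Buckwalter.
        String arabic = toUnicode("ktAb");
        System.out.println(toBuckwalter(arabic));
    }
}
```

Because the mapping is one character to one character, the two directions are exact inverses for every symbol in the table, which is why a single table can serve both lookups.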
- Author:
- Christopher Manning
- See Also:
- Serialized Form
Nested Class Summary
static class Buckwalter.TreeConverter
Converts Penn-style trees from Buckwalter to UTF-8 Arabic and back.
Method Summary
java.lang.String apply(java.lang.String in)
Converts the input string between the Unicode and Buckwalter encodings.
java.lang.String buckwalterToUnicode(java.lang.String in)
Converts Buckwalter-encoded text to Unicode Arabic.
static void main(java.lang.String[] args)
java.lang.String unicodeToBuckwalter(java.lang.String in)
Converts Unicode Arabic text to its Buckwalter encoding.
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
Buckwalter
public Buckwalter()
Buckwalter
public Buckwalter(boolean unicodeToBuckwalter)
apply
public java.lang.String apply(java.lang.String in)
- Description copied from interface: Function
- Converts a T1 to a different T2. For example, a Parser will convert a Sentence to a Tree. A Tagger will convert a Sentence to a TaggedSentence.
- Specified by:
- apply in interface Function<java.lang.String,java.lang.String>
- Parameters:
- in - The function's argument
- Returns:
- The function's evaluated value
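Exposing the conversion through a single-argument apply method means a converter object can be passed wherever a String-to-String function is expected. The sketch below shows that pattern under stated assumptions: DirectionalConverter is a hypothetical stand-in (not the Stanford class), java.util.function.Function stands in for the Function interface named above, and the two-letter table is a toy. The page does not document how the unicodeToBuckwalter constructor flag affects apply, so the dispatch shown here is an assumption.

```java
import java.util.Arrays;
import java.util.List;
import java.util.function.Function;
import java.util.stream.Collectors;

// Hypothetical stand-in, NOT the edu.stanford.nlp Buckwalter class.
final class DirectionalConverter implements Function<String, String> {
    // Assumed semantics: true means Unicode -> Buckwalter, false means the reverse.
    private final boolean unicodeToBuckwalter;

    DirectionalConverter(boolean unicodeToBuckwalter) {
        this.unicodeToBuckwalter = unicodeToBuckwalter;
    }

    @Override
    public String apply(String in) {
        // Toy two-letter table (ba <-> b, alif <-> A); all other characters pass through.
        return unicodeToBuckwalter
            ? in.replace('\u0628', 'b').replace('\u0627', 'A')
            : in.replace('b', '\u0628').replace('A', '\u0627');
    }

    // Applies any String-to-String function to every token in a list.
    static List<String> convertAll(List<String> tokens, Function<String, String> f) {
        return tokens.stream().map(f).collect(Collectors.toList());
    }

    public static void main(String[] args) {
        List<String> arabic = Arrays.asList("\u0628\u0627\u0628", "\u0623\u0628");
        // The direction is fixed once at construction, then reused for every token.
        System.out.println(convertAll(arabic, new DirectionalConverter(true)));
    }
}
```

Fixing the direction at construction time keeps the call site to one argument, which is what lets the same object be handed to generic code such as convertAll.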
buckwalterToUnicode
public java.lang.String buckwalterToUnicode(java.lang.String in)
unicodeToBuckwalter
public java.lang.String unicodeToBuckwalter(java.lang.String in)
main
public static void main(java.lang.String[] args)
throws java.io.IOException
- Throws:
java.io.IOException
Stanford NLP Group