edu.stanford.nlp.process

Class StripTagsProcessor<L,F>

    • Field Summary

      Fields 
      Modifier and Type Field and Description
      static java.util.Set<java.lang.String> blockTags
      Block-level HTML tags that are rendered with surrounding line breaks.
    • Constructor Summary

      Constructors 
      Constructor and Description
      StripTagsProcessor()
      Constructs a new StripTagsProcessor that doesn't mark line breaks.
      StripTagsProcessor(boolean markLineBreaks)
      Constructs a new StripTagProcessor that marks line breaks as specified.
    • Method Summary

      Methods 
      Modifier and Type Method and Description
      boolean getMarkLineBreaks()
      Returns whether the output of the processor will contain newline words ("\n") at the end of block-level tags.
      static void main(java.lang.String[] args)
      For internal debugging purposes only.
      java.util.List<Word> process(java.util.List<? extends Word> in)
      Returns a new Document with the same meta-data as in, and the same words except tags are stripped.
      void setMarkLineBreaks(boolean markLineBreaks)
      Sets whether the output of the processor will contain newline words ("\n") at the end of block-level tags.
      • Methods inherited from class java.lang.Object

        clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
    • Field Detail

      • blockTags

        public static final java.util.Set<java.lang.String> blockTags
        Block-level HTML tags that are rendered with surrounding line breaks.
    • Constructor Detail

      • StripTagsProcessor

        public StripTagsProcessor()
        Constructs a new StripTagsProcessor that doesn't mark line breaks.
      • StripTagsProcessor

        public StripTagsProcessor(boolean markLineBreaks)
        Constructs a new StripTagProcessor that marks line breaks as specified.
    • Method Detail

      • getMarkLineBreaks

        public boolean getMarkLineBreaks()
        Returns whether the output of the processor will contain newline words ("\n") at the end of block-level tags.
        Returns:
        Whether the output of the processor will contain newline words ("\n") at the end of block-level tags.
      • setMarkLineBreaks

        public void setMarkLineBreaks(boolean markLineBreaks)
        Sets whether the output of the processor will contain newline words ("\n") at the end of block-level tags.
      • process

        public java.util.List<Word> process(java.util.List<? extends Word> in)
        Returns a new Document with the same meta-data as in, and the same words except tags are stripped.
      • main

        public static void main(java.lang.String[] args)
        For internal debugging purposes only.

Stanford NLP Group