Class LanguageDetectorSampleStream

  • All Implemented Interfaces:
    AutoCloseable, ObjectStream<LanguageSample>

    public class LanguageDetectorSampleStream
    extends FilterObjectStream<String,LanguageSample>
    This class reads string-encoded training samples, parses them, and outputs LanguageSample objects.

    Format:
    Each line contains one sample document.
    Each line begins with the language string, followed by a tab character and then the document content.
    Sample line: category-string tab-char document line-break-char(s)
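
    The line format above can be illustrated with a small, self-contained sketch. It is not the OpenNLP implementation: SampleLineParser is a hypothetical helper that uses only the JDK, and returning null for a line without a usable tab separator is an assumption made for this example.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.Map;

// Illustrative only: SampleLineParser is a hypothetical helper, not part of OpenNLP.
final class SampleLineParser {

    // Splits "language<TAB>document text" at the first tab character.
    // Returns null when the line is null, has no tab, or has an empty
    // language or document part (an assumption made for this sketch).
    static Map.Entry<String, String> parse(String line) {
        if (line == null) {
            return null;
        }
        int tab = line.indexOf('\t');
        if (tab <= 0 || tab == line.length() - 1) {
            return null;
        }
        String language = line.substring(0, tab);
        String text = line.substring(tab + 1);
        return new SimpleEntry<>(language, text);
    }

    public static void main(String[] args) {
        Map.Entry<String, String> sample = parse("eng\tThis is an English sentence.");
        System.out.println(sample.getKey() + " -> " + sample.getValue());
    }
}
```

    Only the first tab is treated as the separator in this sketch, so any later tabs remain part of the document content.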

    • Constructor Detail

      • LanguageDetectorSampleStream

        public LanguageDetectorSampleStream(ObjectStream<String> samples)
    • Method Detail

      • read

        public LanguageSample read()
                            throws IOException
        Description copied from interface: ObjectStream
        Returns the next object. Calling this method repeatedly until it returns null will return each object from the underlying source exactly once.
        Returns:
        the next object or null to signal that the stream is exhausted
        Throws:
        IOException - if there is an error during reading
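
        The read-until-null contract described above is typically consumed with a simple loop. The sketch below uses a hypothetical stand-in interface named Source so that it runs with the JDK alone; it mirrors only the read() signature documented here and is not the OpenNLP ObjectStream type.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;

// Illustrative only: ReadLoopSketch and Source are hypothetical names, not OpenNLP classes.
final class ReadLoopSketch {

    // Stand-in for the documented contract: read() returns the next
    // object, or null once the stream is exhausted.
    interface Source<T> {
        T read() throws IOException;
    }

    // Calls read() repeatedly until it returns null and collects every object.
    static <T> List<T> drain(Source<T> source) throws IOException {
        List<T> out = new ArrayList<>();
        T item;
        while ((item = source.read()) != null) {
            out.add(item);
        }
        return out;
    }

    // Wraps an in-memory list as a Source, for demonstration purposes.
    static <T> Source<T> fromList(List<T> items) {
        Iterator<T> it = items.iterator();
        return () -> it.hasNext() ? it.next() : null;
    }

    public static void main(String[] args) throws IOException {
        List<String> lines = List.of("eng\tHello there.", "deu\tGuten Tag.");
        for (String line : drain(fromList(lines))) {
            System.out.println(line);
        }
    }
}
```

        Because null marks the end of the stream, a loop of this shape visits each object exactly once and then stops.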