Package opennlp.tools.formats
Class Conll02NameSampleStream
- java.lang.Object
-
- opennlp.tools.formats.Conll02NameSampleStream
-
- All Implemented Interfaces:
AutoCloseable,ObjectStream<NameSample>
@Internal public class Conll02NameSampleStream extends Object implements ObjectStream<NameSample>
Parser for the Dutch and Spanish ner training files of the CONLL 2002 shared task.The Dutch data has a
DOCSTARTtag to mark article boundaries, adaptive data in the feature generators will be cleared before every article.
The Spanish data does not contain article boundaries, adaptive data will be cleared for every sentence.The data contains four named entity types: Person, Organization, Location and Misc.
Data can be found on this web site.
Note: Do not use this class, internal use only!
-
-
Nested Class Summary
Nested Classes Modifier and Type Class Description static classConll02NameSampleStream.LANGUAGE
-
Field Summary
Fields Modifier and Type Field Description static StringDOCSTARTstatic intGENERATE_LOCATION_ENTITIESstatic intGENERATE_MISC_ENTITIESstatic intGENERATE_ORGANIZATION_ENTITIESstatic intGENERATE_PERSON_ENTITIES
-
Constructor Summary
Constructors Constructor Description Conll02NameSampleStream(Conll02NameSampleStream.LANGUAGE lang, InputStreamFactory in, int types)Initializes aConll02NameSampleStream.Conll02NameSampleStream(Conll02NameSampleStream.LANGUAGE lang, ObjectStream<String> lineStream, int types)Initializes aConll02NameSampleStream.
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description voidclose()Closes theObjectStreamand releases all allocated resources.NameSampleread()Returns the nextObjectStreamobject.voidreset()Repositions the stream at the beginning and the previously seen object sequence will be repeated exactly.
-
-
-
Field Detail
-
GENERATE_PERSON_ENTITIES
public static final int GENERATE_PERSON_ENTITIES
- See Also:
- Constant Field Values
-
GENERATE_ORGANIZATION_ENTITIES
public static final int GENERATE_ORGANIZATION_ENTITIES
- See Also:
- Constant Field Values
-
GENERATE_LOCATION_ENTITIES
public static final int GENERATE_LOCATION_ENTITIES
- See Also:
- Constant Field Values
-
GENERATE_MISC_ENTITIES
public static final int GENERATE_MISC_ENTITIES
- See Also:
- Constant Field Values
-
DOCSTART
public static final String DOCSTART
- See Also:
- Constant Field Values
-
-
Constructor Detail
-
Conll02NameSampleStream
public Conll02NameSampleStream(Conll02NameSampleStream.LANGUAGE lang, ObjectStream<String> lineStream, int types)
Initializes aConll02NameSampleStream.- Parameters:
lang- The language of the CONLL 02 data.lineStream- AnObjectStreamover the lines in the CONLL 02 data file.types- The entity types to include in the Name Sample object stream.
-
Conll02NameSampleStream
public Conll02NameSampleStream(Conll02NameSampleStream.LANGUAGE lang, InputStreamFactory in, int types) throws IOException
Initializes aConll02NameSampleStream.- Parameters:
lang- The language of the CONLL 02 data.in- TheInputStreamFactoryfor the input file.types- The entity types to include in the Name Sample object stream.- Throws:
IOException- Thrown if IO errors occurred.
-
-
Method Detail
-
read
public NameSample read() throws IOException
Description copied from interface:ObjectStreamReturns the nextObjectStreamobject. Calling this method repeatedly until it returnsnullwill return each object from the underlying source exactly once.- Specified by:
readin interfaceObjectStream<NameSample>- Returns:
- The next object or
nullto signal that the stream is exhausted. - Throws:
IOException- Thrown if there is an error during reading.
-
reset
public void reset() throws IOException, UnsupportedOperationExceptionDescription copied from interface:ObjectStreamRepositions the stream at the beginning and the previously seen object sequence will be repeated exactly. This method can be used to re-read the stream if multiple passes over the objects are required.The implementation of this method is optional.
- Specified by:
resetin interfaceObjectStream<NameSample>- Throws:
IOException- Thrown if there is an error during resetting the stream.UnsupportedOperationException- Thrown if thereset()is not supported. By default, this is the case.
-
close
public void close() throws IOExceptionDescription copied from interface:ObjectStreamCloses theObjectStreamand releases all allocated resources. After close was called, it's not allowed to callObjectStream.read()orObjectStream.reset().- Specified by:
closein interfaceAutoCloseable- Specified by:
closein interfaceObjectStream<NameSample>- Throws:
IOException- Thrown if there is an error during closing the stream.
-
-