Class EvalitaNameSampleStream
- All Implemented Interfaces:
- AutoCloseable, ObjectStream<NameSample>
Because the data does not contain article boundaries, adaptive data will be cleared for every sentence.
Named Entities are annotated in the IOB2 format (as used in the CoNLL 2002 shared task).
The Named Entity tag consists of two parts:
1. The IOB2 tag: 'B' (for 'begin') denotes the first token of a Named Entity, 'I' (for 'inside') is used for all other tokens in a Named Entity, and 'O' (for 'outside') is used for all other words.
2. The Entity type tag: PER (for Person), ORG (for Organization), GPE (for Geo-Political Entity), or LOC (for Location).
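The tagging scheme described above can be illustrated with a small decoder. This is only a sketch and not this class's implementation: it assumes the two tag parts are joined with a hyphen (for example `B-PER`), and the class name `Iob2SpanSketch` and its `decode` method are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;

class Iob2SpanSketch {

    /** Converts IOB2 tags into "TYPE[start,end)" strings; end is exclusive. */
    static List<String> decode(String[] tags) {
        List<String> spans = new ArrayList<>();
        int start = -1;      // index of the open entity's first token, -1 if none
        String type = null;  // entity type of the open entity

        for (int i = 0; i < tags.length; i++) {
            String tag = tags[i];
            // Assumption: tags look like "B-PER" / "I-PER" / "O".
            boolean continues = tag.startsWith("I-") && start >= 0
                    && tag.substring(2).equals(type);
            if (continues) {
                continue; // token extends the open entity
            }
            if (start >= 0) { // any other tag closes the open entity
                spans.add(type + "[" + start + "," + i + ")");
                start = -1;
                type = null;
            }
            if (tag.startsWith("B-")) { // 'B' opens a new entity
                start = i;
                type = tag.substring(2);
            }
            // 'O' tags and stray 'I-' tags without a matching 'B-' open nothing.
        }
        if (start >= 0) { // close an entity that runs to the end of the sentence
            spans.add(type + "[" + start + "," + tags.length + ")");
        }
        return spans;
    }

    public static void main(String[] args) {
        String[] tags = {"B-PER", "I-PER", "O", "B-GPE", "B-ORG", "I-ORG"};
        System.out.println(decode(tags)); // [PER[0,2), GPE[3,4), ORG[4,6)]
    }
}
```

Note that two adjacent entities stay separate because the second one starts with its own 'B' tag, which is the property that distinguishes IOB2 from plain IOB.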
Each file consists of four columns separated by a single blank, containing respectively the token, the Elsnet PoS tag, the Adige news story to which the token belongs, and the Named Entity tag.
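As a rough illustration of the column layout, the following sketch splits one data line into its four fields. The class name `EvalitaLineSketch`, the method `splitColumns`, and the sample line (including its PoS tag and story identifier) are invented for this example and are not taken from the corpus or from this class.

```java
class EvalitaLineSketch {

    /** Splits one data line into token, PoS tag, news story id and entity tag. */
    static String[] splitColumns(String line) {
        String[] fields = line.split(" ");
        if (fields.length != 4) {
            throw new IllegalArgumentException(
                    "expected 4 columns but found " + fields.length + ": " + line);
        }
        return fields;
    }

    public static void main(String[] args) {
        // Invented sample line; the PoS tag and story id are placeholders.
        String[] f = splitColumns("Trento SPN story_0001 B-GPE");
        System.out.println("token=" + f[0] + " pos=" + f[1]
                + " story=" + f[2] + " tag=" + f[3]);
    }
}
```

Rejecting lines with the wrong column count is a design choice of this sketch; the source does not say how malformed lines are handled.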
Data can be found on this web site.
Note: Do not use this class, internal use only!
- 
Nested Class Summary
Nested Classes:
- EvalitaNameSampleStream.LANGUAGE
- 
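Field Summary
Fields (Modifier and Type, Field):
- static final String DOCSTART
- static final int GENERATE_GPE_ENTITIES
- static final int GENERATE_LOCATION_ENTITIES
- static final int GENERATE_ORGANIZATION_ENTITIES
- static final int GENERATE_PERSON_ENTITIES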
- 
Constructor Summary
Constructors:
- EvalitaNameSampleStream(EvalitaNameSampleStream.LANGUAGE lang, InputStreamFactory in, int types)
- EvalitaNameSampleStream(EvalitaNameSampleStream.LANGUAGE lang, ObjectStream<String> lineStream, int types)
- 
Method Summary
Methods (Modifier and Type, Method, Description):
- void close(): Closes the ObjectStream and releases all allocated resources.
- NameSample read(): Returns the next ObjectStream object.
- void reset(): Repositions the stream at the beginning and the previously seen object sequence will be repeated exactly.
- 
Field Details
- GENERATE_PERSON_ENTITIES
public static final int GENERATE_PERSON_ENTITIES
- See Also: Constant Field Values
- GENERATE_ORGANIZATION_ENTITIES
public static final int GENERATE_ORGANIZATION_ENTITIES
- See Also: Constant Field Values
- GENERATE_LOCATION_ENTITIES
public static final int GENERATE_LOCATION_ENTITIES
- See Also: Constant Field Values
- GENERATE_GPE_ENTITIES
public static final int GENERATE_GPE_ENTITIES
- See Also: Constant Field Values
- DOCSTART
public static final String DOCSTART
- See Also: Constant Field Values
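The constructors take a single `int types` argument alongside four `GENERATE_*` constants, which suggests that the constants are combined as bit flags. The sketch below shows that pattern under this assumption. The constant values here are hypothetical stand-ins chosen for the example, since this page does not list the real values, and `EntityTypeFlagsSketch` and `isEnabled` are invented names.

```java
class EntityTypeFlagsSketch {

    // Hypothetical stand-in values; the real constant values are not shown here.
    static final int PERSON = 1;
    static final int ORGANIZATION = 1 << 1;
    static final int LOCATION = 1 << 2;
    static final int GPE = 1 << 3;

    /** Returns true if the given flag is set in the combined types mask. */
    static boolean isEnabled(int types, int flag) {
        return (types & flag) != 0;
    }

    public static void main(String[] args) {
        // Request person and geo-political entities only.
        int types = PERSON | GPE;
        System.out.println(isEnabled(types, PERSON));       // true
        System.out.println(isEnabled(types, ORGANIZATION)); // false
        System.out.println(isEnabled(types, GPE));          // true
    }
}
```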
 
 
- 
- 
Constructor Details
- EvalitaNameSampleStream
public EvalitaNameSampleStream(EvalitaNameSampleStream.LANGUAGE lang, ObjectStream<String> lineStream, int types)
- EvalitaNameSampleStream
public EvalitaNameSampleStream(EvalitaNameSampleStream.LANGUAGE lang, InputStreamFactory in, int types) throws IOException
- Throws:
- IOException
 
 
- 
- 
Method Details
- read
Description copied from interface: ObjectStream
Returns the next ObjectStream object. Calling this method repeatedly until it returns null will return each object from the underlying source exactly once.
- Specified by:
- read in interface ObjectStream<NameSample>
- Returns:
- The next object, or null to signal that the stream is exhausted.
- Throws:
- IOException - Thrown if there is an error during reading.
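The read-until-null contract described for read() can be sketched without the library. The interface `SampleSource` below is a hypothetical stand-in that mirrors only the read() part of the contract, and `readAll` and `of` are helper names invented for this example.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;

class ReadLoopSketch {

    /** Hypothetical stand-in for the read() part of the stream contract. */
    interface SampleSource<T> {
        T read() throws IOException; // returns null once the source is exhausted
    }

    /** Calls read() until it returns null and collects each object once. */
    static <T> List<T> readAll(SampleSource<T> source) throws IOException {
        List<T> out = new ArrayList<>();
        T sample;
        while ((sample = source.read()) != null) {
            out.add(sample);
        }
        return out;
    }

    /** Wraps a fixed list in a source, purely for demonstration. */
    static <T> SampleSource<T> of(List<T> items) {
        Iterator<T> it = items.iterator();
        return () -> it.hasNext() ? it.next() : null;
    }

    public static void main(String[] args) throws IOException {
        SampleSource<String> source = of(Arrays.asList("sentence 1", "sentence 2"));
        System.out.println(readAll(source)); // [sentence 1, sentence 2]
        System.out.println(source.read());   // null, the source is exhausted
    }
}
```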
 
- 
- reset
Description copied from interface: ObjectStream
Repositions the stream at the beginning so that the previously seen object sequence will be repeated exactly. This method can be used to re-read the stream if multiple passes over the objects are required.
The implementation of this method is optional.
- Specified by:
- reset in interface ObjectStream<NameSample>
- Throws:
- IOException - Thrown if there is an error during resetting the stream.
- UnsupportedOperationException - Thrown if reset() is not supported. By default, this is the case.
 
- 
- close
Description copied from interface: ObjectStream
Closes the ObjectStream and releases all allocated resources. After close has been called, calling ObjectStream.read() or ObjectStream.reset() is not allowed.
- Specified by:
- close in interface AutoCloseable
- Specified by:
- close in interface ObjectStream<NameSample>
- Throws:
- IOException - Thrown if there is an error during closing the stream.
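Because the stream is AutoCloseable, a multi-pass reader can rely on try-with-resources to release it. The sketch below uses a hypothetical in-memory `ListStream` rather than this class. Throwing an IOException when read() or reset() is called after close() is a choice made for this sketch; the contract only states that such calls are not allowed.

```java
import java.io.IOException;
import java.util.Arrays;
import java.util.List;

class StreamLifecycleSketch {

    /** Hypothetical in-memory stream mirroring read(), reset() and close(). */
    static final class ListStream implements AutoCloseable {
        private final List<String> items;
        private int next = 0;
        private boolean closed = false;

        ListStream(List<String> items) {
            this.items = items;
        }

        String read() throws IOException {
            ensureOpen();
            return next < items.size() ? items.get(next++) : null;
        }

        void reset() throws IOException {
            ensureOpen();
            next = 0; // the same sequence will be returned again
        }

        @Override
        public void close() throws IOException {
            closed = true;
        }

        private void ensureOpen() throws IOException {
            // Sketch-specific choice: fail loudly on use after close.
            if (closed) {
                throw new IOException("stream is closed");
            }
        }
    }

    /** Reads the stream to exhaustion and returns the number of objects seen. */
    static int countPass(ListStream stream) throws IOException {
        int n = 0;
        while (stream.read() != null) {
            n++;
        }
        return n;
    }

    public static void main(String[] args) throws IOException {
        // try-with-resources calls close() even if a pass throws.
        try (ListStream stream = new ListStream(Arrays.asList("a", "b", "c"))) {
            int first = countPass(stream);
            stream.reset();
            int second = countPass(stream);
            System.out.println(first + " " + second); // 3 3
        }
    }
}
```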
 
 
-