Class EvalitaNameSampleStream
- java.lang.Object
-
- opennlp.tools.formats.EvalitaNameSampleStream
-
- All Implemented Interfaces:
AutoCloseable,ObjectStream<NameSample>
public class EvalitaNameSampleStream extends Object implements ObjectStream<NameSample>
Parser for the Italian NER training files of the Evalita 2007 and 2009 NER shared tasks.The data does not contain article boundaries, adaptive data will be cleared for every sentence.
Named Entities are annotated in the IOB2 format (as used in CoNLL 2002 shared task)
The Named Entity tag consists of two parts: 1. The IOB2 tag: 'B' (for 'begin') denotes the first token of a Named Entity, I (for 'inside') is used for all other tokens in a Named Entity, and 'O' (for 'outside') is used for all other words; 2. The Entity type tag: PER (for Person), ORG (for Organization), GPE (for Geo-Political Entity), or LOC (for Location).
Each file consists of four columns separated by a blank, containing respectively the token, the Elsnet PoS-tag, the Adige news story to which the token belongs, and the Named Entity tag.
Data can be found on this web site:
http://www.evalita.itNote: Do not use this class, internal use only!
-
-
Nested Class Summary
Nested Classes Modifier and Type Class Description static classEvalitaNameSampleStream.LANGUAGE
-
Field Summary
Fields Modifier and Type Field Description static StringDOCSTARTstatic intGENERATE_GPE_ENTITIESstatic intGENERATE_LOCATION_ENTITIESstatic intGENERATE_ORGANIZATION_ENTITIESstatic intGENERATE_PERSON_ENTITIES
-
Constructor Summary
Constructors Constructor Description EvalitaNameSampleStream(EvalitaNameSampleStream.LANGUAGE lang, InputStreamFactory in, int types)EvalitaNameSampleStream(EvalitaNameSampleStream.LANGUAGE lang, ObjectStream<String> lineStream, int types)
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description voidclose()Closes theObjectStreamand releases all allocated resources.NameSampleread()Returns the next object.voidreset()Repositions the stream at the beginning and the previously seen object sequence will be repeated exactly.
-
-
-
Field Detail
-
GENERATE_PERSON_ENTITIES
public static final int GENERATE_PERSON_ENTITIES
- See Also:
- Constant Field Values
-
GENERATE_ORGANIZATION_ENTITIES
public static final int GENERATE_ORGANIZATION_ENTITIES
- See Also:
- Constant Field Values
-
GENERATE_LOCATION_ENTITIES
public static final int GENERATE_LOCATION_ENTITIES
- See Also:
- Constant Field Values
-
GENERATE_GPE_ENTITIES
public static final int GENERATE_GPE_ENTITIES
- See Also:
- Constant Field Values
-
DOCSTART
public static final String DOCSTART
- See Also:
- Constant Field Values
-
-
Constructor Detail
-
EvalitaNameSampleStream
public EvalitaNameSampleStream(EvalitaNameSampleStream.LANGUAGE lang, ObjectStream<String> lineStream, int types)
-
EvalitaNameSampleStream
public EvalitaNameSampleStream(EvalitaNameSampleStream.LANGUAGE lang, InputStreamFactory in, int types) throws IOException
- Throws:
IOException
-
-
Method Detail
-
read
public NameSample read() throws IOException
Description copied from interface:ObjectStreamReturns the next object. Calling this method repeatedly until it returns null will return each object from the underlying source exactly once.- Specified by:
readin interfaceObjectStream<NameSample>- Returns:
- the next object or null to signal that the stream is exhausted
- Throws:
IOException- if there is an error during reading
-
reset
public void reset() throws IOException, UnsupportedOperationExceptionDescription copied from interface:ObjectStreamRepositions the stream at the beginning and the previously seen object sequence will be repeated exactly. This method can be used to re-read the stream if multiple passes over the objects are required. The implementation of this method is optional.- Specified by:
resetin interfaceObjectStream<NameSample>- Throws:
IOException- if there is an error during reseting the streamUnsupportedOperationException
-
close
public void close() throws IOExceptionDescription copied from interface:ObjectStreamCloses theObjectStreamand releases all allocated resources. After close was called its not allowed to call read or reset.- Specified by:
closein interfaceAutoCloseable- Specified by:
closein interfaceObjectStream<NameSample>- Throws:
IOException- if there is an error during closing the stream
-
-