Class EvalitaNameSampleStream
- All Implemented Interfaces:
AutoCloseable, opennlp.tools.util.ObjectStream<opennlp.tools.namefind.NameSample>
The data does not contain article boundaries, so adaptive data will be cleared for every sentence.
Named Entities are annotated in the IOB2 format (as used in the CoNLL 2002 shared task).
The Named Entity tag consists of two parts:
1. The IOB2 tag: 'B' (for 'begin') denotes the first token of a Named Entity, 'I' (for 'inside') is used for all other tokens in a Named Entity, and 'O' (for 'outside') is used for all other words.
2. The Entity type tag: PER (for Person), ORG (for Organization), GPE (for Geo-Political Entity), or LOC (for Location).
Each file consists of four columns separated by a blank, containing respectively the token, the Elsnet PoS-tag, the Adige news story to which the token belongs, and the Named Entity tag.
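The four-column layout and the IOB2 tags described above can be turned into entity spans with a short loop. The following is a minimal sketch, not the parser used by this class. It assumes the two tag parts are joined with a hyphen (for example B-PER), that one sentence is processed at a time, and that columns are separated by a single blank. The class Iob2Sketch, its Entity type, and the sample tokens with their placeholder PoS and story values are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;

class Iob2Sketch {

    /** An entity covering tokens [start, end) with the given type. */
    static final class Entity {
        final String type;
        final int start;
        final int end;

        Entity(String type, int start, int end) {
            this.type = type;
            this.start = start;
            this.end = end;
        }

        @Override
        public String toString() {
            return type + "[" + start + "," + end + ")";
        }
    }

    /** Reads one sentence of "token pos story tag" lines and returns its entities. */
    static List<Entity> extract(List<String> lines) {
        List<Entity> entities = new ArrayList<>();
        String openType = null; // type of the entity being built, or null
        int start = -1;
        for (int i = 0; i < lines.size(); i++) {
            String[] cols = lines.get(i).split(" ");
            if (cols.length != 4) {
                throw new IllegalArgumentException("Expected 4 columns: " + lines.get(i));
            }
            String tag = cols[3];
            boolean continuesOpen = openType != null && tag.equals("I-" + openType);
            if (openType != null && !continuesOpen) {
                // Any tag other than a matching I- tag closes the open entity.
                entities.add(new Entity(openType, start, i));
                openType = null;
            }
            if (tag.startsWith("B-")) {
                openType = tag.substring(2);
                start = i;
            }
            // "O" tags, and I- tags with no matching open entity, start nothing.
        }
        if (openType != null) {
            entities.add(new Entity(openType, start, lines.size()));
        }
        return entities;
    }

    public static void main(String[] args) {
        // Placeholder PoS tags and story ids; only the fourth column matters here.
        List<String> sentence = List.of(
            "Mario POS story1 B-PER",
            "Rossi POS story1 I-PER",
            "vive POS story1 O",
            "a POS story1 O",
            "Trento POS story1 B-GPE");
        System.out.println(extract(sentence));
    }
}
```

For the sample sentence, the sketch reports a PER entity over tokens 0 to 1 and a GPE entity at token 4. A B tag directly after another entity closes the previous span and opens a new one, which is what distinguishes IOB2 from the older IOB1 scheme.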
Data can be found on this web site.
Note: This class is for internal use only. Do not use it!
-
Nested Class Summary
Nested Classes
EvalitaNameSampleStream.LANGUAGE
-
Field Summary
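Fields
Modifier and Type      Field
static final String    DOCSTART
static final int       GENERATE_GPE_ENTITIES
static final int       GENERATE_LOCATION_ENTITIES
static final int       GENERATE_ORGANIZATION_ENTITIES
static final int       GENERATE_PERSON_ENTITIES
-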
Constructor Summary
Constructors
EvalitaNameSampleStream(EvalitaNameSampleStream.LANGUAGE lang, opennlp.tools.util.InputStreamFactory in, int types)
EvalitaNameSampleStream(EvalitaNameSampleStream.LANGUAGE lang, opennlp.tools.util.ObjectStream<String> lineStream, int types)
-
Method Summary
-
Field Details
-
DOCSTART
public static final String DOCSTART
-
GENERATE_PERSON_ENTITIES
public static final int GENERATE_PERSON_ENTITIES
-
GENERATE_ORGANIZATION_ENTITIES
public static final int GENERATE_ORGANIZATION_ENTITIES
-
GENERATE_LOCATION_ENTITIES
public static final int GENERATE_LOCATION_ENTITIES
-
GENERATE_GPE_ENTITIES
public static final int GENERATE_GPE_ENTITIES
-
Constructor Details
-
EvalitaNameSampleStream
public EvalitaNameSampleStream(EvalitaNameSampleStream.LANGUAGE lang, opennlp.tools.util.ObjectStream<String> lineStream, int types)
-
EvalitaNameSampleStream
public EvalitaNameSampleStream(EvalitaNameSampleStream.LANGUAGE lang, opennlp.tools.util.InputStreamFactory in, int types) throws IOException
- Throws:
IOException
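Both constructors take an int types argument, and the GENERATE_* constants suggest that it acts as a bit mask selecting which entity types are emitted. The sketch below illustrates that pattern under stated assumptions: the flags are taken to be distinct single bits combined with bitwise OR, and unselected entity tags are rewritten to 'O'. The class EntityTypeFlags, its constant values, and the filter method are hypothetical stand-ins and are not taken from the OpenNLP implementation.

```java
class EntityTypeFlags {

    // Hypothetical single-bit values; this page does not list the real GENERATE_* values.
    static final int PERSON = 1;
    static final int ORGANIZATION = 1 << 1;
    static final int LOCATION = 1 << 2;
    static final int GPE = 1 << 3;

    /** Maps an entity type suffix to its flag, or 0 if the type is unknown. */
    static int flagFor(String type) {
        switch (type) {
            case "PER": return PERSON;
            case "ORG": return ORGANIZATION;
            case "LOC": return LOCATION;
            case "GPE": return GPE;
            default: return 0;
        }
    }

    /** Keeps a tag whose type is selected in the mask; rewrites any other tag to "O". */
    static String filter(String tag, int types) {
        if (tag.length() < 3) {
            return "O"; // "O" itself, or anything too short to carry a type
        }
        String type = tag.substring(2); // assumes the hyphenated form, e.g. "B-PER"
        return (types & flagFor(type)) != 0 ? tag : "O";
    }

    public static void main(String[] args) {
        int types = PERSON | GPE; // select persons and geo-political entities only
        System.out.println(filter("B-PER", types));
        System.out.println(filter("B-ORG", types));
    }
}
```

With the mask PERSON | GPE, the tag B-PER passes through unchanged and B-ORG becomes O. Encoding the selection as bits lets a single int carry any combination of the four entity types.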
-
-
Method Details
-
read
- Specified by:
read in interface opennlp.tools.util.ObjectStream<opennlp.tools.namefind.NameSample>
- Throws:
IOException
-
reset
- Specified by:
reset in interface opennlp.tools.util.ObjectStream<opennlp.tools.namefind.NameSample>
- Throws:
IOException
UnsupportedOperationException
-
close
- Specified by:
close in interface AutoCloseable
- Specified by:
close in interface opennlp.tools.util.ObjectStream<opennlp.tools.namefind.NameSample>
- Throws:
IOException
-