Package opennlp.tools.formats
Klasse Conll02NameSampleStream
java.lang.Object
opennlp.tools.formats.Conll02NameSampleStream
- Alle implementierten Schnittstellen:
AutoCloseable
,ObjectStream<NameSample>
Parser for the Dutch and Spanish ner training files of the CONLL 2002 shared task.
The Dutch data has a DOCSTART
tag to mark article boundaries,
adaptive data in the feature generators will be cleared before every article.
The Spanish data does not contain article boundaries,
adaptive data will be cleared for every sentence.
The data contains four named entity types: Person, Organization, Location and Misc.
Data can be found on this web site.
Note: Do not use this class, internal use only!
-
Verschachtelte Klassen - Übersicht
Verschachtelte Klassen -
Feldübersicht
FelderModifizierer und TypFeldBeschreibungstatic final String
static final int
static final int
static final int
static final int
-
Konstruktorübersicht
KonstruktorenKonstruktorBeschreibungConll02NameSampleStream
(Conll02NameSampleStream.LANGUAGE lang, InputStreamFactory in, int types) Initializes aConll02NameSampleStream
.Conll02NameSampleStream
(Conll02NameSampleStream.LANGUAGE lang, ObjectStream<String> lineStream, int types) Initializes aConll02NameSampleStream
. -
Methodenübersicht
Modifizierer und TypMethodeBeschreibungvoid
close()
Closes theObjectStream
and releases all allocated resources.read()
Returns the nextObjectStream
object.void
reset()
Repositions the stream at the beginning and the previously seen object sequence will be repeated exactly.
-
Felddetails
-
GENERATE_PERSON_ENTITIES
public static final int GENERATE_PERSON_ENTITIES- Siehe auch:
-
GENERATE_ORGANIZATION_ENTITIES
public static final int GENERATE_ORGANIZATION_ENTITIES- Siehe auch:
-
GENERATE_LOCATION_ENTITIES
public static final int GENERATE_LOCATION_ENTITIES- Siehe auch:
-
GENERATE_MISC_ENTITIES
public static final int GENERATE_MISC_ENTITIES- Siehe auch:
-
DOCSTART
- Siehe auch:
-
-
Konstruktordetails
-
Conll02NameSampleStream
public Conll02NameSampleStream(Conll02NameSampleStream.LANGUAGE lang, ObjectStream<String> lineStream, int types) Initializes aConll02NameSampleStream
.- Parameter:
lang
- The language of the CONLL 02 data.lineStream
- AnObjectStream<String>
over the lines in the CONLL 02 data file.types
- The entity types to include in the Name Sample object stream.
-
Conll02NameSampleStream
public Conll02NameSampleStream(Conll02NameSampleStream.LANGUAGE lang, InputStreamFactory in, int types) throws IOException Initializes aConll02NameSampleStream
.- Parameter:
lang
- The language of the CONLL 02 data.in
- TheInputStreamFactory
for the input file.types
- The entity types to include in the Name Sample object stream.- Löst aus:
IOException
- Thrown if IO errors occurred.
-
-
Methodendetails
-
read
Beschreibung aus Schnittstelle kopiert:ObjectStream
Returns the nextObjectStream
object. Calling this method repeatedly until it returnsnull
will return each object from the underlying source exactly once.- Angegeben von:
read
in SchnittstelleObjectStream<NameSample>
- Gibt zurück:
- The next object or
null
to signal that the stream is exhausted. - Löst aus:
IOException
- Thrown if there is an error during reading.
-
reset
Beschreibung aus Schnittstelle kopiert:ObjectStream
Repositions the stream at the beginning and the previously seen object sequence will be repeated exactly. This method can be used to re-read the stream if multiple passes over the objects are required.The implementation of this method is optional.
- Angegeben von:
reset
in SchnittstelleObjectStream<NameSample>
- Löst aus:
IOException
- Thrown if there is an error during resetting the stream.UnsupportedOperationException
- Thrown if thereset()
is not supported. By default, this is the case.
-
close
Beschreibung aus Schnittstelle kopiert:ObjectStream
Closes theObjectStream
and releases all allocated resources. After close was called, it's not allowed to callObjectStream.read()
orObjectStream.reset()
.- Angegeben von:
close
in SchnittstelleAutoCloseable
- Angegeben von:
close
in SchnittstelleObjectStream<NameSample>
- Löst aus:
IOException
- Thrown if there is an error during closing the stream.
-