Class ADNameSampleStream

java.lang.Object
opennlp.tools.formats.ad.ADNameSampleStream
All Implemented Interfaces:
AutoCloseable, ObjectStream<NameSample>

@Internal public class ADNameSampleStream extends Object implements ObjectStream<NameSample>
Parser for Floresta Sita(c)tica Arvores Deitadas corpus, output to for the Portuguese NER training.

The data contains four named entity types: Person, Organization, Group, Place, Event, ArtProd, Abstract, Thing, Time and Numeric.

Data can be found on this web site.

Information about the format:
Susana Afonso. "Árvores deitadas: Descrição do formato e das opções de análise na Floresta Sintáctica".
12 de Fevereiro de 2006.

Detailed info about the NER tagset.

Note: Do not use this class, internal use only!