Class ADChunkSampleStream
java.lang.Object
opennlp.tools.formats.ad.ADChunkSampleStream
- All Implemented Interfaces:
AutoCloseable, opennlp.tools.util.ObjectStream<opennlp.tools.chunker.ChunkSample>
@Internal
public class ADChunkSampleStream
extends Object
implements opennlp.tools.util.ObjectStream<opennlp.tools.chunker.ChunkSample>
Parser for the Floresta Sintá(c)tica Árvores Deitadas corpus, producing output for
Portuguese chunker training.
The heuristics used to extract chunks were based on the paper 'A Machine Learning
Approach to Portuguese Clause Identification' (Eraldo Fernandes, Cicero
Santos and Ruy Milidiú).
Data can be found on this web site.
Information about the format:
Susana Afonso.
"Árvores deitadas: Descrição do formato e das opções de análise na Floresta Sintáctica".
12 de Fevereiro de 2006.
Detailed info about the NER tagset.
Note: Do not use this class, internal use only!
-
Field Summary
Fields -
Constructor Summary
Constructors
ADChunkSampleStream(opennlp.tools.util.InputStreamFactory in, String charsetName)
Instantiates an ADChunkSampleStream from an InputStreamFactory.
ADChunkSampleStream(opennlp.tools.util.ObjectStream<String> lineStream)
Instantiates an ADChunkSampleStream from an opennlp.tools.util.ObjectStream<String>, which could be a PlainTextByLineStream object.
-
Method Summary
-
Field Details
-
OTHER
- See Also:
-
-
Constructor Details
-
ADChunkSampleStream
Instantiates an ADChunkSampleStream from an opennlp.tools.util.ObjectStream<String>, which could be a PlainTextByLineStream object.
- Parameters:
lineStream - An opennlp.tools.util.ObjectStream<String> as input.
-
ADChunkSampleStream
public ADChunkSampleStream(opennlp.tools.util.InputStreamFactory in, String charsetName) throws IOException
Instantiates an ADChunkSampleStream from an InputStreamFactory.
- Parameters:
in - The InputStreamFactory for the corpus.
charsetName - The charset to use for reading the corpus.
- Throws:
IOException
-
-
Method Details
-
read
- Specified by:
read in interface opennlp.tools.util.ObjectStream<opennlp.tools.chunker.ChunkSample>
- Throws:
IOException
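As a rough illustration of how a stream with this read contract is typically consumed: ObjectStream.read() returns null once the underlying source is exhausted, so callers loop until null. The sketch below does not use the OpenNLP classes. SampleStream, drain and of are hypothetical stand-ins invented for this example so that it runs without the library on the classpath.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;

public class ReadLoopSketch {

    /** Hypothetical stand-in that mirrors only the read/close shape of ObjectStream<T>. */
    interface SampleStream<T> extends AutoCloseable {
        /** Returns the next sample, or null once the stream is exhausted. */
        T read() throws IOException;

        @Override
        void close() throws IOException;
    }

    /** Drains a stream into a list, stopping at the first null returned by read(). */
    static <T> List<T> drain(SampleStream<T> stream) throws IOException {
        List<T> out = new ArrayList<>();
        T sample;
        while ((sample = stream.read()) != null) {
            out.add(sample);
        }
        return out;
    }

    /** In-memory stream over a fixed list, used here only to exercise drain(). */
    static <T> SampleStream<T> of(List<T> items) {
        Iterator<T> it = items.iterator();
        return new SampleStream<T>() {
            @Override
            public T read() {
                return it.hasNext() ? it.next() : null;
            }

            @Override
            public void close() {
                // Nothing to release for an in-memory list.
            }
        };
    }

    public static void main(String[] args) throws IOException {
        // try-with-resources closes the stream even if read() throws.
        try (SampleStream<String> stream = of(List.of("sample-1", "sample-2"))) {
            System.out.println(drain(stream).size());
        }
    }
}
```

With the real class, the same loop shape would apply to ChunkSample objects. Because the stream is AutoCloseable, try-with-resources is a natural fit.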
-
convertFuncTag
-
setStart
public void setStart(int aStart)
-
setEnd
public void setEnd(int aEnd)
-
reset
- Specified by:
reset in interface opennlp.tools.util.ObjectStream<opennlp.tools.chunker.ChunkSample>
- Throws:
IOException
UnsupportedOperationException
-
close
- Specified by:
close in interface AutoCloseable
- Specified by:
close in interface opennlp.tools.util.ObjectStream<opennlp.tools.chunker.ChunkSample>
- Throws:
IOException
-