Package opennlp.tools.formats.masc
Class MascDocumentStream
java.lang.Object
opennlp.tools.formats.masc.MascDocumentStream
- All Implemented Interfaces:
- AutoCloseable, ObjectStream<MascDocument>
- 
Constructor Summary
Constructors
- MascDocumentStream(File mascCorpusDirectory)
- MascDocumentStream(File mascCorpusDirectory, boolean searchRecursive, FileFilter fileFilter): Creates a MascDocumentStream to read the documents from a given directory.
- 
Method Summary
- 
Constructor Details
- 
MascDocumentStream
public MascDocumentStream(File mascCorpusDirectory) throws IOException
- Throws:
- IOException
 
- 
MascDocumentStream
public MascDocumentStream(File mascCorpusDirectory, boolean searchRecursive, FileFilter fileFilter) throws IOException
Creates a MascDocumentStream to read the documents from a given directory. This works only if all annotation files mentioned in the headers are present.
- Parameters:
- mascCorpusDirectory - the directory containing all the MASC files
- searchRecursive - whether the search should descend into subdirectories
- fileFilter - a custom file filter to exclude some files, or null to accept everything
- Throws:
- IOException - if any stage of the stream creation fails
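The fileFilter parameter is a standard java.io.FileFilter, so a filter can be built and checked without touching the corpus. The following is a minimal sketch using only the JDK; the class name MascFilterSketch, the helper excludingPrefix, and the sample file names are hypothetical and not part of OpenNLP.

```java
import java.io.File;
import java.io.FileFilter;

class MascFilterSketch {

    // Hypothetical helper: accepts every file or directory whose name
    // does NOT start with the given prefix (for example "." for hidden entries).
    static FileFilter excludingPrefix(String prefix) {
        return file -> !file.getName().startsWith(prefix);
    }

    public static void main(String[] args) {
        FileFilter skipHidden = excludingPrefix(".");

        // The filter only inspects names, so the files need not exist.
        System.out.println(skipHidden.accept(new File("sample.hdr")));
        System.out.println(skipHidden.accept(new File(".DS_Store")));

        // With OpenNLP on the classpath, the filter would be passed as the
        // third constructor argument (the path below is a placeholder):
        // new MascDocumentStream(new File("path/to/masc"), true, skipHidden);
    }
}
```

Passing null instead of a filter accepts every file, as the parameter description states.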
 
 
- 
- 
Method Details
- 
reset
public void reset()
Resets the reading of all documents to the first sentence. Resets the corpus to the first document.
- Specified by:
- reset in interface ObjectStream<MascDocument>
 
- 
read
Returns the next document. The client needs to check whether this document has the necessary annotations.
- Specified by:
- read in interface ObjectStream<MascDocument>
- Returns:
- A corpus document with all its annotations.
- Throws:
- IOException - if anything goes wrong.
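Because the client is responsible for checking annotations, a typical consumer reads documents one at a time and keeps only those that pass a check. The sketch below illustrates that loop with JDK types only. It assumes the usual ObjectStream convention that read() returns null once the stream is exhausted; the Source interface, the readUsable helper, and the ":ne" marker on the sample strings are hypothetical stand-ins, not OpenNLP API.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;
import java.util.function.Predicate;

class ReadLoopSketch {

    // Hypothetical stand-in for the read() part of ObjectStream<T>.
    interface Source<T> {
        T read() throws IOException; // assumed to return null when exhausted
    }

    // Reads until null and keeps only documents that pass the client's check.
    static <T> List<T> readUsable(Source<T> source, Predicate<T> hasNeededAnnotations)
            throws IOException {
        List<T> usable = new ArrayList<>();
        T doc;
        while ((doc = source.read()) != null) {
            if (hasNeededAnnotations.test(doc)) {
                usable.add(doc);
            }
        }
        return usable;
    }

    public static void main(String[] args) throws IOException {
        // Strings stand in for documents; ":ne" marks one that carries
        // the annotation layer this client happens to need.
        Iterator<String> docs = List.of("doc-a:ne", "doc-b", "doc-c:ne").iterator();
        Source<String> source = () -> docs.hasNext() ? docs.next() : null;

        System.out.println(readUsable(source, d -> d.endsWith(":ne")));
        // prints [doc-a:ne, doc-c:ne]
    }
}
```

With the real class, the predicate would inspect a MascDocument instead of a string.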
 
- 
close
public void close()
Removes the corpus from memory.
- Specified by:
- close in interface AutoCloseable
- Specified by:
- close in interface ObjectStream<MascDocument>
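Since the class implements AutoCloseable, it can be managed with try-with-resources, and reset() allows a second pass over the same corpus before it is released. The sketch below shows that lifecycle on a small list-backed stand-in. ListStream, countAll, and isClosed are hypothetical names for illustration; dropping the references in close() is only an assumption about what "removing the corpus from memory" could look like.

```java
import java.util.Iterator;
import java.util.List;

class LifecycleSketch {

    // Hypothetical list-backed stream offering read, reset and close.
    static class ListStream implements AutoCloseable {
        private List<String> items;
        private Iterator<String> cursor;

        ListStream(List<String> items) {
            this.items = items;
            this.cursor = items.iterator();
        }

        String read() {
            return cursor.hasNext() ? cursor.next() : null;
        }

        void reset() {
            cursor = items.iterator(); // start again from the first item
        }

        @Override
        public void close() {
            items = null;  // drop references so the data can be collected
            cursor = null;
        }

        boolean isClosed() {
            return items == null;
        }
    }

    // Consumes the stream from its current position and counts the items.
    static int countAll(ListStream stream) {
        int n = 0;
        while (stream.read() != null) {
            n++;
        }
        return n;
    }

    public static void main(String[] args) {
        ListStream stream = new ListStream(List.of("a", "b", "c"));
        try (ListStream s = stream) {
            int firstPass = countAll(s);
            s.reset();
            int secondPass = countAll(s);
            System.out.println(firstPass + " " + secondPass); // prints 3 3
        }
        System.out.println(stream.isClosed()); // prints true
    }
}
```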
 
 
-