Package opennlp.tools.formats.masc
Class MascDocumentStream
java.lang.Object
opennlp.tools.formats.masc.MascDocumentStream
- All Implemented Interfaces:
AutoCloseable
,ObjectStream<MascDocument>
-
Constructor Summary
ConstructorDescriptionMascDocumentStream
(File mascCorpusDirectory) MascDocumentStream
(File mascCorpusDirectory, boolean searchRecursive, FileFilter fileFilter) Creates a MascDocumentStream to read the documents from a given directory. -
Method Summary
-
Constructor Details
-
MascDocumentStream
- Throws:
IOException
-
MascDocumentStream
public MascDocumentStream(File mascCorpusDirectory, boolean searchRecursive, FileFilter fileFilter) throws IOException Creates a MascDocumentStream to read the documents from a given directory. Works iff all annotation files mentioned in the headers are present.- Parameters:
mascCorpusDirectory
- the directory containing all the MASC filessearchRecursive
- whether the search should go through subdirectoriesfileFilter
- a custom file filter to filter out some files or null to accept anything- Throws:
IOException
- if any stage of the stream creation fails
-
-
Method Details
-
reset
public void reset()Reset the reading of all documents to the first sentence. Reset the corpus to the first document.- Specified by:
reset
in interfaceObjectStream<MascDocument>
-
read
Return the next document. Client needs to check if this document has the necessary annotations.- Specified by:
read
in interfaceObjectStream<MascDocument>
- Returns:
- A corpus document with all its annotations.
- Throws:
IOException
- if anything goes wrong.
-
close
public void close()Remove the corpus from the memory.- Specified by:
close
in interfaceAutoCloseable
- Specified by:
close
in interfaceObjectStream<MascDocument>
-