Class BioNLP2004NameSampleStream

java.lang.Object
opennlp.tools.formats.BioNLP2004NameSampleStream
All Implemented Interfaces:
AutoCloseable, ObjectStream<NameSample>

@Internal public class BioNLP2004NameSampleStream extends Object implements ObjectStream<NameSample>
A sample stream for the training files of the BioNLP/NLPBA 2004 shared task.

The data contains five named entity types:

  • DNA
  • RNA
  • protein
  • cell_type
  • cell_line

Data can be found on this website, or in this GitHub repository.

The BioNLP/NLPBA 2004 data were originally published here,

yet this page was gone when last checked in December 2022.

Note: Do not use this class, internal use only!