Class BioNLP2004NameSampleStream

java.lang.Object
opennlp.tools.formats.BioNLP2004NameSampleStream
All Implemented Interfaces:
AutoCloseable, opennlp.tools.util.ObjectStream<opennlp.tools.namefind.NameSample>

@Internal public class BioNLP2004NameSampleStream extends Object implements opennlp.tools.util.ObjectStream<opennlp.tools.namefind.NameSample>
A sample stream for the training files of the BioNLP/NLPBA 2004 shared task.

The data contains five named entity types:

  • DNA
  • RNA
  • protein
  • cell_type
  • cell_line

Data can be found on this website, or in this GitHub repository.

The BioNLP/NLPBA 2004 data were originally published here,

yet this page was gone when last checked in December 2022.

Note: Do not use this class, internal use only!

  • Field Details

    • GENERATE_DNA_ENTITIES

      public static final int GENERATE_DNA_ENTITIES
      See Also:
    • GENERATE_PROTEIN_ENTITIES

      public static final int GENERATE_PROTEIN_ENTITIES
      See Also:
    • GENERATE_CELLTYPE_ENTITIES

      public static final int GENERATE_CELLTYPE_ENTITIES
      See Also:
    • GENERATE_CELLLINE_ENTITIES

      public static final int GENERATE_CELLLINE_ENTITIES
      See Also:
    • GENERATE_RNA_ENTITIES

      public static final int GENERATE_RNA_ENTITIES
      See Also:
  • Constructor Details

    • BioNLP2004NameSampleStream

      public BioNLP2004NameSampleStream(opennlp.tools.util.InputStreamFactory in, int types) throws IOException
      Parameters:
      in - The InputStreamFactory to use.
      types - The types to use.
      Throws:
      IOException - Thrown if IO errors occurred during initialization.
  • Method Details