Package opennlp.tools.parser
Class Parse
java.lang.Object
opennlp.tools.parser.Parse
- All Implemented Interfaces:
Cloneable, Comparable<Parse>
Data structure for holding parse constituents.
-
Field Summary
-
Constructor Summary
-
Method Summary
Modifier and Type / Method / Description
void add(Parse daughter, HeadRules rules)
static void addNames(String tag, Span[] names, Parse[] tokens)
  Utility method to insert named entities.
void addNextPunctuation(Parse punct)
  Designates that the specified punctuation follows this parse.
void addPreviousPunctuation(Parse punct)
  Designates that the specified punctuation precedes this parse.
void addProb(double logProb)
  Adds the specified logProb to the current log probability of this parse.
Parse adjoin(Parse sister, HeadRules rules)
  Sister adjoins this node's last child and the specified sister node and returns their new parent node.
Parse adjoinRoot(Parse node, HeadRules rules, int parseIndex)
Object clone()
Parse clone(Parse node)
  Clones the right frontier of this Parse up to the specified node.
Parse cloneRoot(Parse node, int parseIndex)
  Clones the right frontier of this root Parse up to and including the specified node.
int compareTo(Parse p)
boolean complete()
boolean equals(Object obj)
void expandTopNode(Parse root)
static void fixPossesives(Parse parse)
int getChildCount()
Parse[] getChildren()
Parse getCommonParent(Parse node)
  Returns the deepest shared parent of this node and the specified node.
String getCoveredText()
StringBuffer getDerivation()
Parse getHead()
int getHeadIndex()
String getLabel()
  Retrieves the label assigned to this parse node during parsing, which specifies how this node will be formed into a constituent.
Collection<Parse> getNextPunctuationSet()
Parse getParent()
Collection<Parse> getPreviousPunctuationSet()
double getProb()
Span getSpan()
Parse[] getTagNodes()
double getTagSequenceProb()
String getText()
Parse[] getTokenNodes()
String getType()
int hashCode()
int indexOf(Parse child)
void insert(Parse constituent)
  Inserts the specified constituent into this parse based on its text span.
boolean isChunk()
void isChunk(boolean ic)
boolean isFlat()
  Indicates whether this parse node contains no sub-constituents.
boolean isPosTag()
  Indicates whether this parse node is a pos-tag.
static Parse parseParse(String parse)
  Parses the specified tree-bank style parse string and returns a Parse structure for that string.
static Parse parseParse(String parse, GapLabeler gl)
static void pruneParse(Parse parse)
  Prunes the specified sentence parse of vacuous productions.
void remove(int index)
void setChild(int index, String label)
  Replaces the child at the specified index with a new child with the specified label.
void setDerivation(StringBuffer derivation)
  Specifies the derivation string to be associated with this parse.
void setLabel(String label)
  Assigns this parse the specified label.
void setNextPunctuation(Collection<Parse> punctSet)
  Sets the punctuation tags which follow this parse.
void setParent(Parse parent)
  Specifies the parent parse node for this constituent.
void setPrevPunctuation(Collection<Parse> punctSet)
  Sets the punctuation tags which precede this parse.
void setType(String type)
  Sets the type of this constituent to the specified type.
void show()
  Prints this parse using Penn Treebank-style formatting.
void show(StringBuffer sb)
  Fills the specified StringBuffer with a string representation of this parse.
void showCodeTree()
  Prints to standard out a representation of this parse which contains hash codes so that parent/child relationships can be explicitly seen.
String toString()
  Represents this Parse in a human-readable way.
String toStringPennTreebank()
void updateHeads(HeadRules rules)
  Computes the head parses for this parse and its sub-parses and stores this information in the parse data structure.
void updateSpan()
static void useFunctionTags(boolean uft)
  Specifies whether function tags should be included as part of the constituent type.
-
Field Details
-
BRACKET_LRB
-
BRACKET_RRB
-
BRACKET_LCB
-
BRACKET_RCB
-
BRACKET_LSB
-
BRACKET_RSB
-
-
Constructor Details
-
Parse
public Parse(String text, Span span, String type, double p, int index)
Initializes a node for the specified text and span of the specified type with probability p and the head index.
- Parameters:
text - The text of the sentence of which this node is a part.
span - The character offsets for this node within the specified text.
type - The constituent label of this node.
p - The probability of this Parse.
index - The token index of the head of this parse.
-
Parse
public Parse(String text, Span span, String type, double p, Parse h)
Initializes a node for the specified text and span of the specified type with probability p and the head h.
- Parameters:
text - The text of the sentence of which this node is a part.
span - The character offsets for this node within the specified text.
type - The constituent label of this node.
p - The probability of this parse.
h - The head token of this parse.
-
-
Method Details
-
clone
public Object clone()
-
clone
public Parse clone(Parse node)
Clones the right frontier of this Parse up to the specified node.
- Parameters:
node - The last node in the right frontier of the parse tree to be cloned.
- Returns:
- A clone of this parse and its right frontier up to and including the specified node.
-
cloneRoot
public Parse cloneRoot(Parse node, int parseIndex)
Clones the right frontier of this root Parse up to and including the specified node.
- Parameters:
node - The last node in the right frontier of the parse tree to be cloned.
parseIndex - The child index of the parse for this root node.
- Returns:
- A clone of this root parse and its right frontier up to and including the specified node.
-
useFunctionTags
public static void useFunctionTags(boolean uft)
Specifies whether function tags should be included as part of the constituent type.
- Parameters:
uft - true if they should be included, false otherwise.
-
setType
public void setType(String type)
Sets the type of this constituent to the specified type.
- Parameters:
type - The type of this constituent.
-
getType
public String getType()
- Returns:
- The constituent label for this node of the parse.
-
getPreviousPunctuationSet
public Collection<Parse> getPreviousPunctuationSet()
- Returns:
- The set of punctuation parses that occur immediately before this parse.
-
addPreviousPunctuation
public void addPreviousPunctuation(Parse punct)
Designates that the specified punctuation precedes this parse.
- Parameters:
punct - The punctuation to be added.
-
getNextPunctuationSet
public Collection<Parse> getNextPunctuationSet()
- Returns:
- The set of punctuation parses that occur immediately after this parse.
-
addNextPunctuation
public void addNextPunctuation(Parse punct)
Designates that the specified punctuation follows this parse.
- Parameters:
punct - The punctuation to be added.
-
setNextPunctuation
public void setNextPunctuation(Collection<Parse> punctSet)
Sets the punctuation tags which follow this parse.
- Parameters:
punctSet - The punctuation tags which follow this parse.
-
setPrevPunctuation
public void setPrevPunctuation(Collection<Parse> punctSet)
Sets the punctuation tags which precede this parse.
- Parameters:
punctSet - The punctuation tags which precede this parse.
-
insert
public void insert(Parse constituent)
Inserts the specified constituent into this parse based on its text span. This method assumes that the specified constituent can be inserted into this parse.
- Parameters:
constituent - The constituent to be inserted.
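The idea of span-based insertion can be illustrated with a small stand-alone sketch. It does not use or reproduce the OpenNLP implementation: `SpanNode` and its fields are hypothetical names introduced here, spans are assumed to be half-open character offsets, and the new constituent is assumed not to cross any existing span.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical span-labelled tree node; a simplified stand-in, not OpenNLP's Parse.
final class SpanNode {
    final int start;   // inclusive character offset
    final int end;     // exclusive character offset
    final String type; // constituent label
    final List<SpanNode> children = new ArrayList<>();

    SpanNode(int start, int end, String type) {
        this.start = start;
        this.end = end;
        this.type = type;
    }

    boolean contains(SpanNode other) {
        return start <= other.start && other.end <= end;
    }

    // Places the constituent under the deepest node whose span covers it.
    // Assumes the constituent does not cross any existing span.
    void insert(SpanNode constituent) {
        if (!contains(constituent)) {
            throw new IllegalArgumentException("constituent span lies outside this node");
        }
        // If an existing child covers the constituent, it belongs further down.
        for (SpanNode child : children) {
            if (child.contains(constituent)) {
                child.insert(constituent);
                return;
            }
        }
        // Otherwise the constituent becomes a child here and adopts every
        // existing child that falls inside its span.
        List<SpanNode> remaining = new ArrayList<>();
        int position = 0; // number of remaining children that end before the constituent starts
        for (SpanNode child : children) {
            if (constituent.contains(child)) {
                constituent.children.add(child);
            } else {
                if (child.end <= constituent.start) {
                    position++;
                }
                remaining.add(child);
            }
        }
        children.clear();
        children.addAll(remaining);
        children.add(position, constituent);
    }

    public static void main(String[] args) {
        // Offsets for the text "the dog barks".
        SpanNode root = new SpanNode(0, 13, "S");
        root.insert(new SpanNode(0, 3, "DT"));
        root.insert(new SpanNode(4, 7, "NN"));
        root.insert(new SpanNode(8, 13, "VBZ"));
        root.insert(new SpanNode(0, 7, "NP")); // adopts DT and NN
        for (SpanNode child : root.children) {
            System.out.println(child.type + " [" + child.start + ".." + child.end + ")");
        }
    }
}
```

In this sketch the order of insertion does not matter as long as spans nest properly: a wider constituent inserted later adopts the narrower nodes it covers.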
-
show
public void show(StringBuffer sb)
Fills the specified StringBuffer with a string representation of this parse.
- Parameters:
sb - A StringBuffer into which the parse string can be appended.
-
show
public void show()
Prints this parse using Penn Treebank-style formatting.
-
getTagSequenceProb
public double getTagSequenceProb()
- Returns:
- The probability associated with the pos-tag sequence assigned to this parse.
-
complete
public boolean complete()
- Returns:
- true if the parse contains a single top-most node (i.e. it is complete), false otherwise.
-
getCoveredText
public String getCoveredText()
-
toStringPennTreebank
public String toStringPennTreebank()
- Returns:
- A String representation using Penn Treebank-style formatting.
-
toString
public String toString()
Represents this Parse in a human-readable way.
-
getText
public String getText()
- Returns:
- The text of the sentence over which this parse was formed.
-
getSpan
public Span getSpan()
- Returns:
- The character offsets for this constituent.
-
getProb
public double getProb()
- Returns:
- The log of the product of the probabilities associated with all the decisions which formed this constituent.
-
addProb
public void addProb(double logProb)
Adds the specified logProb to the current log probability of this parse.
- Parameters:
logProb - The log probability of an action performed on this parse.
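Because the stored value is the log of a product, each new decision is folded in by addition rather than multiplication. A minimal sketch of that bookkeeping follows; `LogProbNode` is a hypothetical stand-in introduced for illustration, not the OpenNLP class, and it assumes natural logarithms.

```java
// Hypothetical stand-in for a parse node that tracks a log probability;
// it is not the OpenNLP Parse class.
final class LogProbNode {
    // Log of the product of the probabilities of all decisions so far.
    private double logProb;

    LogProbNode(double initialLogProb) {
        this.logProb = initialLogProb;
    }

    // log(a * b) = log(a) + log(b), so a new decision is added, not multiplied.
    void addProb(double stepLogProb) {
        logProb += stepLogProb;
    }

    double getProb() {
        return logProb;
    }

    public static void main(String[] args) {
        LogProbNode node = new LogProbNode(0.0);  // log(1.0): no decisions yet
        node.addProb(Math.log(0.5));              // first decision, probability 0.5
        node.addProb(Math.log(0.25));             // second decision, probability 0.25
        // Exponentiating recovers the product 0.5 * 0.25, up to rounding.
        System.out.println(Math.exp(node.getProb()));
    }
}
```

Working in log space avoids numeric underflow when many small probabilities are combined, which is the usual motivation for this representation.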
-
getChildren
public Parse[] getChildren()
- Returns:
- The child constituents of this constituent.
-
setChild
public void setChild(int index, String label)
Replaces the child at the specified index with a new child with the specified label.
- Parameters:
index - The index of the child to be replaced.
label - The label to be assigned to the new child.
-
add
public void add(Parse daughter, HeadRules rules)
-
remove
public void remove(int index)
-
adjoinRoot
public Parse adjoinRoot(Parse node, HeadRules rules, int parseIndex)
-
adjoin
public Parse adjoin(Parse sister, HeadRules rules)
Sister adjoins this node's last child and the specified sister node and returns their new parent node. The new parent node replaces this node's last child.
- Parameters:
sister - The node to be adjoined.
rules - The HeadRules for the parser.
- Returns:
- The new parent node of this node and the specified sister node.
-
expandTopNode
public void expandTopNode(Parse root)
-
getChildCount
public int getChildCount()
- Returns:
- The number of children for this parse node.
-
indexOf
public int indexOf(Parse child)
- Parameters:
child - A child of this parse.
- Returns:
- The index of the specified child, or -1 if the specified child is not a child of this parse.
-
getHead
public Parse getHead()
- Returns:
- The head constituent associated with this constituent.
-
getHeadIndex
public int getHeadIndex()
- Returns:
- The index within a sentence of the head token for this parse.
-
getLabel
public String getLabel()
Retrieves the label assigned to this parse node during parsing, which specifies how this node will be formed into a constituent.
- Returns:
- The outcome label assigned to this node during parsing.
-
setLabel
public void setLabel(String label)
Assigns this parse the specified label. This is used by parsing schemes to tag parsing nodes while building.
- Parameters:
label - A label indicating something about the stage of building for this parse node.
-
updateHeads
public void updateHeads(HeadRules rules)
Computes the head parses for this parse and its sub-parses and stores this information in the parse data structure.
- Parameters:
rules - The HeadRules which determine how the head of the parse is computed.
-
updateSpan
public void updateSpan()
-
pruneParse
public static void pruneParse(Parse parse)
Prunes the specified sentence parse of vacuous productions.
- Parameters:
parse - The sentence Parse.
-
fixPossesives
public static void fixPossesives(Parse parse)
-
parseParse
public static Parse parseParse(String parse)
Parses the specified tree-bank style parse string and returns a Parse structure for that string.
-
parseParse
public static Parse parseParse(String parse, GapLabeler gl)
- Parameters:
parse - A tree-bank style parse string.
gl - The GapLabeler to be used.
- Returns:
- A Parse structure for the specified tree-bank style parse string.
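To show what a tree-bank style parse string looks like and how its bracket structure maps onto a tree, here is a minimal recursive-descent sketch. It is an illustration only, not the OpenNLP parser: `BracketNode` and its methods are hypothetical, and the sketch assumes plain `(LABEL child...)` and `(TAG token)` forms with no traces, function-tag handling, gap labeling, or bracket escapes.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical tree node for a bracketed parse string; not OpenNLP's Parse.
final class BracketNode {
    final String label;  // constituent label or pos-tag
    final String token;  // non-null only for pos-tag (leaf) nodes
    final List<BracketNode> children = new ArrayList<>();

    private BracketNode(String label, String token) {
        this.label = label;
        this.token = token;
    }

    // Parses a string such as "(S (NP (DT the) (NN dog)) (VP (VBZ barks)))".
    static BracketNode parse(String text) {
        int[] pos = {0}; // shared cursor into the input
        BracketNode root = parseNode(text, pos);
        skipSpaces(text, pos);
        if (pos[0] != text.length()) {
            throw new IllegalArgumentException("unexpected trailing input at offset " + pos[0]);
        }
        return root;
    }

    private static BracketNode parseNode(String s, int[] pos) {
        skipSpaces(s, pos);
        expect(s, pos, '(');
        String label = readSymbol(s, pos);
        skipSpaces(s, pos);
        BracketNode node;
        if (pos[0] < s.length() && s.charAt(pos[0]) == '(') {
            // Internal node: one or more nested constituents.
            node = new BracketNode(label, null);
            while (pos[0] < s.length() && s.charAt(pos[0]) == '(') {
                node.children.add(parseNode(s, pos));
                skipSpaces(s, pos);
            }
        } else {
            // Leaf node: a pos-tag followed by its token.
            node = new BracketNode(label, readSymbol(s, pos));
            skipSpaces(s, pos);
        }
        expect(s, pos, ')');
        return node;
    }

    private static void skipSpaces(String s, int[] pos) {
        while (pos[0] < s.length() && Character.isWhitespace(s.charAt(pos[0]))) {
            pos[0]++;
        }
    }

    private static void expect(String s, int[] pos, char c) {
        if (pos[0] >= s.length() || s.charAt(pos[0]) != c) {
            throw new IllegalArgumentException("expected '" + c + "' at offset " + pos[0]);
        }
        pos[0]++;
    }

    private static String readSymbol(String s, int[] pos) {
        int start = pos[0];
        while (pos[0] < s.length()) {
            char c = s.charAt(pos[0]);
            if (c == '(' || c == ')' || Character.isWhitespace(c)) {
                break;
            }
            pos[0]++;
        }
        if (start == pos[0]) {
            throw new IllegalArgumentException("expected a symbol at offset " + start);
        }
        return s.substring(start, pos[0]);
    }

    // Renders the tree back into single-line bracketed form.
    String toBracketString() {
        if (token != null) {
            return "(" + label + " " + token + ")";
        }
        StringBuilder sb = new StringBuilder("(").append(label);
        for (BracketNode child : children) {
            sb.append(' ').append(child.toBracketString());
        }
        return sb.append(')').toString();
    }
}
```

Calling `BracketNode.parse(...)` on a well-formed single-line string and then `toBracketString()` reproduces the input when tokens are separated by single spaces, which makes the sketch easy to check by round-tripping.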
-
getParent
public Parse getParent()
- Returns:
- The parent parse node of this constituent.
-
setParent
public void setParent(Parse parent)
Specifies the parent parse node for this constituent.
- Parameters:
parent - The parent parse node for this constituent.
-
isPosTag
public boolean isPosTag()
Indicates whether this parse node is a pos-tag.
- Returns:
- true if this node is a pos-tag, false otherwise.
-
isFlat
public boolean isFlat()
Indicates whether this parse node contains no sub-constituents.
- Returns:
- true if this constituent contains no sub-constituents, false otherwise.
-
isChunk
public void isChunk(boolean ic)
-
isChunk
public boolean isChunk()
-
getTagNodes
public Parse[] getTagNodes()
- Returns:
- The parse nodes which are children of this node and which are pos tags.
-
getTokenNodes
public Parse[] getTokenNodes()
-
getCommonParent
public Parse getCommonParent(Parse node)
Returns the deepest shared parent of this node and the specified node. If the nodes are identical, then their parent is returned. If one node is the parent of the other, then the parent node is returned.
- Parameters:
node - The node from which parents are compared to this node's parents.
- Returns:
- The deepest shared parent of this node and the specified node.
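The three cases described above (distinct nodes, identical nodes, and one node being an ancestor of the other) can be sketched with parent pointers. `TreeNode` is a hypothetical class introduced for illustration, and the approach shown, collecting one node's ancestors and walking up from the other, is one plausible way to meet the documented contract rather than a statement of how the library does it.

```java
import java.util.HashSet;
import java.util.Set;

// Hypothetical tree node with a parent pointer; not OpenNLP's Parse.
final class TreeNode {
    final String label;
    final TreeNode parent;

    TreeNode(String label, TreeNode parent) {
        this.label = label;
        this.parent = parent;
    }

    // Returns the deepest node that is an ancestor-or-self of both nodes,
    // except that identical nodes yield their parent, as documented above.
    TreeNode getCommonParent(TreeNode other) {
        if (this == other) {
            return parent;
        }
        // Collect this node and all of its ancestors. TreeNode does not
        // override equals/hashCode, so the set compares by identity.
        Set<TreeNode> ancestors = new HashSet<>();
        for (TreeNode n = this; n != null; n = n.parent) {
            ancestors.add(n);
        }
        // Walk up from the other node; the first hit is the deepest shared one.
        for (TreeNode n = other; n != null; n = n.parent) {
            if (ancestors.contains(n)) {
                return n;
            }
        }
        return null; // the nodes belong to different trees
    }

    public static void main(String[] args) {
        TreeNode s = new TreeNode("S", null);
        TreeNode np = new TreeNode("NP", s);
        TreeNode vp = new TreeNode("VP", s);
        TreeNode dt = new TreeNode("DT", np);
        TreeNode nn = new TreeNode("NN", np);
        System.out.println(dt.getCommonParent(nn).label); // siblings share NP
        System.out.println(dt.getCommonParent(vp).label); // shared only at S
        System.out.println(np.getCommonParent(dt).label); // NP is DT's parent
    }
}
```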
-
equals
public boolean equals(Object obj)
-
hashCode
public int hashCode()
-
compareTo
public int compareTo(Parse p)
- Specified by:
compareTo in interface Comparable<Parse>
-
getDerivation
public StringBuffer getDerivation()
- Returns:
- The derivation string for this parse, or null if no derivation string has been created.
-
setDerivation
public void setDerivation(StringBuffer derivation)
Specifies the derivation string to be associated with this parse.
- Parameters:
derivation - The derivation string to be associated with this parse.
-
showCodeTree
public void showCodeTree()
Prints to standard out a representation of this parse which contains hash codes so that parent/child relationships can be explicitly seen.
-
addNames
public static void addNames(String tag, Span[] names, Parse[] tokens)
Utility method to insert named entities.
-