A B C D E F G H I J K L M N O P Q R S T U V W X

A

AbstractRepresentation - Class in de.fu_berlin.ties.context
Provides basic functionality shared by different representations.
AbstractRepresentation(int, int, int, String) - Constructor for class de.fu_berlin.ties.context.AbstractRepresentation
Creates a new instance.
accept(File) - Method in class de.fu_berlin.ties.io.ExtensionFilter
Tests whether or not the specified abstract pathname should be included in a pathname list.
Accuracy - Class in de.fu_berlin.ties.eval
Counts true and false items and measures the accuracy: A = true / (true + false).
Accuracy() - Constructor for class de.fu_berlin.ties.eval.Accuracy
Creates a new empty instance, without using a prefix.
Accuracy(String) - Constructor for class de.fu_berlin.ties.eval.Accuracy
Creates a new empty instance.
Accuracy(FieldMap) - Constructor for class de.fu_berlin.ties.eval.Accuracy
Creates a new instance from a field map, fulfilling the Storable contract.
Accuracy(FieldMap, String) - Constructor for class de.fu_berlin.ties.eval.Accuracy
Creates a new instance from a field map, using a prefix.
AccuracyView - Interface in de.fu_berlin.ties.eval
Provides a read-only view on Accuracy statistics and the underlying raw counts.
activeClasses() - Method in class de.fu_berlin.ties.combi.BeginAfterStrategy
Builds a set of class names (Strings) to pass to the classifier to consider for the next decision.
activeClasses() - Method in class de.fu_berlin.ties.combi.BeginEndStrategy
Builds a set array of class names (Strings) to pass to each classifier to consider for the next decision.
activeClasses() - Method in class de.fu_berlin.ties.combi.CombinationStrategy
Builds a set array of class names (Strings) to pass to each classifier to consider for the next decision.
activeClasses() - Method in class de.fu_berlin.ties.combi.InsideOutsideStrategy
Builds a set of class names (Strings) to pass to the classifier to consider for the next decision.
activeClasses() - Method in class de.fu_berlin.ties.combi.OpenCloseStrategy
Builds a set of class names (Strings) to pass to the classifier to consider for the next decision.
activeClasses() - Method in class de.fu_berlin.ties.combi.TrivialStrategy
Builds a set of class names (Strings) to pass to the classifier to consider for the next decision.
AdaptableLRUMap - Class in de.fu_berlin.ties.util
A fixed-size map that uses an flexible adaptable strategy for pruning entries based on LRU (pruning one the least recently used entries).
AdaptableLRUMap(int, Pruner, int, int) - Constructor for class de.fu_berlin.ties.util.AdaptableLRUMap
Constructs a new, empty map with the specified initial capacity and the default load factor.
AdaptableLRUMap(int, float, Pruner, int, int) - Constructor for class de.fu_berlin.ties.util.AdaptableLRUMap
Constructs a new, empty map with the specified initial capacity and load factor.
adaptKey(String, String) - Method in class de.fu_berlin.ties.TiesConfiguration
Creates a adapted (caller-specific) key by joining a base name with a suffix.
add(Feature) - Method in class de.fu_berlin.ties.classify.feature.FeatureVector
Adds a feature to this vector.
add(Prediction) - Method in class de.fu_berlin.ties.classify.PredictionDistribution
Adds a new prediction to this distribution.
add(Recognition) - Method in class de.fu_berlin.ties.context.PriorRecognitions
Adds the given recognition to this buffer, discarding the least recently added recognition if appropriate.
add(Extraction) - Method in class de.fu_berlin.ties.extract.EvaluatedExtractionContainer
Adds an extraction to this container.
add(Extraction) - Method in class de.fu_berlin.ties.extract.ExtractionContainer
Adds an extraction to this container.
add(FieldMap) - Method in class de.fu_berlin.ties.io.FieldContainer
Adds a field map to this container.
add(Storable) - Method in class de.fu_berlin.ties.io.FieldContainer
Adds the representation of a Storable to this container, by calling its Storable.storeFields() method and adding the resulting field map.
add(FieldMap, boolean) - Method in class de.fu_berlin.ties.io.FieldContainer
Helper method for adding a field map to this container.
add(List) - Method in class de.fu_berlin.ties.io.FieldContainer
Adds a field map created from the specified values, using the n-th key from the set of all keys for the n-th specified value.
add(String) - Method in class de.fu_berlin.ties.text.TokenContainer
Adds text to this container.
add(boolean, String) - Method in class de.fu_berlin.ties.text.TokenCounter
Adds a token to this instance.
addAll(Collection) - Method in class de.fu_berlin.ties.classify.feature.DefaultFeatureVector
Adds all of the features in the specified Collection to this vector, in the order they are returned by the specified Collection's Iterator.
addAll(Collection) - Method in class de.fu_berlin.ties.classify.feature.FeatureVector
Adds all of the features in the specified Collection to this vector, in the order they are returned by the specified Collection's Iterator.
addAll(FeatureVector) - Method in class de.fu_berlin.ties.classify.feature.FeatureVector
Adds all of the features in the specified feature vector to this vector, in the order they are returned by the specified feature vector.
addAllAndClear(Collection, EvalStatus, String) - Method in class de.fu_berlin.ties.extract.EvaluatedExtractionContainer
Helper method that adds a bunch of extractions, setting their evaluation status and source as specified.
addAllTokens(CharSequence, TextTokenizer) - Method in class de.fu_berlin.ties.classify.feature.FeatureVector
Tokenizes a text, creating and adding a feature for each token.
addConfiguration(Configuration, Configuration) - Method in class de.fu_berlin.ties.TiesConfiguration
Adds a configuration and a corresponding descriptor config.
addDescriptorConfig(Configuration) - Method in class de.fu_berlin.ties.TiesConfiguration
Adds a descriptor configuration that can be consulted to query the type and use of a entries.
addFeature(FeatureVector, String, String) - Method in class de.fu_berlin.ties.context.SimpleRepresentation
Creates a feature and adds it to a feature vector.
addKey(String) - Method in class de.fu_berlin.ties.io.FieldContainer
Helper method that adds a key to the set of all keys.
addProb(Probability, boolean) - Method in class de.fu_berlin.ties.classify.Prediction
Adds a new probability for the prediction.
addPunctuationDetails(TokenDetails) - Method in class de.fu_berlin.ties.extract.Extractor
Adds an element to the collected punctuation details.
addText(FeatureVector, String, String) - Method in class de.fu_berlin.ties.context.SimpleRepresentation
Adds feature(s) representing text to a feature vector, using the instance tokenizer for splitting the text into tokens.
addToken(TokenDetails, boolean) - Method in class de.fu_berlin.ties.extract.Extraction
Adds a token to this extraction, delegating to Extraction.addToken(TokenDetails, Probability, boolean) with a probability of -1 ("confirmed").
addToken(TokenDetails, Probability, boolean) - Method in class de.fu_berlin.ties.extract.Extraction
Adds a token to this extraction, recalculating the probability by multiplying the prior probability value with the probability of the new text.
addWhitespace() - Method in class de.fu_berlin.ties.text.TokenCounter
Adds whitespace to this instance, setting TokenCounter.isWhitespaceAfterLast() to true.
adjust(CharSequence, Writer) - Method in class de.fu_berlin.ties.xml.XMLAdjuster
Tries to fix corrupt XML documents, especially documents containing nesting errors.
adjust(Reader, Writer) - Method in class de.fu_berlin.ties.xml.XMLAdjuster
Tries to fix corrupt XML documents, especially documents containing nesting errors.
adjustWeights(Feature, short[]) - Method in class de.fu_berlin.ties.classify.winnow.Winnow
Adjusts the weights of a feature for all classes.
ALL - Static variable in class de.fu_berlin.ties.eval.MultiFMetrics
Reserved type used for the calculated sums and averages over all types: <All>.
allClasses() - Method in class de.fu_berlin.ties.combi.BeginAfterStrategy
Returns an array of immutable sets of all classes (Strings) that can possible occur during classification for each classifier.
allClasses() - Method in class de.fu_berlin.ties.combi.BeginEndStrategy
Returns an array of immutable sets of all classes (Strings) that can possible occur during classification for each classifier.
allClasses() - Method in class de.fu_berlin.ties.combi.CombinationStrategy
Returns an array of immutable sets of all classes (Strings) that can possible occur during classification for each classifier.
allClasses() - Method in class de.fu_berlin.ties.combi.InsideOutsideStrategy
Returns an array of immutable sets of all classes (Strings) that can possible occur during classification for each classifier.
allClasses() - Method in class de.fu_berlin.ties.combi.OpenCloseStrategy
Returns an array of immutable sets of all classes (Strings) that can possible occur during classification for each classifier.
allClasses() - Method in class de.fu_berlin.ties.combi.TrivialStrategy
Returns an array of immutable sets of all classes (Strings) that can possible occur during classification for each classifier.
ALTERNATIVE - Static variable in class de.fu_berlin.ties.eval.EvalStatus
Constant for answer keys that could have proposed as predictions but were not.
ANAKIA_TEMPLATE - Static variable in class de.fu_berlin.ties.util.VelocityService
Name of the main template used for Anakia XML format.
AnswerBuilder - Class in de.fu_berlin.ties.extract
Buildings an ExtractionContainer of answer keys from an annotated text (in XML format).
AnswerBuilder() - Constructor for class de.fu_berlin.ties.extract.AnswerBuilder
Creates a new instance, using a default extension and configuring the target structure from the standard configuration.
AnswerBuilder(String) - Constructor for class de.fu_berlin.ties.extract.AnswerBuilder
Creates a new instance, configuring the target structure from the standard configuration.
AnswerBuilder(String, TiesConfiguration) - Constructor for class de.fu_berlin.ties.extract.AnswerBuilder
Creates a new instance, configuring the target structure from the provided configuration.
AnswerBuilder(String, TargetStructure, TokenizerFactory, TiesConfiguration) - Constructor for class de.fu_berlin.ties.extract.AnswerBuilder
Creates a new instance.
appendPositiveNumber(StringBuffer, long, int, String, boolean) - Static method in class de.fu_berlin.ties.util.Util
Appends a non-negative number to a string buffer.
appendPunctuation(Extraction) - Method in class de.fu_berlin.ties.extract.Extractor
Appends the collected punctuation details (if any) to the provided extraction.
arrayAsSet(T[]) - Static method in class de.fu_berlin.ties.util.CollectionUtils
Wraps an array into a set.
arrayIsEmpty(String[]) - Static method in class de.fu_berlin.ties.TiesConfiguration
Utility method that thorougly checks whether a string array is empty.
asBoolean(Object) - Static method in class de.fu_berlin.ties.util.Util
Converts an object into a boolean primitive.
asBooleanArray(Object[]) - Static method in class de.fu_berlin.ties.util.CollectionUtils
Converts an object array into a boolean array, calling Util.asBoolean(Object) on each element.
asByte(Object) - Static method in class de.fu_berlin.ties.util.Util
Converts an object into a byte primitive.
asByteArray(Object[]) - Static method in class de.fu_berlin.ties.util.CollectionUtils
Converts an object array into a byte array, calling Util.asByte(Object) on each element.
asChar(Object) - Static method in class de.fu_berlin.ties.util.Util
Converts an object into a char primitive.
asCharArray(Object[]) - Static method in class de.fu_berlin.ties.util.CollectionUtils
Converts an object array into a char array, calling Util.asChar(Object) on each element.
asDouble(Object) - Static method in class de.fu_berlin.ties.util.Util
Converts an object into a double primitive.
asDoubleArray(Object[]) - Static method in class de.fu_berlin.ties.util.CollectionUtils
Converts an object array into a double array, calling Util.asDouble(Object) on each element.
asFloat(Object) - Static method in class de.fu_berlin.ties.util.Util
Converts an object into a float primitive.
asFloatArray(Object[]) - Static method in class de.fu_berlin.ties.util.CollectionUtils
Converts an object array into a float array, calling Util.asFloat(Object) on each element.
asInt(Object) - Static method in class de.fu_berlin.ties.util.Util
Converts an object into an integer primitive.
asIntArray(Object[]) - Static method in class de.fu_berlin.ties.util.CollectionUtils
Converts an object array into an integer array, calling Util.asInt(Object) on each element.
asLong(Object) - Static method in class de.fu_berlin.ties.util.Util
Converts an object into a long primitive.
asLongArray(Object[]) - Static method in class de.fu_berlin.ties.util.CollectionUtils
Converts an object array into a long array, calling Util.asLong(Object) on each element.
asShort(Object) - Static method in class de.fu_berlin.ties.util.Util
Converts an object into a short primitive.
asShortArray(Object[]) - Static method in class de.fu_berlin.ties.util.CollectionUtils
Converts an object array into a short array, calling Util.asShort(Object) on each element.
asString(Object) - Static method in class de.fu_berlin.ties.util.Util
Converts an object into a String.
asStringArray(Object[]) - Static method in class de.fu_berlin.ties.util.CollectionUtils
Converts an object array into a String array, calling Util.asString(Object) on each element.
ATTRIBUTE - Static variable in class de.fu_berlin.ties.context.FeatureType
Constant for features representing attribute values.
attributeByName(Element, String) - Static method in class de.fu_berlin.ties.xml.dom.DOMUtils
Returns the attribute with the given name, compatible to the name format returned by DOMUtils.name(Attribute).
attributeCount() - Method in class de.fu_berlin.ties.io.FieldContainer
Returns the number of attributes stored in this container.
attributeIterator() - Method in class de.fu_berlin.ties.io.FieldContainer
Returns an iterator over the names of the attributes stored in this container.
avoids(Element) - Method in interface de.fu_berlin.ties.filter.ElementFilter
Whether the filter would avoid to filter an element, if possible.
avoids(Element) - Method in class de.fu_berlin.ties.filter.TrainableFilter
Whether the filter would avoid to filter an element, if possible.
AXIS_ANCESTOR - Static variable in class de.fu_berlin.ties.context.DefaultRepresentation
Ancestor axis.
AXIS_DESC_OR_SELF - Static variable in class de.fu_berlin.ties.context.DefaultRepresentation
Descendant-or-self axis.
AXIS_FOLLOW_SIBLING - Static variable in class de.fu_berlin.ties.context.DefaultRepresentation
Following sibling axis.
AXIS_PREC_SIBLING - Static variable in class de.fu_berlin.ties.context.DefaultRepresentation
Preceeding sibling axis.
AXIS_PRIOR - Static variable in class de.fu_berlin.ties.context.DefaultRepresentation
The pseudo-axis of prior recognitions.

B

backgroundMap() - Method in class de.fu_berlin.ties.io.FieldContainer
This map can be used to extend newly added field maps: at each FieldContainer.add(FieldMap) operation, any key/value pairs from this map are added to field map prior to storing it.
BaseSensor - Class in de.fu_berlin.ties.context.sensor
Abstract implementation of the Sensor interface that stores a configuration and provides a factory method to initialize a set of sensors.
BaseSensor(TiesConfiguration) - Constructor for class de.fu_berlin.ties.context.sensor.BaseSensor
Creates a new instance.
BaseStorable - Class in de.fu_berlin.ties.io
Convenience implementations of the Storable interface that implements the BaseStorable.toString() method based on the field map created by Storable.storeFields().
BaseStorable() - Constructor for class de.fu_berlin.ties.io.BaseStorable
Creates a new instance.
BeginAfterStrategy - Class in de.fu_berlin.ties.combi
A combination strategy using begin/after tagging (also called "BIA" tagging due to the prefixed used).
BeginAfterStrategy(Set<String>) - Constructor for class de.fu_berlin.ties.combi.BeginAfterStrategy
Creates a new instance.
BeginEndStrategy - Class in de.fu_berlin.ties.combi
A combination strategy that uses two classifiers, one to recognize the begin of extractions and one to recognize the end.
BeginEndStrategy(Set<String>) - Constructor for class de.fu_berlin.ties.combi.BeginEndStrategy
Creates a new instance.
best() - Method in class de.fu_berlin.ties.classify.PredictionDistribution
Returns the best (most probably) prediction in this distribution.
BOOLEAN_CLASSES - Static variable in class de.fu_berlin.ties.filter.TrainableFilter
An immutable set of the target classes for the classifier: the strings "false" and "true", in alphabetic order.
buildAnswers(Document) - Method in class de.fu_berlin.ties.extract.AnswerBuilder
Buildings an ExtractionContainer of answer keys from from an annotated XML document.
buildContext(Element, String, String, String, PriorRecognitions, Map<Element, List<LocalFeature>>, String) - Method in class de.fu_berlin.ties.context.AbstractRepresentation
Builds the context representation of text in an element.
buildContext(Document, PriorRecognitions, Map<Element, List<LocalFeature>>, String) - Method in class de.fu_berlin.ties.context.Representation
Builds the context representation of a document.
buildContext(Element, PriorRecognitions, Map<Element, List<LocalFeature>>, String) - Method in class de.fu_berlin.ties.context.Representation
Builds the context representation of an element.
buildContext(Element, String, String, String, PriorRecognitions, Map<Element, List<LocalFeature>>, String) - Method in class de.fu_berlin.ties.context.Representation
Builds the context representation of text in an element.
buildFeatures(String, Element, ElementPosition, boolean, LinkedList<Feature>, boolean, Map<Element, List<LocalFeature>>) - Method in class de.fu_berlin.ties.context.DefaultRepresentation
Builds the features of an element and appends them to the specified featureList.
buildFeatures(Element) - Method in class de.fu_berlin.ties.filter.RepresentationFilter
Converts an element into a feature vector to be used by the trainable classifier.
buildFeatures(Element) - Method in class de.fu_berlin.ties.filter.TrainableFilter
Converts an element into a feature vector to be used by the trainable classifier.
buildLocalFeatures(Element, ElementPosition, boolean) - Method in class de.fu_berlin.ties.context.DefaultRepresentation
Builds the local features of an element.
buildPrior(PriorRecognitions) - Method in class de.fu_berlin.ties.context.DefaultRepresentation
Builds the pseudo-axis of prior recognitions.
buildTextFeatures(String, Element, String, String, String, LinkedList<Feature>) - Method in class de.fu_berlin.ties.context.DefaultRepresentation
Builds the context representation of text in an element, differentiating between three kinds of textual contents: a left part, a main part, and a right part.

C

CALCULATED - Static variable in class de.fu_berlin.ties.context.FeatureType
Constant for features representing calculated values.
calculateHeadValues(Element, List<LocalFeature>) - Method in class de.fu_berlin.ties.context.DefaultRepresentation
Creates values that depend on "head" children of an element, if the element contains any of them.
calculatePositionalValues(String, ElementPosition, List<LocalFeature>) - Method in class de.fu_berlin.ties.context.DefaultRepresentation
Calculates values that depend on the position of an element within its parent.
calculateValuesFromText(String, String, List<LocalFeature>) - Method in class de.fu_berlin.ties.context.DefaultRepresentation
Calculates values that depend on the textual content of an element: prefixes, suffixes, length data, and token type.
capturedText() - Method in class de.fu_berlin.ties.text.TextTokenizer
Returns the text captured within "capturing groups" in the last token.
CDATA_SECTION - Static variable in class de.fu_berlin.ties.xml.OtherConstituent
Type constant: a CDATA section.
checkEvent(String) - Method in class de.fu_berlin.ties.xml.XMLAdjuster
Method called by the XMLAdjuster.logEvent(String, String) methods whenever an event occurred to ensure the event is acceptable.
chooseClassesToAdjust(WinnowDistribution, String, Set<String>, Set<String>) - Method in class de.fu_berlin.ties.classify.winnow.UltraconservativeWinnow
Chooses the classes to promote and the classes to demote.
chooseClassesToAdjust(WinnowDistribution, String, Set<String>, Set<String>) - Method in class de.fu_berlin.ties.classify.winnow.Winnow
Chooses the classes to promote and the classes to demote.
classForName(String) - Method in class de.fu_berlin.ties.extract.TargetStructure
Returns the TargetClass object for a given class name, if defined in this target structure.
Classifier - Interface in de.fu_berlin.ties.classify
Classes implementing this interface must be able to classify items represented by feature vectors.
classify(FeatureVector, Set) - Method in interface de.fu_berlin.ties.classify.Classifier
Classifies an item that is represented by a feature vector by choosing the most probable class among a set of candidate classes.
classify(FeatureVector, Set) - Method in class de.fu_berlin.ties.classify.TrainableClassifier
Classifies an item that is represented by a feature vector by choosing the most probable class among a set of candidate classes.
classifyAndTrain(FieldContainer, File, String) - Method in class de.fu_berlin.ties.classify.ClassTrain
Classifies a list of files, training the classifier on each error.
ClassTrain - Class in de.fu_berlin.ties.classify
Classifies a list of files, training the classifier on each error.
ClassTrain() - Constructor for class de.fu_berlin.ties.classify.ClassTrain
Creates a new instance using a default extension and the standard configuration.
ClassTrain(String) - Constructor for class de.fu_berlin.ties.classify.ClassTrain
Creates a new instance using the standard configuration.
ClassTrain(String, TiesConfiguration) - Constructor for class de.fu_berlin.ties.classify.ClassTrain
Creates a new instance from the provided configuration.
ClassTrain(String, TiesConfiguration, TokenizerFactory, String) - Constructor for class de.fu_berlin.ties.classify.ClassTrain
Creates a new instance.
cleanHTML(String, String) - Method in class de.fu_berlin.ties.preprocess.PreProcessor
Converts HTML input to a clean XHTML representation, if necessary.
clear() - Method in class de.fu_berlin.ties.util.MultiValueMap
Removes all mappings from this map.
clearPunctuation() - Method in class de.fu_berlin.ties.extract.Extractor
Clears the collected punctuation details.
close(int) - Method in interface de.fu_berlin.ties.Closeable
Closes this instance, releasing all resources and stopping any background threads.
close(int) - Method in class de.fu_berlin.ties.CollectingProcessor
Closes this instance, releasing all resources and stopping any background threads.
CLOSE - Static variable in class de.fu_berlin.ties.context.LocalFeature
Character closing a quoted value.
close(int) - Method in class de.fu_berlin.ties.extract.TrainEval
Closes this instance, releasing all resources and stopping any background threads.
close(int) - Method in class de.fu_berlin.ties.util.TaskRunner
Closes this task runner.
Closeable - Interface in de.fu_berlin.ties
Classes implementing this interface announce that they should be closed for releasing all resources and stopping any background activity.
CollectingProcessor - Class in de.fu_berlin.ties
Abstract base clase for a Processor that collects all the input arguments and processes the collected arguments when shutting down.
CollectingProcessor(TiesConfiguration) - Constructor for class de.fu_berlin.ties.CollectingProcessor
Creates a new instance.
CollectionUtils - Class in de.fu_berlin.ties.util
A static class that provides utility methods for working with Collections and arrays.
collectText(Branch) - Static method in class de.fu_berlin.ties.xml.dom.DOMUtils
Recursively collects the complete textual content of a branch, i.e.
collectText(Branch, StringBuffer) - Static method in class de.fu_berlin.ties.xml.dom.DOMUtils
Recursively collects the complete textual content of a branch, i.e.
collectText(Branch, Writer) - Static method in class de.fu_berlin.ties.xml.dom.DOMUtils
Recursively collects the complete textual content of a branch, i.e.
CombinationState - Class in de.fu_berlin.ties.combi
Immutable class contains state information used by combination strategies.
CombinationState(String, boolean, boolean) - Constructor for class de.fu_berlin.ties.combi.CombinationState
Creates a new instance, setting CombinationState.isDiscardPreceding() to false.
CombinationState(String, boolean, boolean, boolean) - Constructor for class de.fu_berlin.ties.combi.CombinationState
Creates a new instance.
CombinationStrategy - Class in de.fu_berlin.ties.combi
A combination strategy combines sequential classifications.
CombinationStrategy(Set<String>) - Constructor for class de.fu_berlin.ties.combi.CombinationStrategy
Creates a new instance.
combineArrays(Object[], Object[], Object[]) - Static method in class de.fu_berlin.ties.util.CollectionUtils
Combines two array into a target array, inserting all elements of the first array and then all elements of the second array in the target array.
COMMENT - Static variable in class de.fu_berlin.ties.xml.OtherConstituent
Type constant: a comment.
COMMENT_START - Static variable in class de.fu_berlin.ties.io.DelimSepValues
The character introducing a comment: '#'.
compare(Prediction, Prediction) - Method in class de.fu_berlin.ties.classify.PredictionComparator
Compares two Predictions.
compare(T, T) - Method in class de.fu_berlin.ties.util.InvertedComparator
Compares its two arguments for order.
completeTemplateName(String) - Static method in class de.fu_berlin.ties.util.VelocityService
Completes the short form of a template name, by prepending the VelocityService.TEMPLATE_DIR and appending the VelocityService.TEMPLATE_EXT.
CONF - Static variable in class de.fu_berlin.ties.TiesConfiguration
The main configuration object for TIES, loaded via TiesConfiguration.TiesConfiguration(String) using "ties" as base name.
CONF_DIR - Static variable in class de.fu_berlin.ties.TiesConfiguration
The subdirectory in the class path containing config files.
CONF_EXTENSION - Static variable in class de.fu_berlin.ties.TiesConfiguration
The extension of config files in PropertiesConfiguration format.
confidence(float, float) - Method in class de.fu_berlin.ties.classify.winnow.Winnow
Converts a sigmoid activation value into a confidence estimate.
CONFIG_AVOID - Static variable in class de.fu_berlin.ties.extract.ExtractorBase
Configuration key: List of elements that should be avoided when filtering (using parent element instead).
CONFIG_CLASSIFIER - Static variable in interface de.fu_berlin.ties.classify.Classifier
Base configuration key for classifiers.
CONFIG_COMBINATION - Static variable in class de.fu_berlin.ties.combi.CombinationStrategy
Configuration key specifying the combination strategy to use.
CONFIG_COMPRESS_GZIP - Static variable in class de.fu_berlin.ties.io.IOUtils
Configuration key: whether to compress your data in gzip format.
CONFIG_DELETE_CONTROL_CHARS - Static variable in class de.fu_berlin.ties.xml.XMLAdjuster
Configuration key: whether to delete control characters (which are not allowed in XML 1.0 and discouraged in XML 1.1).
CONFIG_DELETE_PSEUDO_TAGS - Static variable in class de.fu_berlin.ties.xml.XMLAdjuster
Configuration key: whether to delete "pseudo-tags".
CONFIG_DIR - Static variable in class de.fu_berlin.ties.classify.ExternalClassifier
Configuration key: the directory to run the classifier in (optional, defaults to current working directory).
CONFIG_ELEMENTS - Static variable in class de.fu_berlin.ties.extract.ExtractorBase
Configuration key: List of elements to filter.
CONFIG_EMPTIABLE_TAGS - Static variable in class de.fu_berlin.ties.xml.XMLAdjuster
Configuration key: Set of names of tags that can be converted empty tags when required.
CONFIG_ESCAPE_PSEUDO_ENTITIES - Static variable in class de.fu_berlin.ties.xml.XMLAdjuster
Configuration key: whether to escape "&" starting a possible nonstandard entity reference ("&" at the start of one of the 5 predefined entity references or a character reference is never escaped, all other "&" are always escaped).
CONFIG_FEEDBACK - Static variable in class de.fu_berlin.ties.extract.TrainEval
Configuration key: If true, a fully incremental setup is used where the trainer is trained on each document after the extractor processed it.
CONFIG_FILE_EXT - Static variable in class de.fu_berlin.ties.classify.ClassTrain
Configuration key: The extension to append to file names given via the File key (if any).
CONFIG_GOAL_PREFIX - Static variable in class de.fu_berlin.ties.TiesConfiguration
Configuration key prefix mapping goals to fully qualified class names.
CONFIG_HTMLCONV_COMMAND - Static variable in class de.fu_berlin.ties.preprocess.PreProcessor
Configuration key prefix: command name and arguments of an external converter from a specified type to HTML.
CONFIG_LANG - Static variable in class de.fu_berlin.ties.TiesConfiguration
Special configuration key: the language of documents, using the ISO 639 language codes (2-letter codes where available, e.g.
CONFIG_LOGGER_LOG - Static variable in class de.fu_berlin.ties.util.Util
Configuration key: Only messages with this priority or higher are logged.
CONFIG_LOGGER_SHOW - Static variable in class de.fu_berlin.ties.util.Util
Configuration key: Only messages with this priority or higher are written to standard output (but only if covered by Util.CONFIG_LOGGER_LOG).
CONFIG_MATCH_ALL - Static variable in class de.fu_berlin.ties.extract.EvaluatedExtractionContainer
Configuration key: whether to use match-all or match-best as match mode.
CONFIG_MATCH_POSITION - Static variable in class de.fu_berlin.ties.extract.EvaluatedExtractionContainer
Configuration key for EvaluatedExtractionContainer.isMatchingPosition().
CONFIG_MISSING_ROOT - Static variable in class de.fu_berlin.ties.xml.XMLAdjuster
Configuration key: the name to use for the root element if missing.
CONFIG_POST - Static variable in class de.fu_berlin.ties.TextProcessor
Configuration prefix for post-processors.
CONFIG_PREPROCESS_TAGGER - Static variable in class de.fu_berlin.ties.preprocess.PreProcessor
Configuration key: A tagger (or a list of taggers) used to annotate a text e.g. with linguistic information.
CONFIG_PREPROCESS_TEXT - Static variable in class de.fu_berlin.ties.preprocess.PreProcessor
Configuration key: Whether plain text is preprocessed to recognize and reformat definition lists.
CONFIG_RECOGN_NUM - Static variable in class de.fu_berlin.ties.context.AbstractRepresentation
Configuration key: The number of preceding recognitions to represent.
CONFIG_RELEVANT_PUNCTUATION - Static variable in class de.fu_berlin.ties.extract.ExtractorBase
Configuration key: list of punctuation and symbol tokens that are considered as relevant from the very start.
CONFIG_SENTENCE - Static variable in class de.fu_berlin.ties.extract.ExtractorBase
Configuration suffix/prefix used for sentence filtering.
CONFIG_SENTENCE_TUNE - Static variable in class de.fu_berlin.ties.extract.TrainEval
Configuration key: The maximum number of iterations used for TUNE training the sentence classifier; if 0 or negative, the value of TrainEval.CONFIG_TUNE is used.
CONFIG_SPLIT_MAXIMUM - Static variable in class de.fu_berlin.ties.context.AbstractRepresentation
Configuration key: The maximum number of subsequences to keep when a feature value must be split.
CONFIG_STORE_NTH - Static variable in class de.fu_berlin.ties.context.AbstractRepresentation
Configuration key: Each n-th context representation is stored for debugging and inspection purposes,if > 0.
CONFIG_TARGET_CLASSES - Static variable in class de.fu_berlin.ties.extract.TargetStructure
Configuration key: names of the classes to recognize (temporarily).
CONFIG_TEMPLATE - Static variable in class de.fu_berlin.ties.util.VelocityService
Name of the default template used to print configuration parameters.
CONFIG_TEST_ONLY - Static variable in class de.fu_berlin.ties.extract.Trainer
Configuration key determining whether the trainer only ensures that all answer keys exist and can be located in the document instead of doing any training.
CONFIG_TEST_SPLIT - Static variable in class de.fu_berlin.ties.extract.TrainEval
Configuration key: The percentage of a corpus to use for testing (evaluation).
CONFIG_TOE - Static variable in class de.fu_berlin.ties.extract.Trainer
Configuration key for determining the training mode (Trainer.isTrainingOnlyErrors()).
CONFIG_TOKEN_PATTERNS - Static variable in class de.fu_berlin.ties.text.TokenizerFactory
Configuration key for the array of regular expressions defining the token types accepted by the tokenizer.
CONFIG_TRAIN_SPLIT - Static variable in class de.fu_berlin.ties.extract.TrainEval
Configuration key: The percentage of a corpus to use for training.
CONFIG_TRANSFORMERS - Static variable in class de.fu_berlin.ties.classify.feature.FeatureTransformer
Configuration key specifying the list of transformers to chain.
CONFIG_TUNE - Static variable in class de.fu_berlin.ties.extract.TrainEval
Configuration key: The maximum number of iterations used for TUNE (train until no error) training; if 1, training is incremental.
CONFIG_TUNE_EACH - Static variable in class de.fu_berlin.ties.extract.TrainEval
Configuration key: Whether to measure results after each TUNE iteration or only at the end of training.
CONFIG_TUNE_SINCE - Static variable in class de.fu_berlin.ties.extract.TrainEval
Configuration key: The training iteration after which to evaluate results for the first time if TrainEval.CONFIG_TUNE_EACH is enabled.
CONFIG_TUNE_STOP - Static variable in class de.fu_berlin.ties.extract.TrainEval
Configuration key: TUNE training is stopped if the training accuracy didn't improve for the specified number of iterations.
CONFIG_WHITESPACE_PATTERN - Static variable in class de.fu_berlin.ties.text.TokenizerFactory
Configuration key for the regular expression giving the whitespace accepted by the tokenizer.
ConfigurableProcessor - Class in de.fu_berlin.ties
An abstract processor that just stores a configuration.
ConfigurableProcessor(TiesConfiguration) - Constructor for class de.fu_berlin.ties.ConfigurableProcessor
Creates a new instance.
configureFromArgs(String[]) - Method in class de.fu_berlin.ties.TiesConfiguration
Modifies configuration properties from [+|-]key[=value] pairs in a string array.
configureLog(TiesConfiguration) - Static method in class de.fu_berlin.ties.util.Util
Reconfigures the logger from the Util.CONFIG_LOGGER_LOG and Util.CONFIG_LOGGER_SHOW values in the provided configuration.
contains(String) - Method in class de.fu_berlin.ties.xml.TagContainer
Checks whether this instance contains at least one appearance of the specified tag.
containsExtraction(Element) - Method in class de.fu_berlin.ties.filter.EmbeddingElements
Checks whether the given element contains an extraction (or parts of an extraction).
containsKey(K) - Method in class de.fu_berlin.ties.util.MultiValueMap
Returns true if this map contains a mapping for the specified key.
containsNonTentative(String) - Method in class de.fu_berlin.ties.xml.OpenTags
Checks whether this instance contains at least one appearance of the specified tag that is not TagVariety.TENTATIVE.
containsValue(V) - Method in class de.fu_berlin.ties.util.MultiValueMap
Checks whether the map contains the value specified.
containsValue(K, V) - Method in class de.fu_berlin.ties.util.MultiValueMap
Checks whether the collection at the specified key contains the value.
ContentType - Class in de.fu_berlin.ties.io
A simple wrapper for the MIME type and the character set of a document.
ContextDetails - Class in de.fu_berlin.ties.extract.amend
Extends the TokenDetails class by also storing the context of a token.
ContextDetails(String, int, int, boolean, FeatureVector) - Constructor for class de.fu_berlin.ties.extract.amend.ContextDetails
Creates a new instance.
ContextDetails(TokenDetails, FeatureVector) - Constructor for class de.fu_berlin.ties.extract.amend.ContextDetails
Creates a new instance, re-using field values from a TokenDetails instance.
ContextMap - Class in de.fu_berlin.ties
A map used to store context-specific key/value pairs in the processing architecture.
ContextMap() - Constructor for class de.fu_berlin.ties.ContextMap
Creates a new instance.
ContextMap(int) - Constructor for class de.fu_berlin.ties.ContextMap
Creates a new instance.
ContextMap(int, float) - Constructor for class de.fu_berlin.ties.ContextMap
Creates a new instance.
ContextMap(Map<String, Object>) - Constructor for class de.fu_berlin.ties.ContextMap
Creates a new instance.
CONTINUATION - Static variable in class de.fu_berlin.ties.xml.TagVariety
Constant for start tags that were created when splitting a tag.
CONTROL_CHARS - Static variable in class de.fu_berlin.ties.xml.XMLAdjuster
Pattern specifying sequences of control characters (character codes below the space character, except tab, line feed and carriage return).
CORRECT - Static variable in class de.fu_berlin.ties.eval.EvalStatus
Constant for correct predictions (true positives) resp for answer keys matched by a prediction.
CORRECT_CLASS - Static variable in class de.fu_berlin.ties.classify.ClassTrain
Value of the ClassTrain.KEY_CLASSIFICATION field for correct predictions: "+".
countDocument() - Method in class de.fu_berlin.ties.eval.FeatureCount
Counts a document (increases the number of documents by one.
countFirst(String, char) - Static method in class de.fu_berlin.ties.text.TextUtils
Counts how often a character is repeated at the begin of a string.
countLast(String, char) - Static method in class de.fu_berlin.ties.text.TextUtils
Counts how often a character is repeated at the end of a string.
createAlnumTokenizer(CharSequence) - Static method in class de.fu_berlin.ties.text.TokenizerFactory
Static factory method to create an instance for tokenizing alphanumeric and symbol sequences and puntuation.
createAttributeFeatures(String, String, String, int) - Static method in class de.fu_berlin.ties.context.LocalFeature
Factory method creating a feature that wraps an attribute value.
createBinarySet(String) - Method in class de.fu_berlin.ties.classify.MultiBinaryClassifier
Helper method that creates a set containing the two classes of a binary classifier.
createBinarySet(String) - Method in class de.fu_berlin.ties.classify.OneAgainstTheRestClassifier
Helper method that creates a set containing the two classes of a binary classifier.
createCalculatedFeature(String, String, String) - Static method in class de.fu_berlin.ties.context.LocalFeature
Factory method creating a feature that wraps a calculated value.
createCalculatedFeatures(String, String, String, int) - Static method in class de.fu_berlin.ties.context.LocalFeature
Factory method creating a feature that wraps a calculated value that might contain whitespace.
createCategoryTokenizer(CharSequence) - Static method in class de.fu_berlin.ties.text.TokenizerFactory
Static factory method to create an instance for tokenizing according to Unicode categories.
createClassifier(Set<String>) - Static method in class de.fu_berlin.ties.classify.TrainableClassifier
Factory method that delegates to TrainableClassifier.createClassifier(Set, TiesConfiguration) using the standard configuration.
createClassifier(Set<String>, TiesConfiguration) - Static method in class de.fu_berlin.ties.classify.TrainableClassifier
Factory method that delegates to TrainableClassifier.createClassifier(Set, File, TiesConfiguration) without specifying an run directory.
createClassifier(Set<String>, File, TiesConfiguration) - Static method in class de.fu_berlin.ties.classify.TrainableClassifier
Factory method that delegates to TrainableClassifier.createClassifier(Set, File, FeatureTransformer, String[], TiesConfiguration).
createClassifier(Set<String>, File, FeatureTransformer, String[], TiesConfiguration) - Static method in class de.fu_berlin.ties.classify.TrainableClassifier
Factory method that creates a trainable classifier based on the provided specification.
createCollection(Collection<? extends V>) - Method in class de.fu_berlin.ties.util.MultiValueMap
Creates a new instance of the map value Collection container.
createCollection(Collection<? extends V>) - Method in class de.fu_berlin.ties.util.SortedMultiValueMap
Creates a new instance of the map value Collection container. This implementation returns a TreeSet.
createdEvalContainer() - Method in class de.fu_berlin.ties.eval.ReEvaluator
Created an empty container for evaluating extractions, using the stored configuration to initialize target structure and match mode.
createElementFeature(String) - Static method in class de.fu_berlin.ties.context.LocalFeature
Factory method creating a feature that just represents an element itself.
createFieldContainer() - Static method in class de.fu_berlin.ties.io.FieldContainer
Factory method that creates a field container in DelimSepValues format.
createFieldContainer(CharSequence) - Static method in class de.fu_berlin.ties.io.FieldContainer
Factory method that creates a field container from serialized data in DelimSepValues format.
createFieldContainer(InputStream) - Static method in class de.fu_berlin.ties.io.FieldContainer
Factory method that creates a field container from serialized data in DelimSepValues format.
createFieldContainer(Reader) - Static method in class de.fu_berlin.ties.io.FieldContainer
Factory method that creates a field container from serialized data in DelimSepValues format.
createFilteringTokenWalker(TrainableFilter) - Method in class de.fu_berlin.ties.extract.Extractor
Creates a filtering token walker to be used for walking through a document and sentence classification if a double classification approach is used.
createFilteringTokenWalker(TrainableFilter) - Method in class de.fu_berlin.ties.extract.ExtractorBase
Creates a filtering token walker to be used for walking through a document and sentence classification if a double classification approach is used.
createFilteringTokenWalker(TrainableFilter) - Method in class de.fu_berlin.ties.extract.Trainer
Creates a filtering token walker to be used for walking through a document and sentence classification if a double classification approach is used.
createMetrics() - Method in class de.fu_berlin.ties.eval.MultiFMetrics
Helper factory method that creates a new FMetrics instance of the required type.
createMetrics(FieldMap) - Method in class de.fu_berlin.ties.eval.MultiFMetrics
Helper factory method that creates a new FMetrics instance of the required type from a field map.
createNestedContainer(String) - Method in class de.fu_berlin.ties.io.FieldContainer
Creates and returns a new nested subcontainer.
createObject(Class) - Method in class de.fu_berlin.ties.io.FieldMap
Creates (deserializes) an object of a specified type by calling a constructor of the class that accepts a field map as single argument and passing itself as parameter.
createObject(String[]) - Static method in class de.fu_berlin.ties.util.Util
Delegates to Util.createObject(Class, String[]), reading the class name from the first element in the array.
createObject(Class, String[]) - Static method in class de.fu_berlin.ties.util.Util
Delegates to Util.createObject(Class, Object[], Class), setting the paramType to the String class.
createObject(Class, Object[], Class) - Static method in class de.fu_berlin.ties.util.Util
Creates an object of a specified type.
createObject(Class, Object[], Class[]) - Static method in class de.fu_berlin.ties.util.Util
Creates an object of a specified type.
createObject(Element, Class) - Static method in class de.fu_berlin.ties.xml.io.ObjectElement
Creates (deserializes) an object of a specified type by calling a constructor of the class that accepts an XML element as single argument and passing itself as parameter.
createObject() - Method in class de.fu_berlin.ties.xml.io.ObjectElement
Creates (deserializes) an object of a specified type by calling a constructor of the class that accepts an XML element as single argument and passing itself as parameter.
createObjects(Class) - Method in class de.fu_berlin.ties.io.FieldContainer
Creates (deserializes) an list of objects of a specified type by calling FieldMap.createObject(Class) for each of the field maps contained in this container.
createOutFile(File, String) - Static method in class de.fu_berlin.ties.io.IOUtils
Creates a file for writing output to.
createOutFile(File, String, String) - Static method in class de.fu_berlin.ties.io.IOUtils
Creates a file for writing output to.
createOutFile(File, String, String, MutableInt) - Static method in class de.fu_berlin.ties.io.IOUtils
Creates a file for writing output to.
createRecognitionBuffer() - Method in class de.fu_berlin.ties.context.Representation
Creates a buffer to be filled with prior Recognitions and passed as argument to the Representation.buildContext(Element, String, String, String, PriorRecognitions, Map, String) method.
createSensors(String[], TiesConfiguration) - Static method in class de.fu_berlin.ties.context.sensor.BaseSensor
Factory method that initializes an array of sensors.
createSentenceFilter(TiesConfiguration, Representation) - Static method in class de.fu_berlin.ties.extract.ExtractorBase
Helper methat that initializes the filter to be used for the first step of a double classification approach ("sentence filtering").
createStrategy(Set<String>) - Static method in class de.fu_berlin.ties.combi.CombinationStrategy
Factory method that delegates to CombinationStrategy.createStrategy(Set, TiesConfiguration) using the standard configuration.
createStrategy(Set<String>, TiesConfiguration) - Static method in class de.fu_berlin.ties.combi.CombinationStrategy
Factory method that creates a combination strategy based on the CombinationStrategy.CONFIG_COMBINATION key in the provided configuration.
createTextFeatures(String, String, int) - Static method in class de.fu_berlin.ties.context.LocalFeature
Factory method creating a feature that wraps textual content of an element.
createThoroughTokenizer(CharSequence) - Static method in class de.fu_berlin.ties.text.TokenizerFactory
Static factory method to create an instance that uses the "thorough" patterns listed below.
createTokenizer(CharSequence) - Method in class de.fu_berlin.ties.text.TokenizerFactory
Factory method to create an instance from the configured token and whitespace patterns.
createTransformer() - Static method in class de.fu_berlin.ties.classify.feature.FeatureTransformer
Factory method that delegates to FeatureTransformer.createTransformer() using the standard configuration.
createTransformer(TiesConfiguration) - Static method in class de.fu_berlin.ties.classify.feature.FeatureTransformer
Factory method that creates a feature transformer based on the FeatureTransformer.CONFIG_TRANSFORMERS key in the provided configuration.
createXMLTokenizer(CharSequence, boolean) - Static method in class de.fu_berlin.ties.xml.XMLTokenizerFactory
Factory method to create an instance for parsing files in XML syntax.

D

de.fu_berlin.ties - package de.fu_berlin.ties
This package bundles main entry points and general interfaces and classes for the TIE software.
de.fu_berlin.ties.classify - package de.fu_berlin.ties.classify
This package provides functionality for classification of texts and feature vectors.
de.fu_berlin.ties.classify.feature - package de.fu_berlin.ties.classify.feature
This package contains classes for working with features, feature vectors and feature transformers.
de.fu_berlin.ties.classify.winnow - package de.fu_berlin.ties.classify.winnow
This package contains the Winnow classification algorithm and related algorithms and classes.
de.fu_berlin.ties.combi - package de.fu_berlin.ties.combi
This package provides combination strategies for combining sequential classification decisions.
de.fu_berlin.ties.context - package de.fu_berlin.ties.context
This packages provides functionality for building and managing representations of context in texts (XML documents).
de.fu_berlin.ties.context.sensor - package de.fu_berlin.ties.context.sensor
Sensors are object that look up information for a token, for example semantic information from gazetteers or thesauri.
de.fu_berlin.ties.eval - package de.fu_berlin.ties.eval
This packages provides functionality for evaluating results of classifiers and extractors.
de.fu_berlin.ties.extract - package de.fu_berlin.ties.extract
This package handles information extraction and entitiy recognition.
de.fu_berlin.ties.extract.amend - package de.fu_berlin.ties.extract.amend
This package provides code for reanalysing proposed extractions and performing suitable amendments to improve results.
de.fu_berlin.ties.filter - package de.fu_berlin.ties.filter
This packages provides generic filtering functionality and an implementation for sentence filtering.
de.fu_berlin.ties.io - package de.fu_berlin.ties.io
This package provides classes for input/output handling and for (de)serialization.
de.fu_berlin.ties.preprocess - package de.fu_berlin.ties.preprocess
This packages handles format conversions and linguistic preprocessing of documents.
de.fu_berlin.ties.text - package de.fu_berlin.ties.text
This package contains utility classes for working with texts.
de.fu_berlin.ties.util - package de.fu_berlin.ties.util
This package contains miscellaneous utility classes.
de.fu_berlin.ties.xml - package de.fu_berlin.ties.xml
This package contains utility classes for working with XML documents and related data.
de.fu_berlin.ties.xml.dom - package de.fu_berlin.ties.xml.dom
This package contains utility classes for working with DOM-like XML representations, focussing especially on dom4j.
de.fu_berlin.ties.xml.io - package de.fu_berlin.ties.xml.io
This package provides functionality for reading and writing XML data.
DEFAULT_NAME - Static variable in class de.fu_berlin.ties.util.TaskRunner
The base name of worker threads used by the default instance.
DEFAULT_SEPARATOR - Static variable in class de.fu_berlin.ties.classify.feature.SBPHTransformer
The separator used by default (a space character).
DefaultFeature - Class in de.fu_berlin.ties.classify.feature
Default implementation of the Feature class.
DefaultFeature(String) - Constructor for class de.fu_berlin.ties.classify.feature.DefaultFeature
Creates a new instance, without storing a comment.
DefaultFeature(String, String) - Constructor for class de.fu_berlin.ties.classify.feature.DefaultFeature
Creates a new instance, setting the strength to 1.0.
DefaultFeature(String, String, float) - Constructor for class de.fu_berlin.ties.classify.feature.DefaultFeature
Creates a new instance.
DefaultFeatureVector - Class in de.fu_berlin.ties.classify.feature
This feature vector implementation stores an ordered sequence of features.
DefaultFeatureVector() - Constructor for class de.fu_berlin.ties.classify.feature.DefaultFeatureVector
Creates a new instance.
DefaultRepresentation - Class in de.fu_berlin.ties.context
The context representation used by default.
DefaultRepresentation() - Constructor for class de.fu_berlin.ties.context.DefaultRepresentation
Creates a new instance based on the standard configuration.
DefaultRepresentation(TiesConfiguration) - Constructor for class de.fu_berlin.ties.context.DefaultRepresentation
Creates a new instance based on the provided configuration.
DefaultRepresentation(int, int, int, int, int, int, String, String, String[], int, String, String[], TiesConfiguration) - Constructor for class de.fu_berlin.ties.context.DefaultRepresentation
Creates a new instance.
defaultWeight() - Method in class de.fu_berlin.ties.classify.winnow.Winnow
Returns the default weight to use if a feature is unknown.
DELIM - Static variable in class de.fu_berlin.ties.io.DelimSepValues
The field delimiter character: '|' (a pipe).
DelimSepValues - Class in de.fu_berlin.ties.io
A field container that stores and processed its contents as values separated by a pipe character ('|').
DelimSepValues() - Constructor for class de.fu_berlin.ties.io.DelimSepValues
Creates a new empty instance.
DelimSepValues(CharSequence) - Constructor for class de.fu_berlin.ties.io.DelimSepValues
Creates a new instance from serialized delimiter-separated values.
DelimSepValues(InputStream) - Constructor for class de.fu_berlin.ties.io.DelimSepValues
Creates a new instance from serialized delimiter-separated values.
DelimSepValues(Reader) - Constructor for class de.fu_berlin.ties.io.DelimSepValues
Creates a new instance from serialized delimiter-separated values.
DelimSepValues(StorableContainer) - Constructor for class de.fu_berlin.ties.io.DelimSepValues
Creates a new instance and populates it from a StorableContainer.
deregisterInterest() - Static method in class de.fu_berlin.ties.util.TaskRunner
Deregisters interest to use the default runner.
DESC_EXTENSION - Static variable in class de.fu_berlin.ties.TiesConfiguration
The extension of descriptor configuration files (in PropertiesConfiguration format).
determineCharset(InputStreamReader) - Static method in class de.fu_berlin.ties.io.IOUtils
Returns the charset used by an InputStreamReader.
determineCharset(OutputStreamWriter) - Static method in class de.fu_berlin.ties.io.IOUtils
Returns the charset used by an OutputStreamWriter.
determineCharsetName(InputStreamReader) - Static method in class de.fu_berlin.ties.io.IOUtils
Returns the canoncical name of the charset used by an InputStreamReader.
determineCharsetName(OutputStreamWriter) - Static method in class de.fu_berlin.ties.io.IOUtils
Returns the canoncical name of the charset used by an OutputStreamWriter.
determineContentType(File, TiesConfiguration) - Static method in class de.fu_berlin.ties.io.ContentType
Determines the content type based on a file name.
determineContentType(URL, String, TiesConfiguration) - Static method in class de.fu_berlin.ties.io.ContentType
Helper method to determines the content type based an "Content-Type" header and/or URL extension.
determineHeadValue(Element) - Method in class de.fu_berlin.ties.context.DefaultRepresentation
Helper method for determining the head value for an element of type DefaultRepresentation.getHeadElement().
determineOutputDirectory(TiesConfiguration) - Static method in class de.fu_berlin.ties.io.IOUtils
Determines the output to directory to use, reading it from the TextProcessor.KEY_OUT_DIRECTORY configuration key in a given configuration.
determineRoughPosition(int, int) - Method in class de.fu_berlin.ties.context.DefaultRepresentation
Helper method called by DefaultRepresentation.calculatePositionalValues(String, ElementPosition, List) to collapse a position in to one of five values.
DirectoryProcessor - Class in de.fu_berlin.ties
Abstract base class for a Processor that operates on the contents of directories.
DirectoryProcessor(FileFilter, TiesConfiguration) - Constructor for class de.fu_berlin.ties.DirectoryProcessor
Creates a new instance.
disableSentenceTraining() - Method in class de.fu_berlin.ties.extract.Trainer
Disables training the embedded sentence filter, if sentence filtering is used.
doBuildContext(Element, String, String, String, PriorRecognitions, Map<Element, List<LocalFeature>>, String) - Method in class de.fu_berlin.ties.context.AbstractRepresentation
Builds the context representation of text in an element.
doBuildContext(Element, String, String, String, PriorRecognitions, Map<Element, List<LocalFeature>>, String) - Method in class de.fu_berlin.ties.context.DefaultRepresentation
Builds the context representation of text in an element.
doBuildContext(Element, String, String, String, PriorRecognitions, Map, String) - Method in class de.fu_berlin.ties.context.SimpleRepresentation
Builds the context representation of text in an element.
doClassify(FeatureVector, Set, ContextMap) - Method in class de.fu_berlin.ties.classify.ExternalClassifier
Classifies an item that is represented by a feature vector by choosing the most probable class among a set of candidate classes.
doClassify(FeatureVector, Set, ContextMap) - Method in class de.fu_berlin.ties.classify.MetaClassifier
Classifies an item that is represented by a feature vector by choosing the most probable class among a set of candidate classes.
doClassify(FeatureVector, Set, ContextMap) - Method in class de.fu_berlin.ties.classify.MultiBinaryClassifier
Classifies an item that is represented by a feature vector by choosing the most probable class among a set of candidate classes. This implementation combines the predictions for the foreground of all involved inner classifiers.
doClassify(FeatureVector, Set, ContextMap) - Method in class de.fu_berlin.ties.classify.OneAgainstTheRestClassifier
Classifies an item that is represented by a feature vector by choosing the most probable class among a set of candidate classes. This implementation combines the predictions for the positive class of all involved inner classifiers.
doClassify(FeatureVector, Set, ContextMap) - Method in class de.fu_berlin.ties.classify.TrainableClassifier
Classifies an item that is represented by a feature vector by choosing the most probable class among a set of candidate classes.
doClassify(FeatureVector, Set, ContextMap) - Method in class de.fu_berlin.ties.classify.winnow.Winnow
Classifies an item that is represented by a feature vector by choosing the most probable class among a set of candidate classes.
DOCTYPE - Static variable in class de.fu_berlin.ties.xml.OtherConstituent
Type constant: a document type declaration.
DocumentProcessor - Class in de.fu_berlin.ties
Abstract base class for a Processor that operates on XML documents.
DocumentProcessor(String, TiesConfiguration) - Constructor for class de.fu_berlin.ties.DocumentProcessor
Creates a new instance.
DocumentReader - Class in de.fu_berlin.ties
Abstract base class for a Processor that read XML documents.
DocumentReader(String, TiesConfiguration) - Constructor for class de.fu_berlin.ties.DocumentReader
Creates a new instance.
DocumentSerializer - Interface in de.fu_berlin.ties.xml.io
Interface for classes that support reading and writing XML documents.
DocumentWalker - Class in de.fu_berlin.ties.xml.dom
Walks through a document, handing the elements matched by a NodeFilter over to an ElementProcessor.
DocumentWalker(NodeFilter, ElementProcessor, TokenizerFactory) - Constructor for class de.fu_berlin.ties.xml.dom.DocumentWalker
Creates a new instance.
DOMUtils - Class in de.fu_berlin.ties.xml.dom
A static class that provides utility constants and methods for working with DOM-like XML representations, focussing especially on dom4j.
doProcess(Reader, Writer, ContextMap) - Method in class de.fu_berlin.ties.classify.ClassTrain
Delegates to ClassTrain.classifyAndTrain(FieldContainer, File, String).
doProcess(Reader, Writer, ContextMap) - Method in class de.fu_berlin.ties.DocumentReader
Processes the contents of a reader, writing a modified version to a writer.
doProcess(Reader, Writer, ContextMap) - Method in class de.fu_berlin.ties.eval.ReEvaluator
Processes the contents of a reader, writing a modified version to a writer.
doProcess(Reader, Writer, ContextMap) - Method in class de.fu_berlin.ties.extract.TrainEval
Processes the contents of a reader, writing a modified version to a writer.
doProcess(Reader, Writer, ContextMap) - Method in class de.fu_berlin.ties.preprocess.PreProcessor
Preprocesses the contents of a file.
doProcess(Reader, Writer, ContextMap) - Method in class de.fu_berlin.ties.preprocess.TreeTagger
Augments the input text with the output of the TreeTagger.
doProcess(Reader, Writer, ContextMap) - Method in class de.fu_berlin.ties.TextProcessor
Processes the contents of a reader, writing a modified version to a writer.
doProcess(Reader, Writer, ContextMap) - Method in class de.fu_berlin.ties.xml.XMLAdjuster
Tries to fix corrupt XML documents, especially documents containing nesting errors.
doTrain(FeatureVector, String, ContextMap) - Method in class de.fu_berlin.ties.classify.ExternalClassifier
Incorporates an item that is represented by a feature vector into the classification model.
doTrain(FeatureVector, String, ContextMap) - Method in class de.fu_berlin.ties.classify.MetaClassifier
Incorporates an item that is represented by a feature vector into the classification model.
doTrain(FeatureVector, String, ContextMap) - Method in class de.fu_berlin.ties.classify.MultiBinaryClassifier
Incorporates an item that is represented by a feature vector into the classification model.
doTrain(FeatureVector, String, ContextMap) - Method in class de.fu_berlin.ties.classify.OneAgainstTheRestClassifier
Incorporates an item that is represented by a feature vector into the classification model.
doTrain(FeatureVector, String, ContextMap) - Method in class de.fu_berlin.ties.classify.TrainableClassifier
Incorporates an item that is represented by a feature vector into the classification model.
doTrain(FeatureVector, String, ContextMap) - Method in class de.fu_berlin.ties.classify.winnow.Winnow
Winnow supports only error-driven training, so you always have to use the TrainableClassifier.trainOnError(FeatureVector, String, Set) method instead of this one.
doTransform(FeatureVector) - Method in class de.fu_berlin.ties.classify.feature.FeatureTransformer
Transforms a feature vector.
doTransform(FeatureVector) - Method in class de.fu_berlin.ties.classify.feature.OSBTransformer
Transforms a feature vector.
doTransform(FeatureVector) - Method in class de.fu_berlin.ties.classify.feature.SBPHTransformer
Transforms a feature vector.
DUMMY - Static variable in class de.fu_berlin.ties.xml.OpenTags
A dummy object stored as map value where no value is required.

E

ELEMENT - Static variable in class de.fu_berlin.ties.context.FeatureType
Constant for features representing an element itself (marker is empty just as for text).
ElementFilter - Interface in de.fu_berlin.ties.filter
Interface for classes that decide whether or not to accept elements in XML documents.
ElementNameFilter - Class in de.fu_berlin.ties.xml.dom
A node filter that only accepts elements with a specified name.
ElementNameFilter(Set<String>) - Constructor for class de.fu_berlin.ties.xml.dom.ElementNameFilter
Creates a new instance.
ElementNameFilter(List<String>) - Constructor for class de.fu_berlin.ties.xml.dom.ElementNameFilter
Creates a new instance.
ElementNameFilter(String[]) - Constructor for class de.fu_berlin.ties.xml.dom.ElementNameFilter
Creates a new instance.
ElementNameFilter(String) - Constructor for class de.fu_berlin.ties.xml.dom.ElementNameFilter
Creates a new instance that accepts only elements of a single type.
ElementPosition - Class in de.fu_berlin.ties.context
A small immutable wrapper class that bundles the position of an element within a parent element and the number of children of the parent.
ElementPosition(int, int, int, int, int, int) - Constructor for class de.fu_berlin.ties.context.ElementPosition
Creates a new instance.
ElementProcessor - Interface in de.fu_berlin.ties.xml.dom
Interface for a processor that operates on elements in XML documents.
elementsByName(Element, String) - Static method in class de.fu_berlin.ties.xml.dom.DOMUtils
Returns the child elements with the given name, compatible to the name format returned by DOMUtils.name(Element).
EmbeddingElements - Class in de.fu_berlin.ties.filter
Determines which elements in a document contain extractions (or parts of extractions).
EmbeddingElements(Document, ExtractionContainer, TokenizerFactory) - Constructor for class de.fu_berlin.ties.filter.EmbeddingElements
Creates a new instance.
EMPTY_TAG - Static variable in class de.fu_berlin.ties.xml.TagConstituent
Type constant: an empty tag.
enableSentenceTraining() - Method in class de.fu_berlin.ties.extract.Trainer
Re-enables training the embedded filter, if sentence filtering is used.
END_TAG - Static variable in class de.fu_berlin.ties.xml.TagConstituent
Type constant: an end tag.
endOfExtraction() - Method in class de.fu_berlin.ties.extract.ExtractionLocator
Whether we reached the end of the current extraction.
endTagCount(String) - Method in class de.fu_berlin.ties.xml.UnprocessedTags
Returns the number of end tags with the specified name contained in this container.
ensureNonNegative(double, String) - Static method in class de.fu_berlin.ties.util.Util
Ensures that a number is positive or 0, throwing an exception if this condition is violated.
ensureNonNegative(long, String) - Static method in class de.fu_berlin.ties.util.Util
Ensures that a number is positive or 0, throwing an exception if this condition is violated.
ensurePrintableName(String) - Static method in class de.fu_berlin.ties.text.TextUtils
Checks that a string is a printable name, meaning it has at at least one character and does not contain any whitespace.
entryIterator() - Method in class de.fu_berlin.ties.io.FieldContainer
Returns an iterator over the FieldMaps in this container in the order they were added.
equals(Object) - Method in class de.fu_berlin.ties.classify.feature.Feature
Indicates whether some other object is "equal to" this one, fulfulling the Object.equals(java.lang.Object) contract.
equals(Object) - Method in class de.fu_berlin.ties.classify.Prediction
Indicates whether some other object is "equal to" this one, fulfulling the Object.equals(java.lang.Object) contract.
equals(Object) - Method in class de.fu_berlin.ties.combi.CombinationState
Indicates whether some other object is "equal to" this one, fulfulling the Object.equals(java.lang.Object) contract.
equals(Object) - Method in class de.fu_berlin.ties.extract.Extraction
Indicates whether some other object is "equal to" this one, fulfulling the Object.equals(java.lang.Object) contract.
equals(Object) - Method in class de.fu_berlin.ties.extract.TargetClass
Indicates whether some other object is "equal to" this one, fulfulling the Object.equals(java.lang.Object) contract.
equals(Object) - Method in class de.fu_berlin.ties.extract.TargetStructure
Indicates whether some other object is "equal to" this one, fulfulling the Object.equals(java.lang.Object) contract.
equals(Object) - Method in class de.fu_berlin.ties.util.MultiValueMap
Compares the specified object with this map for equality.
ESCAPE - Static variable in class de.fu_berlin.ties.io.DelimSepValues
The escape character: '\' (a backslash).
ESCAPED_AMP - Static variable in class de.fu_berlin.ties.xml.XMLAdjuster
Escape sequence for the "&" character.
EvalInput - Interface in de.fu_berlin.ties.eval
Classes implementing this interface provide input for calculating evaluation metrics: true positives, false negatives and false positives.
EvalStatus - Class in de.fu_berlin.ties.eval
Type-safe enumeration of possible evaluation states for predictions (unknown, correct, spurious etc.) and answer keys (missing etc.).
evaluate(EmbeddingElements, FilteringTokenWalker) - Static method in class de.fu_berlin.ties.filter.FilterEvaluator
Evaluates precision, recall, and F1 measure of the last walk performed by a filtering token woken.
evaluate(EmbeddingElements, Set, Set) - Static method in class de.fu_berlin.ties.filter.FilterEvaluator
Evaluates precision, recall, and F1 measure of an element filter by comparing the accepted and rejected elements of the filter with the correct set of embeddingElements.
evaluateBatch(ExtractionContainer, ExtractionContainer, String) - Method in class de.fu_berlin.ties.extract.EvaluatedExtractionContainer
Evaluates a container of predicted extractions against a container of true extractions (answer keys) and adds them to this instance.
EvaluatedExtractionContainer - Class in de.fu_berlin.ties.extract
An extraction container that evaluates containers of predicted extractions against containers of true extractions (answer keys) and merges their contents, setting the evaluation states accordingly.
EvaluatedExtractionContainer(TargetStructure, TiesConfiguration) - Constructor for class de.fu_berlin.ties.extract.EvaluatedExtractionContainer
Creates a new instance.
EvaluatedExtractionContainer(TargetStructure, boolean, boolean) - Constructor for class de.fu_berlin.ties.extract.EvaluatedExtractionContainer
Creates a new instance.
evaluateSentenceFiltering(ExtractionContainer) - Method in class de.fu_berlin.ties.extract.Extractor
Evaluates precision and recall for sentence filtering on the last processed document.
evaluateSentenceFiltering(EmbeddingElements) - Method in class de.fu_berlin.ties.extract.ExtractorBase
Evaluates precision and recall for sentence filtering on the last processed document.
evaluateSentenceFiltering() - Method in class de.fu_berlin.ties.extract.Trainer
Evaluates precision and recall for sentence filtering on the last processed document.
EVENT_CONVERTED_TO_EMPTY_TAG - Static variable in class de.fu_berlin.ties.xml.XMLAdjuster
Event constant: Converted to empty tag.
EVENT_DELETED_CONTROL_CHARS - Static variable in class de.fu_berlin.ties.xml.XMLAdjuster
Event constant: Deleted control characters.
EVENT_DELETED_PSEUDO_TAG - Static variable in class de.fu_berlin.ties.xml.XMLAdjuster
Event constant: Deleted pseudo-tag.
EVENT_ESCAPED_CHARS - Static variable in class de.fu_berlin.ties.xml.XMLAdjuster
Event constant: Escaped characters that are illegal or unwanted.
EVENT_INSERTED_MISSING_END_TAG - Static variable in class de.fu_berlin.ties.xml.XMLAdjuster
Event constant: Inserted missing end tag.
EVENT_INSERTED_MISSING_ROOT_ELEMENT - Static variable in class de.fu_berlin.ties.xml.XMLAdjuster
Event constant: Inserted missing root element.
EVENT_INSERTED_MISSING_START_TAG - Static variable in class de.fu_berlin.ties.xml.XMLAdjuster
Event constant: Inserted missing start tag.
EVENT_MOVED_END_TAG_UP - Static variable in class de.fu_berlin.ties.xml.XMLAdjuster
Event constant: Moved end tag up.
EVENT_MOVED_START_TAG_DOWN - Static variable in class de.fu_berlin.ties.xml.XMLAdjuster
Event constant: Moved start tag dow.
EVENT_QUOTED_ATTRIBUTE_VALUES - Static variable in class de.fu_berlin.ties.xml.XMLAdjuster
Event constant: Quoted attribute values.
EVENT_SPLIT_TAG - Static variable in class de.fu_berlin.ties.xml.XMLAdjuster
Event constant: Split tag.
execute() - Method in class de.fu_berlin.ties.util.ExternalCommand
Executed the external command any further arguments and without sending input and returns its output (standard out).
execute(CharSequence) - Method in class de.fu_berlin.ties.util.ExternalCommand
Executed the external command without any further arguments and returns its output (standard out).
execute(String[]) - Method in class de.fu_berlin.ties.util.ExternalCommand
Executed the external command without sending input and returns its output (standard out).
execute(String[], CharSequence) - Method in class de.fu_berlin.ties.util.ExternalCommand
Executed the external command and returns its output (standard out).
EXT_ANSWERS - Static variable in class de.fu_berlin.ties.extract.AnswerBuilder
The recommended file extension to use for storing answer keys.
EXT_EXTRACTIONS - Static variable in class de.fu_berlin.ties.extract.Extractor
The recommended file extension to use for storing extractions.
EXT_METRICS - Static variable in class de.fu_berlin.ties.eval.MultiFMetrics
Extension recommended for files storing metrics.
EXT_SEPARATOR - Static variable in class de.fu_berlin.ties.io.IOUtils
The character separating the base name from the extension of a file (a dot).
ExtensionFilter - Class in de.fu_berlin.ties.io
A file filter that checks the extension to decide whether to accept a file.
ExtensionFilter(String, boolean) - Constructor for class de.fu_berlin.ties.io.ExtensionFilter
Creates a new instance that accepts only a single extension.
ExtensionFilter(Set<String>, boolean) - Constructor for class de.fu_berlin.ties.io.ExtensionFilter
Creates a new instance.
ExternalClassifier - Class in de.fu_berlin.ties.classify
A proxy that provides a trainable classifier by communicating with an external (non-Java) program.
ExternalClassifier(Set<String>) - Constructor for class de.fu_berlin.ties.classify.ExternalClassifier
Creates a new instance based on the standard configuration.
ExternalClassifier(Set<String>, TiesConfiguration) - Constructor for class de.fu_berlin.ties.classify.ExternalClassifier
Creates a new instance based on the provided configuration.
ExternalClassifier(Set<String>, FeatureTransformer, File, TiesConfiguration) - Constructor for class de.fu_berlin.ties.classify.ExternalClassifier
Creates a new instance based on the provided arguments.
ExternalCommand - Class in de.fu_berlin.ties.util
A simple wrapper for external scripts or programs.
ExternalCommand(String[]) - Constructor for class de.fu_berlin.ties.util.ExternalCommand
Creates a new instance, without specifying environment parameters and working directory.
ExternalCommand(String[], File) - Constructor for class de.fu_berlin.ties.util.ExternalCommand
Creates a new instance, without specifying environment parameters.
ExternalCommand(String[], String[], File) - Constructor for class de.fu_berlin.ties.util.ExternalCommand
Creates a new instance.
extract(Document) - Method in class de.fu_berlin.ties.extract.Extractor
Extracts items of interest from the contents of an XML document, based on context representation and local classifier.
Extraction - Class in de.fu_berlin.ties.extract
Extends a Prediction by also storing the extracted text and location data.
Extraction(FieldMap) - Constructor for class de.fu_berlin.ties.extract.Extraction
Creates a new instance from a field map, fulfilling the Storable contract.
Extraction(String, String) - Constructor for class de.fu_berlin.ties.extract.Extraction
Creates a new instance without locating it in a text (using -1 for first token rep + index), setting the probability to -1 ("confirmed") and the evaluation status to EvalStatus.TRUTH.
Extraction(String, TokenDetails) - Constructor for class de.fu_berlin.ties.extract.Extraction
Creates a new instance, setting the probability to -1 ("confirmed") and the evaluation status to EvalStatus.TRUTH.
Extraction(String, TokenDetails, Probability) - Constructor for class de.fu_berlin.ties.extract.Extraction
Creates a new instance, setting the evaluation status to EvalStatus.UNKNOWN.
Extraction(String, TokenDetails, Probability, EvalStatus) - Constructor for class de.fu_berlin.ties.extract.Extraction
Creates a new instance.
ExtractionContainer - Class in de.fu_berlin.ties.extract
A container of Extractions of different classes.
ExtractionContainer(TargetStructure) - Constructor for class de.fu_berlin.ties.extract.ExtractionContainer
Creates a new empty instance.
ExtractionContainer(TargetStructure, FieldContainer) - Constructor for class de.fu_berlin.ties.extract.ExtractionContainer
Creates a new instance from a field container, delegating to ExtractionContainer.restoreEntries(FieldContainer).
ExtractionLocator - Class in de.fu_berlin.ties.extract
Locates extractions in a document.
ExtractionLocator(Document, ExtractionContainer, TextTokenizer) - Constructor for class de.fu_berlin.ties.extract.ExtractionLocator
Creates a new instance.
Extractor - Class in de.fu_berlin.ties.extract
An extractor runs a local Classifier on a list of items/nodes and combines their results using a CombinationStrategy.
Extractor() - Constructor for class de.fu_berlin.ties.extract.Extractor
Creates a new instance using a default extension.
Extractor(String) - Constructor for class de.fu_berlin.ties.extract.Extractor
Creates a new instance.
Extractor(String, TiesConfiguration) - Constructor for class de.fu_berlin.ties.extract.Extractor
Creates a new instance.
Extractor(String, File, TiesConfiguration) - Constructor for class de.fu_berlin.ties.extract.Extractor
Creates a new instance.
Extractor(String, Trainer) - Constructor for class de.fu_berlin.ties.extract.Extractor
Creates a new instance, re-using the components from the provided trainer.
Extractor(String, TargetStructure, Classifier[], Representation, CombinationStrategy, TokenizerFactory, TrainableFilter, Reranker, Set<String>, TiesConfiguration) - Constructor for class de.fu_berlin.ties.extract.Extractor
Creates a new instance.
ExtractorBase - Class in de.fu_berlin.ties.extract
Common code base shared by Extractorand Trainer.
ExtractorBase(String) - Constructor for class de.fu_berlin.ties.extract.ExtractorBase
Creates a new instance.
ExtractorBase(String, TiesConfiguration) - Constructor for class de.fu_berlin.ties.extract.ExtractorBase
Creates a new instance, configuring target structure, classifier, DefaultRepresentation, node filter and combination strategy from the provided configuration.
ExtractorBase(String, File, TiesConfiguration) - Constructor for class de.fu_berlin.ties.extract.ExtractorBase
Creates a new instance, configuring target structure, classifier, DefaultRepresentation, node filter, combination strategy and tokenizer factory from the provided configuration.
ExtractorBase(String, TargetStructure, Classifier[], Representation, CombinationStrategy, TokenizerFactory, TrainableFilter, Set<String>, TiesConfiguration) - Constructor for class de.fu_berlin.ties.extract.ExtractorBase
Creates a new instance.

F

Feature - Class in de.fu_berlin.ties.classify.feature
An abstract base class for immutable feature representation used for classification.
Feature() - Constructor for class de.fu_berlin.ties.classify.feature.Feature
Creates a new instance, without storing a comment.
Feature(String) - Constructor for class de.fu_berlin.ties.classify.feature.Feature
Creates a new instance.
FeatureCount - Class in de.fu_berlin.ties.eval
Keeps track of the average number of features and of unique features in context representations and of the average number of contexts in documents.
FeatureCount() - Constructor for class de.fu_berlin.ties.eval.FeatureCount
Creates a new instance.
FeatureCount(FieldMap) - Constructor for class de.fu_berlin.ties.eval.FeatureCount
Creates a new instance from a field map, fulfilling the Storable contract.
FeatureCountView - Interface in de.fu_berlin.ties.eval
Provides a read-only view on the statistics calculated by the FeatureCount class and the underlying raw counts.
FeatureSet - Class in de.fu_berlin.ties.classify.feature
This feature vector implementation stores a multi-set of features.
FeatureSet(String) - Constructor for class de.fu_berlin.ties.classify.feature.FeatureSet
Creates a new instance.
featureSet(FeatureVector) - Method in class de.fu_berlin.ties.classify.winnow.Winnow
Converts a feature vector into a FeatureSet (a multi-set of features).
FeatureTransformer - Class in de.fu_berlin.ties.classify.feature
An feature transformer transforms a feature vector in a suitable way.
FeatureTransformer(FeatureTransformer) - Constructor for class de.fu_berlin.ties.classify.feature.FeatureTransformer
Creates a new instance.
FeatureType - Class in de.fu_berlin.ties.context
Type-safe enumeration of the types of features used by the default representation.
FeatureVector - Class in de.fu_berlin.ties.classify.feature
A feature vector contains the features representing an instance.
FeatureVector() - Constructor for class de.fu_berlin.ties.classify.feature.FeatureVector
Creates a new instance.
FieldContainer - Class in de.fu_berlin.ties.io
A container of FieldMaps.
FieldContainer() - Constructor for class de.fu_berlin.ties.io.FieldContainer
Creates a new empty instance.
FieldContainer(StorableContainer) - Constructor for class de.fu_berlin.ties.io.FieldContainer
Creates a new instance and populates it from a StorableContainer.
FieldMap - Class in de.fu_berlin.ties.io
A map targeted at serialization and deserialiation of objects in human-readable formats.
FieldMap() - Constructor for class de.fu_berlin.ties.io.FieldMap
Creates a new instance.
FieldMap(Map<String, Object>) - Constructor for class de.fu_berlin.ties.io.FieldMap
Creates a new instance with the same mappings as the specified map.
FILE_EXT - Static variable in class de.fu_berlin.ties.io.DelimSepValues
The recommended file extension for this format: "dsv" (delimiter-separated values).
FilterEvaluator - Class in de.fu_berlin.ties.filter
A static class that allows to evaluate the precision, recall, and F1 measure of an element filter.
FilteringTokenWalker - Class in de.fu_berlin.ties.filter
A token walker that only invokes a provided TokenProcessor on the subset of tokens that are children of an element accepted by a provided ElementFilter.
FilteringTokenWalker(TokenProcessor, TokenizerFactory, ElementFilter, SkipHandler) - Constructor for class de.fu_berlin.ties.filter.FilteringTokenWalker
Creates a new instance.
filterRepresentation(FeatureVector) - Method in class de.fu_berlin.ties.context.DefaultRepresentation
Creates a filtered view of a context representation.
finalize() - Method in class de.fu_berlin.ties.util.TaskRunner
Called by the garbage collector on an object when garbage collection determines that there are no more references to the object.
FinalReextractor - Interface in de.fu_berlin.ties.extract.amend
Implemtations of this interface signal that they are able reanalyse the all extractions in a document and perform suitable amendments to improve results.
findFirst(String) - Method in class de.fu_berlin.ties.xml.TagContainer
Finds the first appearance of a tag.
findInSeries(String, int, boolean) - Method in class de.fu_berlin.ties.xml.TagContainer
Finds the appearance of a tag within a specified markup series.
findTentativeTag(String) - Method in class de.fu_berlin.ties.xml.OpenTags
Finds a tentative appearance of a tag, if any exists.
fixedConstituents(CharSequence) - Method in class de.fu_berlin.ties.xml.XMLAdjuster
Returns the constituents of an XML-like document after fixing possible nesting errors etc.
flatten() - Method in class de.fu_berlin.ties.classify.feature.FeatureVector
Flattens the contained features into a single character sequence, without including comments.
flatten(boolean) - Method in class de.fu_berlin.ties.classify.feature.FeatureVector
Flattens a list of features into a single character sequence.
flatten(Configuration) - Method in class de.fu_berlin.ties.TiesConfiguration
Copies all properties contained in this instance to a given configuration.
flatten(boolean[]) - Static method in class de.fu_berlin.ties.util.CollectionUtils
Flattens the elements of the provided array into a single string, separating elements by a space character.
flatten(byte[]) - Static method in class de.fu_berlin.ties.util.CollectionUtils
Flattens the elements of the provided array into a single string, separating elements by a space character.
flatten(char[]) - Static method in class de.fu_berlin.ties.util.CollectionUtils
Flattens the elements of the provided array into a single string, separating elements by a space character.
flatten(double[]) - Static method in class de.fu_berlin.ties.util.CollectionUtils
Flattens the elements of the provided array into a single string, separating elements by a space character.
flatten(float[]) - Static method in class de.fu_berlin.ties.util.CollectionUtils
Flattens the elements of the provided array into a single string, separating elements by a space character.
flatten(int[]) - Static method in class de.fu_berlin.ties.util.CollectionUtils
Flattens the elements of the provided array into a single string, separating elements by a space character.
flatten(long[]) - Static method in class de.fu_berlin.ties.util.CollectionUtils
Flattens the elements of the provided array into a single string, separating elements by a space character.
flatten(Object[]) - Static method in class de.fu_berlin.ties.util.CollectionUtils
Flattens the elements of the provided array into a single string, separating elements by a space character.
flatten(Object[], String) - Static method in class de.fu_berlin.ties.util.CollectionUtils
Flattens the elements of the provided array into a single string, separating elements by the provided separator.
flatten(short[]) - Static method in class de.fu_berlin.ties.util.CollectionUtils
Flattens the elements of the provided array into a single string, separating elements by a space character.
FMetrics - Class in de.fu_berlin.ties.eval
This class manages and updates evaluation results, calculating precision (P), recall (R) and F-measure (F).
FMetrics() - Constructor for class de.fu_berlin.ties.eval.FMetrics
Creates a new empty instance.
FMetrics(EvalInput) - Constructor for class de.fu_berlin.ties.eval.FMetrics
Creates a new instance.
FMetrics(FieldMap) - Constructor for class de.fu_berlin.ties.eval.FMetrics
Creates a new instance from a field map, fulfilling the Storable contract.
FMetrics(long, long, long) - Constructor for class de.fu_berlin.ties.eval.FMetrics
Creates a new instance.
FMetricsSummary - Interface in de.fu_berlin.ties.eval
Implementations of this interface can show statistical summaries of precision, recall, and F1 metrics updated in several operations.
FMetricsView - Interface in de.fu_berlin.ties.eval
A read-only view of the evaluation results calculated by the FMetrics class and the underlying raw counts.
forceRemove(TagConstituent) - Method in class de.fu_berlin.ties.xml.TagContainer
Forces the removal of a single appearance of a tag from the container.
format(double) - Static method in class de.fu_berlin.ties.util.Util
Formats the given number, using at most 4 fraction digits.
format(double, int) - Static method in class de.fu_berlin.ties.util.Util
Formats the given number, using at most the specified number of fraction digits.
formatDurationInMillisecs(long) - Static method in class de.fu_berlin.ties.util.Util
Formats a duration.
formatDurationInSeconds(double) - Static method in class de.fu_berlin.ties.util.Util
Formats a duration.

G

get(K) - Method in class de.fu_berlin.ties.util.MultiValueMap
Returns the collection of values to which this map maps the specified key.
getAcceptedElements() - Method in class de.fu_berlin.ties.filter.FilteringTokenWalker
Returns the set of elements that have been accepted by the filter in the current document.
getAccuracy() - Method in class de.fu_berlin.ties.eval.Accuracy
Calculates and returns the accuracy: A = true / (true + false).
getAccuracy() - Method in interface de.fu_berlin.ties.eval.AccuracyView
Calculates and returns the accuracy: A = true / (true + false).
getActiveClasses() - Method in class de.fu_berlin.ties.extract.ExtractorBase
Returns the set of candidate classes to consider for the current element for each classifier.
getAllChildren() - Method in class de.fu_berlin.ties.context.ElementPosition
Returns the number of all children elements in the parent element.
getAllClasses() - Method in class de.fu_berlin.ties.classify.TrainableClassifier
Returns the set of all valid classes.
getAncestorNumber() - Method in class de.fu_berlin.ties.context.DefaultRepresentation
Returns the maximum number of ancestors to include in the context representation.
getAttribute(String) - Method in class de.fu_berlin.ties.io.FieldContainer
Returns the value of an attribute.
getAverageContexts() - Method in class de.fu_berlin.ties.eval.FeatureCount
Calculates and returns the average number of context representations in a document.
getAverageContexts() - Method in interface de.fu_berlin.ties.eval.FeatureCountView
Calculates and returns the average number of context representations in a document.
getAverageFeatures() - Method in class de.fu_berlin.ties.eval.FeatureCount
Calculates and returns the average number of non-comment features in a context representation.
getAverageFeatures() - Method in interface de.fu_berlin.ties.eval.FeatureCountView
Calculates and returns the average number of non-comment features in a context representation.
getAverageUniqueFeatures() - Method in class de.fu_berlin.ties.eval.FeatureCount
Calculates and returns the average number of unique non-comment features in a context representation.
getAverageUniqueFeatures() - Method in interface de.fu_berlin.ties.eval.FeatureCountView
Calculates and returns the average number of unique non-comment features in a context representation.
getBackgroundClass() - Method in class de.fu_berlin.ties.classify.MultiBinaryClassifier
Returns the "background" class of this classifier.
getBaseClassNames() - Method in class de.fu_berlin.ties.extract.TargetStructure
Returns the set of names of the target classes at the top of the inheritance hierarchy.
getBaseName(File) - Static method in class de.fu_berlin.ties.io.IOUtils
Returns the base name of a file (the local name without IOUtils.getExtension(File) and preceding dot).
getBaseName(URL) - Static method in class de.fu_berlin.ties.io.IOUtils
Returns the base name of an URL (the local name without IOUtils.getExtension(File) and preceding dot).
getBaseName() - Method in class de.fu_berlin.ties.util.TaskRunner
Returns the base name of worker threads.
getBaseTargetClasses() - Method in class de.fu_berlin.ties.extract.TargetStructure
Returns the TargetClasses at the top of the inheritance hierarchy.
getCachedFeatures() - Method in class de.fu_berlin.ties.context.PriorRecognitions.Pair
Returns the list of features representing the recognition, initially null.
getCandidateNumber() - Method in class de.fu_berlin.ties.util.AdaptableLRUMap
Returns the number of candidates considered for each pruning operation.
getCharacters() - Method in class de.fu_berlin.ties.eval.FeatureCount
Returns the number of characters counted so far.
getCharacters() - Method in interface de.fu_berlin.ties.eval.FeatureCountView
Returns the number of characters counted so far.
getCharactersPerContext() - Method in class de.fu_berlin.ties.eval.FeatureCount
Calculates and returns the average number of characters in a context representation.
getCharactersPerContext() - Method in interface de.fu_berlin.ties.eval.FeatureCountView
Calculates and returns the average number of characters in a context representation.
getCharactersPerFeature() - Method in class de.fu_berlin.ties.eval.FeatureCount
Calculates and returns the average number of characters in a feature.
getCharactersPerFeature() - Method in interface de.fu_berlin.ties.eval.FeatureCountView
Calculates and returns the average number of characters in a feature.
getCharset() - Method in class de.fu_berlin.ties.io.ContentType
Returns the character set.
getClassifiers() - Method in class de.fu_berlin.ties.extract.ExtractorBase
Returns the array of classifiers used for the local classification decisions.
getClassNames() - Method in class de.fu_berlin.ties.extract.TargetStructure
Returns the set of names of the defined target classes.
getComment() - Method in class de.fu_berlin.ties.classify.feature.Feature
Returns the comment attached to this feature, if any.
getComment() - Method in class de.fu_berlin.ties.context.GlobalFeature
Returns the comment attached to this feature, if any.
getConfig() - Method in class de.fu_berlin.ties.classify.TrainableClassifier
Returns the configuration used by this instance.
getConfig() - Method in class de.fu_berlin.ties.ConfigurableProcessor
Returns the configuration used by this instance.
getConfig() - Method in class de.fu_berlin.ties.context.sensor.BaseSensor
Returns the configuration used by this instance.
getContext() - Method in class de.fu_berlin.ties.extract.amend.ContextDetails
Returns a feature vector representing the context of the token.
getContexts() - Method in class de.fu_berlin.ties.eval.FeatureCount
Returns the number of representations evaluated so far.
getContexts() - Method in interface de.fu_berlin.ties.eval.FeatureCountView
Returns the number of representations evaluated so far.
getCount(String) - Method in class de.fu_berlin.ties.text.TokenContainer
Returns the cardinality of the given token in this container.
getCount(String) - Method in class de.fu_berlin.ties.text.TokenCounter
Returns the cardinality of the given token in this container.
getCurrentExtraction() - Method in class de.fu_berlin.ties.extract.ExtractionLocator
Returns the current extraction.
getDefaultAttributes() - Method in class de.fu_berlin.ties.context.DefaultRepresentation
Returns the unmodifiable set of names of default attributes.
getDefaultDirectory() - Static method in class de.fu_berlin.ties.io.IOUtils
Returns an default directory that can be given as "parent" when constructing file objects.
getDefaultPriority() - Method in class de.fu_berlin.ties.util.TaskRunner
Returns the default priority to use for threads.
getDefaultProperties() - Static method in class de.fu_berlin.ties.util.VelocityService
Returns properties listing the changes to the Velocity configuration made by default.
getDemotion() - Method in class de.fu_berlin.ties.classify.winnow.Winnow
Returns the promotion factor used by the algorithm.
getDescription() - Method in class de.fu_berlin.ties.TiesConfiguration.EntryDescriptor
Returns a textual description of the entry.
getDescriptor(String) - Method in class de.fu_berlin.ties.TiesConfiguration
Returns the descriptor for a given key, if any is given in the descriptor configuration.
getDetailedRecognitions() - Method in class de.fu_berlin.ties.context.DefaultRepresentation
Returns the number of preceding recognitions to represent in detail.
getDocuments() - Method in class de.fu_berlin.ties.eval.FeatureCount
Returns the number of documents counted so far.
getDocuments() - Method in interface de.fu_berlin.ties.eval.FeatureCountView
Returns the number of documents counted so far.
getElement(int) - Method in class de.fu_berlin.ties.TiesConfiguration.EntryDescriptor
Reads the value of this property as a string list and returns the element at the specified position.
getElementName() - Method in class de.fu_berlin.ties.context.LocalFeature
Returns the name of the element this feature is attached to.
getElementsFrom(int) - Method in class de.fu_berlin.ties.TiesConfiguration.EntryDescriptor
Reads the value of this property as a string list and returns a sublist starting at the specified position.
getEvalStatus() - Method in class de.fu_berlin.ties.classify.Prediction
Returns the evaluation status of this instance.
getEvaluated(Integer) - Method in class de.fu_berlin.ties.extract.TrainEval.Results
Returns one of the stored evaluated extraction containers.
getEvaluated(int) - Method in class de.fu_berlin.ties.extract.TrainEval.Results
Returns one of the stored evaluated extraction containers.
getExpectedState() - Method in class de.fu_berlin.ties.extract.amend.TrainingContextDetails
Returns the expected (true) CombinationState for this instance.
getExtension(File) - Static method in class de.fu_berlin.ties.io.IOUtils
Returns the extension of a file.
getExtension(URL) - Static method in class de.fu_berlin.ties.io.IOUtils
Returns the file extension of an URL.
getExtensions() - Method in class de.fu_berlin.ties.io.ExtensionFilter
The immutable set of extensions (Strings) accepted by this filter.
getExtractFeatureCV() - Method in class de.fu_berlin.ties.extract.TrainEval.Results
Returns a read-only view on the feature count statistics collected during training.
getF1Measure() - Method in class de.fu_berlin.ties.eval.FMetrics
Returns the F-measure, setting alpha = 0.5 so P and R are weighted equal ("F1 measure").
getF1Measure() - Method in interface de.fu_berlin.ties.eval.FMetricsView
Returns the F-measure, setting alpha = 0.5 so P and R are weighted equal ("F1 measure").
getFactory() - Method in class de.fu_berlin.ties.extract.ExtractorBase
Returns the factory used to instantiate tokenizers.
getFalseCount() - Method in class de.fu_berlin.ties.eval.Accuracy
Returns the number of false decisions.
getFalseCount() - Method in interface de.fu_berlin.ties.eval.AccuracyView
Returns the number of false decisions.
getFalseNeg() - Method in interface de.fu_berlin.ties.eval.EvalInput
Returns the number of false negatives (false rejections).
getFalseNeg() - Method in class de.fu_berlin.ties.eval.FMetrics
Returns the number of false negatives (false rejections).
getFalsePos() - Method in interface de.fu_berlin.ties.eval.EvalInput
Returns the number of false positives (false acceptances).
getFalsePos() - Method in class de.fu_berlin.ties.eval.FMetrics
Returns the number of false positives (false acceptances).
getFeatureCount() - Method in class de.fu_berlin.ties.extract.ExtractorBase
Returns the object used to count documents, contexts, and features and to calculate averages.
getFeatures() - Method in class de.fu_berlin.ties.extract.ExtractorBase
Returns vector of features representing the currently processed element.
getFeatureSum() - Method in class de.fu_berlin.ties.eval.FeatureCount
Returns the number of non-comment features encountered so far.
getFeatureSum() - Method in interface de.fu_berlin.ties.eval.FeatureCountView
Returns the number of non-comment features encountered so far.
getFilter() - Method in class de.fu_berlin.ties.filter.FilteringTokenWalker
Returns the element filter used by this instance.
getFirstTokenInLastIndex() - Method in class de.fu_berlin.ties.text.TokenContainer
Returns the index of the first token of the last added string in the original text (indexing starts with 0).
getFirstTokenInLastRep() - Method in class de.fu_berlin.ties.text.TokenContainer
Returns the repetition of the first token of the last added string in the original text (counting starts with 0, as the first occurrence is the "0th repetition").
getFirstTokenRep() - Method in class de.fu_berlin.ties.extract.Extraction
Returns the repetition of the first token of the extraction in the original text (counting starts with 0, as the first occurrence is the "0th repetition"), -1 if unknown or if Extraction.isFirstTokenRepIgnored() is true.
getFMeasure(double) - Method in class de.fu_berlin.ties.eval.FMetrics
Returns the F-measure: F = 1 / (alpha * (1/P) + (1-alpha) * (1/R)).
getFMeasure(double) - Method in interface de.fu_berlin.ties.eval.FMetricsView
Returns the F-measure: F = 1 / (alpha * (1/P) + (1-alpha) * (1/R)).
getFullRepresentation() - Method in class de.fu_berlin.ties.classify.feature.Feature
Prints a full representation of this feature that contains both representation (if any) and comment (if any).
getFullRepresentation(String) - Method in class de.fu_berlin.ties.classify.feature.Feature
Prints a full representation of this feature that contains both representation (if any) and comment (if any).
getHeadAttribute() - Method in class de.fu_berlin.ties.context.DefaultRepresentation
Returns the name of the element to use for calculating head values.
getHeadElement() - Method in class de.fu_berlin.ties.context.DefaultRepresentation
Returns the name of the attribute to use for calculating head values.
getIndex() - Method in class de.fu_berlin.ties.extract.Extraction
Returns the index of the first token in the text (indexing starts with 0); or -1 if unknown/irrelevant.
getIndex() - Method in class de.fu_berlin.ties.text.TokenDetails
Returns the index of the token in the original text (indexing starts with 0).
getKey() - Method in class de.fu_berlin.ties.TiesConfiguration.EntryDescriptor
Returns the key (name) of this entry.
getLast() - Method in class de.fu_berlin.ties.text.TokenContainer
Returns a trimmed and whitespace-normalized representation of the string added this container by the last TokenContainer.add(String) operation.
getLast() - Method in class de.fu_berlin.ties.text.TokenCounter
Returns the last added token.
getLastCount(String) - Method in class de.fu_berlin.ties.text.TokenContainer
Returns the cardinality of the given token in the text added by the last TokenContainer.add(String) operation.
getLastRep() - Method in class de.fu_berlin.ties.text.TokenCounter
Returns the repetition of the last added token in the original text (counting starts with 0, as the first occurrence is the "0th repetition").
getLength() - Method in class de.fu_berlin.ties.classify.feature.OSBTransformer
Returns the maximum number of original features joined.
getLength() - Method in class de.fu_berlin.ties.classify.feature.SBPHTransformer
Returns the maximum number of original features joined.
getLocalFeature() - Method in class de.fu_berlin.ties.context.GlobalFeature
Returns the local feature wrapped by this global feature.
getLocalName(URL, boolean) - Static method in class de.fu_berlin.ties.io.IOUtils
Returns the local name of an URL.
getMark() - Method in class de.fu_berlin.ties.context.FeatureType
Returns the mark used to introduce features of this type.
getMarkupSeriesNo() - Method in class de.fu_berlin.ties.xml.TagConstituent
Returns the number of the markup series this tag is part of.
getMimeType() - Method in class de.fu_berlin.ties.io.ContentType
Returns the MIME type.
getName() - Method in class de.fu_berlin.ties.context.LocalFeature
Returns the name of this feature.
getName() - Method in class de.fu_berlin.ties.eval.EvalStatus
Returns the name of this instance.
getName() - Method in class de.fu_berlin.ties.extract.TargetClass
Returns the name of this class.
getName() - Method in class de.fu_berlin.ties.xml.TagConstituent
Returns the name of this tag.
getNames() - Method in class de.fu_berlin.ties.xml.dom.ElementNameFilter
Returns the set of element names accepted by this filter.
getNestedContainer(String) - Method in class de.fu_berlin.ties.io.FieldContainer
Returns a nested subcontainer managed by this instance.
getNormalizedWhitespace() - Method in class de.fu_berlin.ties.text.TextTokenizer
Returns the normalized whitespace representation prepended if TextTokenizer.isNormalizedWhitespacePrepended() is true.
getOutFileExt() - Method in class de.fu_berlin.ties.TextProcessor
Returns the extension used for output files.
getOverallPosition() - Method in class de.fu_berlin.ties.context.ElementPosition
Returns the position of the represented element in the parent element, counting all children.
getPR() - Method in class de.fu_berlin.ties.classify.Probability
Returns the optional pR: pR = log(p / (1-p)); or Double.NaN if not known/not relevant.
getPrecedingTransformer() - Method in class de.fu_berlin.ties.classify.feature.FeatureTransformer
Returns the preceding transformer used if this transformer is part of a chain.
getPrecision() - Method in class de.fu_berlin.ties.eval.FMetrics
Returns the precision: P = tp / (tp + fp).
getPrecision() - Method in interface de.fu_berlin.ties.eval.FMetricsView
Returns the precision: P = tp / (tp + fp).
getPredictedExtractions() - Method in class de.fu_berlin.ties.extract.Extractor
Returns the extraction container used for storing the predicted extractions.
getPredictedState() - Method in class de.fu_berlin.ties.extract.amend.TrainingContextDetails
Returns the CombinationState predicted by the current classification model for this token if there was an classification error; or null if the expected state was correctly predicted.
getPriorRecognitions() - Method in class de.fu_berlin.ties.extract.ExtractorBase
Returns the buffer of preceding Recognitions from the current document.
getProb() - Method in class de.fu_berlin.ties.classify.Probability
Returns the actual probability; will be in value in the range from 0 to 1; or -1 if the actual value is unknown/not applicable.
getProbability() - Method in class de.fu_berlin.ties.classify.Prediction
Calculates and returns the average probability.
getProcessedFollowing() - Method in class de.fu_berlin.ties.context.ElementPosition
Returns the number of preceding siblings that were processed by the context representation.
getProcessedPreceding() - Method in class de.fu_berlin.ties.context.ElementPosition
Returns the number of following siblings that were processed by the context representation.
getPromotion() - Method in class de.fu_berlin.ties.classify.winnow.Winnow
Returns the demotion factor used by the algorithm.
getPropertyDirect(String) - Method in class de.fu_berlin.ties.TiesConfiguration
Read a property from this configuration.
getPruneNumber() - Method in class de.fu_berlin.ties.util.AdaptableLRUMap
Returns the number of elements removed by each pruning operation.
getRawScore() - Method in class de.fu_berlin.ties.classify.winnow.WinnowPrediction
Returns the raw score (activation value) of this prediction.
getRawThreshold() - Method in class de.fu_berlin.ties.classify.winnow.WinnowDistribution
Returns the raw threshold used by the Winnow classifier for this instance.
getRecall() - Method in class de.fu_berlin.ties.eval.FMetrics
Returns the recall: R = tp / (tp + fn).
getRecall() - Method in interface de.fu_berlin.ties.eval.FMetricsView
Returns the recall: R = tp / (tp + fn).
getRecognition() - Method in class de.fu_berlin.ties.context.PriorRecognitions.Pair
Returns the recognition wrapped in this pair.
getRecognitionNumber() - Method in class de.fu_berlin.ties.context.Representation
Returns the number of preceding recognitions to represent.
getRejectedElements() - Method in class de.fu_berlin.ties.filter.FilteringTokenWalker
Returns the set of elements that have been rejected by the filter in the current document.
getRep() - Method in class de.fu_berlin.ties.text.TokenDetails
Returns the repetition of the token in the original text (counting starts with 0, as the first occurrence is the "0th repetition").
getRepresentantion() - Method in class de.fu_berlin.ties.xml.XMLConstituent
Returns the representation of this tag within the XML document.
getRepresentation() - Method in class de.fu_berlin.ties.classify.feature.DefaultFeature
Returns the representation of this feature, to be used for classification.
getRepresentation() - Method in class de.fu_berlin.ties.classify.feature.Feature
Abstract method for returning the representation of this feature, to be used for classification.
getRepresentation() - Method in class de.fu_berlin.ties.context.GlobalFeature
Returns the representation of this feature, to be used for classification.
getRepresentation() - Method in class de.fu_berlin.ties.context.LocalFeature
Returns the representation of this feature, to be used for classification.
getRepresentation() - Method in class de.fu_berlin.ties.extract.ExtractorBase
Returns the context representation used for local classifications.
getSentenceFilter() - Method in class de.fu_berlin.ties.extract.ExtractorBase
Returns the filter used in the first step of a double classification approach ("sentence filtering").
getSeparator() - Method in class de.fu_berlin.ties.classify.feature.OSBTransformer
Returns the string used to separate original features (by default a space character).
getSeparator() - Method in class de.fu_berlin.ties.classify.feature.SBPHTransformer
Returns the string used to separate original features (by default a space character).
getSiblingNumber() - Method in class de.fu_berlin.ties.context.DefaultRepresentation
Returns the basic number of preceding and following siblings to include in the context representation.
getSigmoidScore() - Method in class de.fu_berlin.ties.classify.winnow.WinnowPrediction
Returns the sigmoid score (activation value) of this prediction.
getSource() - Method in class de.fu_berlin.ties.classify.Prediction
Returns the identifier of the source of this prediction (e.g., the file name).
getSplitMaximum() - Method in class de.fu_berlin.ties.context.AbstractRepresentation
Returns the maximum number of subsequences to keep when a feature value must be split (at whitespace).
getStoreN() - Method in class de.fu_berlin.ties.context.AbstractRepresentation
Each storeN-th context representation is stored for debugging and inspection purposes (if > 0, otherwise no representation is stored).
getStrategy() - Method in class de.fu_berlin.ties.extract.ExtractorBase
Returns the combination strategy used.
getStrength() - Method in class de.fu_berlin.ties.classify.feature.DefaultFeature
Returns a strength value for this feature.
getStrength() - Method in class de.fu_berlin.ties.classify.feature.Feature
Returns a strength value for this feature.
getStrengthType() - Method in class de.fu_berlin.ties.classify.feature.FeatureSet
Returns the type of method used to consider feature frequencies when determining strength values.
getStringArray(String) - Method in class de.fu_berlin.ties.TiesConfiguration
Get an array of strings associated with the given configuration key.
getSummedStrength() - Method in class de.fu_berlin.ties.classify.feature.FeatureVector
Returns a sum of the strength values of all features contained in this vector.
getTagMap() - Method in class de.fu_berlin.ties.xml.TagContainer
Grants subclasses direct access to the map that is used internally for managing the tags.
getTargetClasses() - Method in class de.fu_berlin.ties.extract.TargetStructure
Returns the defined TargetClasses.
getTargetStructure() - Method in class de.fu_berlin.ties.extract.AnswerBuilder
Returns the target structure specifying the classes to recognize.
getTargetStructure() - Method in class de.fu_berlin.ties.extract.ExtractionContainer
Returns the target structure specifying the classes to recognize.
getTargetStructure() - Method in class de.fu_berlin.ties.extract.ExtractorBase
Returns the target structure specifying the classes to recognize.
getTestSplit() - Method in class de.fu_berlin.ties.extract.TrainEval
Returns the percentage of a corpus to use for testing (evaluation).
getText() - Method in interface de.fu_berlin.ties.context.Recognition
Returns the recognized text fragment.
getText() - Method in class de.fu_berlin.ties.extract.Extraction
Returns the extracted text fragment.
getThreshold() - Method in class de.fu_berlin.ties.classify.winnow.WinnowDistribution
Returns the threshold used by the Winnow classifier for this instance.
getThresholdThickness() - Method in class de.fu_berlin.ties.classify.winnow.Winnow
Returns the thickness of the threshold if the "thick threshold" heuristic is used.
getToken() - Method in class de.fu_berlin.ties.text.TokenDetails
Returns the token itself.
getTrainFeatureCV() - Method in class de.fu_berlin.ties.extract.TrainEval.Results
Returns a read-only view on the feature count statistics collected during extraction.
getTrainSplit() - Method in class de.fu_berlin.ties.extract.TrainEval
Returns the percentage of a corpus to use for training; the remaining documents (1-x) are used for evaluation.
getTrueCount() - Method in class de.fu_berlin.ties.eval.Accuracy
Returns the number of true decisions.
getTrueCount() - Method in interface de.fu_berlin.ties.eval.AccuracyView
Returns the number of true decisions.
getTruePos() - Method in interface de.fu_berlin.ties.eval.EvalInput
Returns the number of true positives (correct recognitions).
getTruePos() - Method in class de.fu_berlin.ties.eval.FMetrics
Returns the number of true positives (correct recognitions).
getType() - Method in class de.fu_berlin.ties.classify.Prediction
Returns the predicted type (class).
getType() - Method in class de.fu_berlin.ties.combi.CombinationState
Returns the type (class) of the current instance; or null if we're outside of any instance (CombinationState.OUTSIDE).
getType() - Method in class de.fu_berlin.ties.context.LocalFeature
Returns the type of this feature.
getType() - Method in interface de.fu_berlin.ties.context.Recognition
Returns the type (class) of the recognition.
getType() - Method in class de.fu_berlin.ties.TiesConfiguration.EntryDescriptor
Returns the base type of the entry (String, Integer etc.).
getType() - Method in class de.fu_berlin.ties.xml.XMLConstituent
Returns the type of this constituent.
getTypedChildren() - Method in class de.fu_berlin.ties.context.ElementPosition
Returns the number of children elements in the parent element with the type (qualified name) of the represented element.
getTypedPosition() - Method in class de.fu_berlin.ties.context.ElementPosition
Returns the position of the represented element in the parent element, counting only children with the same type (qualified name).
getUniqueFeatureSum() - Method in class de.fu_berlin.ties.eval.FeatureCount
Returns the number of non-comment non-duplicate features encountered so far.
getUniqueFeatureSum() - Method in interface de.fu_berlin.ties.eval.FeatureCountView
Returns the number of non-comment non-duplicate features encountered so far.
getValidClasses() - Method in class de.fu_berlin.ties.combi.CombinationStrategy
Returns the set of valid class names.
getValue() - Method in class de.fu_berlin.ties.context.LocalFeature
Returns the value wrapped by this feature.
getValue() - Method in class de.fu_berlin.ties.TiesConfiguration.EntryDescriptor
Returns the value of this property (as an object).
getValue() - Method in class de.fu_berlin.ties.util.MutableInt
Returns the wrapped int value.
getVariety() - Method in class de.fu_berlin.ties.xml.TagConstituent
Returns the TagVariety of this tag.
getVisibleChars() - Method in class de.fu_berlin.ties.extract.Extraction
Returns the visible characters of the text fragment (everything except whitespace and control characters).
getWalker() - Method in class de.fu_berlin.ties.extract.ExtractorBase
Returns the token walker used to walk thru documents.
getWeights(Integer) - Method in class de.fu_berlin.ties.classify.winnow.WinnowStore
Returns the weights of a feature.
getWrappedComparator() - Method in class de.fu_berlin.ties.util.InvertedComparator
Returns the comparator wrapped and inverted by this instance.
GlobalFeature - Class in de.fu_berlin.ties.context
An immutable representation of a feature that can be used for classification.
GlobalFeature(String, LocalFeature) - Constructor for class de.fu_berlin.ties.context.GlobalFeature
Creates a new instance that combines a local feature with an axis prefix.
GlobalFeature(String) - Constructor for class de.fu_berlin.ties.context.GlobalFeature
Creates a new instance that contains only a comment.
globalize(String, Iterator<LocalFeature>, LinkedList<Feature>, boolean) - Static method in class de.fu_berlin.ties.context.GlobalFeature
Converts a series of LocalFeatures into global features, adding the created global features to a linked list.
GOAL_TEMPLATE - Static variable in class de.fu_berlin.ties.util.VelocityService
Name of the default template used to print goals.

H

handleAccept(Element, Element, boolean) - Method in class de.fu_berlin.ties.filter.FilteringTokenWalker
This method can be overwritten by subclasses to modify decisions of the element filter.
handleAccept(Element, Element, boolean) - Method in class de.fu_berlin.ties.filter.TrainableFilteringTokenWalker
This method can be overwritten by subclasses to modify decisions of the element filter. This implementation relies on the oracle to make the final decision and joins the predicted decision and the correct decision via OR.
handleAncestors(Element, int, int, LinkedList<Feature>, LinkedList<Feature>, Bag, Map<Element, List<LocalFeature>>) - Method in class de.fu_berlin.ties.context.DefaultRepresentation
Handles ancestors and ancestor siblings of an element.
handleEndTag(TagConstituent, OpenTags, UnprocessedTags) - Method in class de.fu_berlin.ties.xml.XMLAdjuster
Helper method for handling an end tag.
handleEOF(XMLConstituent, OpenTags, UnprocessedTags, boolean) - Method in class de.fu_berlin.ties.xml.XMLAdjuster
Helper method for handling an the end of a file.
handleSiblings(String, Element, int, LinkedList<Feature>, LinkedList<Feature>, Map<Element, List<LocalFeature>>) - Method in class de.fu_berlin.ties.context.DefaultRepresentation
Adds the preceding and following siblings of an element.
hashCode() - Method in class de.fu_berlin.ties.classify.feature.Feature
Returns a hash code value for this object, fulfulling the Object.hashCode() contract.
hashCode() - Method in class de.fu_berlin.ties.classify.Prediction
Returns a hash code value for this object, fulfulling the Object.hashCode() contract.
hashCode() - Method in class de.fu_berlin.ties.combi.CombinationState
Returns a hash code value for this object, fulfulling the Object.hashCode() contract.
hashCode() - Method in class de.fu_berlin.ties.extract.Extraction
Returns a hash code value for this object, fulfulling the Object.hashCode() contract.
hashCode() - Method in class de.fu_berlin.ties.extract.TargetClass
Returns a hash code value for this object, fulfulling the Object.hashCode() contract.
hashCode() - Method in class de.fu_berlin.ties.extract.TargetStructure
Returns a hash code value for this object, fulfulling the Object.hashCode() contract.
hashCode() - Method in class de.fu_berlin.ties.util.MultiValueMap
Returns the hash code value for this map.
hasNext() - Method in class de.fu_berlin.ties.util.ListEntry
Returns true if there is a next element.
hasPrecedingWhitespace() - Method in class de.fu_berlin.ties.text.TextTokenizer
Whether the token returned by the last call to TextTokenizer.nextToken() is preceded by whitespace (i.e., text not matched by any token).
hasPrevious() - Method in class de.fu_berlin.ties.util.ListEntry
Returns true if there is a previous element.

I

IGNORED - Static variable in class de.fu_berlin.ties.eval.EvalStatus
Constant for predictions that were ignored.
incFalseCount() - Method in class de.fu_berlin.ties.eval.Accuracy
Increases the number of false decisions by one.
incFalseNeg() - Method in class de.fu_berlin.ties.eval.FMetrics
Increases the number of false negatives by 1.
incFalseNeg(String) - Method in class de.fu_berlin.ties.eval.MultiFMetrics
Increases the number of false negatives for the given type by 1.
incFalsePos() - Method in class de.fu_berlin.ties.eval.FMetrics
Increases the number of false positives by 1.
incFalsePos(String) - Method in class de.fu_berlin.ties.eval.MultiFMetrics
Increases the number of false positives for the given type by 1.
incTrueCount() - Method in class de.fu_berlin.ties.eval.Accuracy
Increases the number of true decisions by one.
incTruePos() - Method in class de.fu_berlin.ties.eval.FMetrics
Increases the number of true positives by 1.
incTruePos(String) - Method in class de.fu_berlin.ties.eval.MultiFMetrics
Increases the number of true positives for the given type by 1.
inExtraction() - Method in class de.fu_berlin.ties.extract.ExtractionLocator
Whether we are currently within an extraction.
init(Document) - Method in interface de.fu_berlin.ties.filter.ElementFilter
Resets the filter to start processing a new document.
init(Document) - Method in class de.fu_berlin.ties.filter.RepresentationFilter
Resets the filter to start processing a new document. This method must be called for each document to be procesed by this filter prior to calling ElementFilter.matches(Element)or ElementFilter.prefers(Element)on elements of this document.
init(Document) - Method in class de.fu_berlin.ties.filter.TrainableFilter
Resets the filter to start processing a new document.
initExtractor(Trainer) - Method in class de.fu_berlin.ties.extract.TrainEval
Creates and initializes a extractor to use for an evaluation run, re-using the components of the provided trainer.
initFields() - Method in class de.fu_berlin.ties.extract.ExtractorBase
Initializes the fields used for processing a document (feature cache, buffer of prior recognitions, token walker, and statistics) and resets the combination strategy.
initialWhitespaceCount(String) - Method in class de.fu_berlin.ties.text.TextTokenizer
Convenience method that counts the number of whitespace characters at the begin of a string, according to the defined whitespace pattern.
initScores() - Method in class de.fu_berlin.ties.classify.winnow.Winnow
Initializes the score (activation values) to use for all classes.
initTrainer(File) - Method in class de.fu_berlin.ties.extract.TrainEval
Creates and initializes a trainer to use for an evaluation run, configured from the stored configuration.
initWeight() - Method in class de.fu_berlin.ties.classify.winnow.Winnow
Returns the initial weight to use for each feature per class.
initWeightArray() - Method in class de.fu_berlin.ties.classify.winnow.Winnow
Returns the initial weight array to use for a feature for all classes.
insertAfter(ListEntry) - Method in class de.fu_berlin.ties.util.ListEntry
Inserts a new element after the current one, adjusting the list structure accordingly.
insertBefore(ListEntry) - Method in class de.fu_berlin.ties.util.ListEntry
Inserts a new element before the current one, adjusting the list structure accordingly.
InsideOutsideStrategy - Class in de.fu_berlin.ties.combi
A combination strategy using inside/outside tagging.
InsideOutsideStrategy(Set<String>) - Constructor for class de.fu_berlin.ties.combi.InsideOutsideStrategy
Creates a new instance, using "O/I/B" mode: the "B-" is only used where required for disambiguation, otherwise the "I-" prefix is used.
InsideOutsideStrategy(Set<String>, boolean) - Constructor for class de.fu_berlin.ties.combi.InsideOutsideStrategy
Creates a new instance, using "O/I/B" mode (setting InsideOutsideStrategy.isBStartingAll() to false).
InvertedComparator<T> - Class in de.fu_berlin.ties.util
Compares objects, inverting the order of a wrapped comparator.
InvertedComparator(Comparator<T>) - Constructor for class de.fu_berlin.ties.util.InvertedComparator
Creates a new instance, inverting the order returned by the provided comparator.
invoke(Runnable, String) - Method in class de.fu_berlin.ties.util.TaskRunner
Invokes a task to be executed asynchronously.
invoke(Runnable, String, int) - Method in class de.fu_berlin.ties.util.TaskRunner
Invokes a task to be executed asynchronously.
invokeDefault(Runnable, String) - Static method in class de.fu_berlin.ties.util.TaskRunner
Invokes a task to be executed asynchronously, using the default task runner.
invokeDefault(Runnable, String, int) - Static method in class de.fu_berlin.ties.util.TaskRunner
Invokes a task to be executed asynchronously using the default task runner.
IOUtils - Class in de.fu_berlin.ties.io
A static class that provides utility constants and methods for I/O.
isAnEmptiableTag(String) - Method in class de.fu_berlin.ties.xml.XMLAdjuster
Whether the specified tag is one of the tags that can be converted an empty tags when required for fixing a document.
isAnswerState() - Method in class de.fu_berlin.ties.eval.EvalStatus
Whether this state is suitable for an answer key.
isBalanced() - Method in class de.fu_berlin.ties.classify.winnow.Winnow
Whether the Balanced Winnow or the standard Winnow algorithm is used.
isBegin() - Method in class de.fu_berlin.ties.combi.CombinationState
Returns whether this is the begin of an instance.
isBStartingAll() - Method in class de.fu_berlin.ties.combi.InsideOutsideStrategy
Whether the "B-" prefix is used for the first item of each instance ("O/B/I" mode) or only for the first item of instances immediately following an instance of the same class ("O/I/B" mode).
isCalculatingSummaries() - Method in class de.fu_berlin.ties.eval.MultiFMetrics
Whether summaries are calculated by this type.
isCaseIgnored() - Method in class de.fu_berlin.ties.io.ExtensionFilter
Whether case is ignored when matching the extension.
isClosed() - Method in class de.fu_berlin.ties.util.TaskRunner
Returns whether this task runner has been closed.
isDeletingControlChars() - Method in class de.fu_berlin.ties.xml.XMLAdjuster
Whether control characters are deleted (these characters are not allowed in XML 1.0 and discouraged in XML 1.1).
isDeletingPseudoTags() - Method in class de.fu_berlin.ties.xml.XMLAdjuster
Whether "pseudo-tags" are deleted, i.e., sequences that cannot be parsed as tags but look similar to them.
isDiscardPreceding() - Method in class de.fu_berlin.ties.combi.CombinationState
Signals that the extraction in progress during the preceding combination state should be discarded.
isEmpty() - Method in class de.fu_berlin.ties.util.MultiValueMap
Returns true if this map contains no key-value mappings.
isEmpty() - Method in class de.fu_berlin.ties.xml.TagContainer
Whether this tag container is empty.
isEnd() - Method in class de.fu_berlin.ties.combi.CombinationState
Returns whether this is the end of an instance.
isEscapingPseudoEntities() - Method in class de.fu_berlin.ties.xml.XMLAdjuster
Whether to escape "&" starting a possible nonstandard entity reference ("&" at the start of one of the 5 predefined entity references or a character reference is never escaped, all other "&" are always escaped).
isFirstTokenRepIgnored() - Method in class de.fu_berlin.ties.extract.Extraction
Whether the repetition of the first token should be ignored, comparing only the text but not the position of extractions.
isList() - Method in class de.fu_berlin.ties.TiesConfiguration.EntryDescriptor
Whether the entry is a list (can contain multiple values).
isMatchingAll() - Method in class de.fu_berlin.ties.extract.EvaluatedExtractionContainer
Returns the match mode.
isMatchingPosition() - Method in class de.fu_berlin.ties.extract.EvaluatedExtractionContainer
If true, the positions of extraction and answer keys must match; otherwise only their contents must match (string compare).
isNormalizedWhitespacePrepended() - Method in class de.fu_berlin.ties.text.TextTokenizer
Returns whether whitespace is prepended in a normalized form (TextTokenizer.getNormalizedWhitespace()) to those tokens where TextTokenizer.hasPrecedingWhitespace() would return true.
isolateTags(Reader, Writer) - Method in class de.fu_berlin.ties.xml.TagIsolator
"Isolates" XML tags and textual contents in an XML document by printing each XML tag on a single line.
isolateTags(Reader, Writer, Map) - Method in class de.fu_berlin.ties.xml.TagIsolator
"Isolates" XML tags and textual contents in an XML document by printing each XML tag on a single line.
isOptional() - Method in class de.fu_berlin.ties.TiesConfiguration.EntryDescriptor
Whether the entry is optional.
isPredictionState() - Method in class de.fu_berlin.ties.eval.EvalStatus
Whether this state is suitable for a prediction.
isPreserving() - Method in class de.fu_berlin.ties.classify.feature.OSBTransformer
Whether original features are preserved as well in addition to the generated joint features.
isRelevant(String) - Method in class de.fu_berlin.ties.extract.ExtractorBase
Checks whether a token is relevant for training and extraction.
isRoot(TagConstituent) - Method in class de.fu_berlin.ties.xml.OpenTags
Checks whether a tag is the root tag (the first tag inserted into this container).
isSealed() - Method in interface de.fu_berlin.ties.context.Recognition
Whether this recognition has been sealed.
isSealed() - Method in class de.fu_berlin.ties.extract.Extraction
Whether this extraction has been sealed.
isSentenceFiltering() - Method in class de.fu_berlin.ties.extract.ExtractorBase
Whether this instance uses sentence filtering (classification of relevant versus irrelevant sentences in a double classification approach).
isTestingOnly() - Method in class de.fu_berlin.ties.extract.Trainer
If true the trainer only ensures that all answer keys exist and can be located in the document instead of doing any training.
isTrainingEnabled() - Method in class de.fu_berlin.ties.filter.TrainableFilteringTokenWalker
Returns true if training the embedded filter is enabled (default).
isTrainingOnlyErrors() - Method in class de.fu_berlin.ties.extract.Trainer
Whether to train only errors (TOE mode, recommmended) or to train all instances (brute-force mode).
isValidWhitespace(String) - Method in class de.fu_berlin.ties.text.TextTokenizer
Convenience method that checks whether a string matches the defined whitespace pattern.
isWhitespaceAfterLast() - Method in class de.fu_berlin.ties.text.TokenContainer
Whether there is whitespace after the last added string.
isWhitespaceAfterLast() - Method in class de.fu_berlin.ties.text.TokenCounter
Whether there is whitespace after the last added token.
isWhitespaceBefore() - Method in class de.fu_berlin.ties.text.TokenDetails
Whether there is whitespace before the token.
isWhitespaceBeforeLast() - Method in class de.fu_berlin.ties.text.TokenContainer
Whether there is whitespace before the last added string.
isWhitespaceBeforeLast() - Method in class de.fu_berlin.ties.text.TokenCounter
Whether there is whitespace before the last added token.
isWhitespacePatternEnsured() - Method in class de.fu_berlin.ties.text.TextTokenizer
Whether whitespace (the text between patterns) is checked to ensure that the defined whitespace pattern is matched.
iterations() - Method in class de.fu_berlin.ties.extract.TrainEval.Results
Returns an iterator over the TUNE iterations for which extraction results are stored.
iterator() - Method in class de.fu_berlin.ties.classify.feature.FeatureSet
Returns an iterator over the Features stored in this vector.
iterator() - Method in class de.fu_berlin.ties.classify.feature.FeatureVector
Returns an iterator over the Features stored in this vector.
iterator() - Method in class de.fu_berlin.ties.classify.PredictionDistribution
Returns an iterator over the predictions in this distribution, in descending order (most probably class comes first).
iterator() - Method in class de.fu_berlin.ties.context.PriorRecognitions
Returns an iterator over the elements stored in this instance.
iterator() - Method in class de.fu_berlin.ties.extract.ExtractionContainer
Returns an iterator over all extractions in insertion order.
iterator(String) - Method in class de.fu_berlin.ties.extract.ExtractionContainer
Returns an iterator over the extractions of a specified class, in insertion order.

J

JAVA_CLASS_ATTRIBUTE - Static variable in class de.fu_berlin.ties.xml.io.ObjectElement
The name of the attribute used to store the Java class of an object: "java".
javaClass() - Method in class de.fu_berlin.ties.xml.io.ObjectElement
Returns the Class of the stored object.
javaClassName() - Method in class de.fu_berlin.ties.xml.io.ObjectElement
Returns the name of the Class of the stored object.
joinAlternatives(String[]) - Static method in class de.fu_berlin.ties.text.TextUtils
Helper method for building a regular expression Pattern by combining several alternatives.
joinKey(String, String) - Static method in class de.fu_berlin.ties.TiesConfiguration
Creates a full key by joining a prefix and a suffix string, separated by the property delimiter.

K

KEY_ACCURACY - Static variable in class de.fu_berlin.ties.eval.Accuracy
Serialization key for the accuracy.
KEY_ANSWERS - Static variable in class de.fu_berlin.ties.extract.AnswerBuilder
Context key referring to the extraction container used for storing the answer keys.
KEY_AVERAGE_CONTEXTS - Static variable in class de.fu_berlin.ties.eval.FeatureCount
Serialization key for the average number of context representations in a document.
KEY_AVERAGE_FEATURES - Static variable in class de.fu_berlin.ties.eval.FeatureCount
Serialization key for the average number of features in a context representation.
KEY_AVERAGE_UNIQUE_FEATURES - Static variable in class de.fu_berlin.ties.eval.FeatureCount
Serialization key for the average number of unique features in a context representation.
KEY_CHARS - Static variable in class de.fu_berlin.ties.eval.FeatureCount
Serialization key for characters.
KEY_CHARS_PER_CONTEXT - Static variable in class de.fu_berlin.ties.eval.FeatureCount
Serialization key for the average number of characters in a context representation.
KEY_CHARS_PER_FEATURE - Static variable in class de.fu_berlin.ties.eval.FeatureCount
Serialization key for the average number of characters in a feature.
KEY_CLASS - Static variable in class de.fu_berlin.ties.classify.ClassTrain
Serialization key for the correct class.
KEY_CLASSIFICATION - Static variable in class de.fu_berlin.ties.classify.ClassTrain
Serialization key for the result of the classification: either ClassTrain.CORRECT_CLASS if the correct class was predicted or the wrongly predicted class in case of an error.
KEY_CONTEXTS - Static variable in class de.fu_berlin.ties.eval.FeatureCount
Serialization key for context representations.
KEY_DIRECTORY - Static variable in class de.fu_berlin.ties.TextProcessor
Context key referring to the directory of the processed document, if it is a local file.
KEY_DOCUMENTS - Static variable in class de.fu_berlin.ties.eval.FeatureCount
Serialization key for documents.
KEY_EVAL_STATUS - Static variable in class de.fu_berlin.ties.eval.EvalStatus
Serialization key recommended for instances of this class.
KEY_F1_MEASURE - Static variable in class de.fu_berlin.ties.eval.FMetrics
Serialization key for the F1 measure.
KEY_FALSE_COUNT - Static variable in class de.fu_berlin.ties.eval.Accuracy
Serialization key for the number of false items.
KEY_FALSE_NEG - Static variable in class de.fu_berlin.ties.eval.FMetrics
Serialization key for false negatives.
KEY_FALSE_POS - Static variable in class de.fu_berlin.ties.eval.FMetrics
Serialization key for false positives.
KEY_FEATURES - Static variable in class de.fu_berlin.ties.eval.FeatureCount
Serialization key for features.
KEY_FILE - Static variable in class de.fu_berlin.ties.classify.ClassTrain
Serialization key for the name of the file to classify.
KEY_FIRST_TOKEN_REP - Static variable in class de.fu_berlin.ties.extract.Extraction
Serialization key for the repetition of the first token.
KEY_INDEX - Static variable in class de.fu_berlin.ties.extract.Extraction
Serialization key for the index.
KEY_ITERATION - Static variable in class de.fu_berlin.ties.extract.TrainEval
Serialization key for the number of the iteration (when TUNE training).
KEY_LOCAL_CHARSET - Static variable in class de.fu_berlin.ties.io.IOUtils
Configuration and context key: the character set to use when reading and writing local files.
KEY_LOCAL_NAME - Static variable in class de.fu_berlin.ties.TextProcessor
Context key referring to the local name of the processed document.
KEY_MIME_TYPE - Static variable in class de.fu_berlin.ties.io.ContentType
Context key recommended to store the MIME type string.
KEY_OUT_DIRECTORY - Static variable in class de.fu_berlin.ties.TextProcessor
Context key referring output directory; if missing, the value of TextProcessor.KEY_DIRECTORY is used instead.
KEY_PR - Static variable in class de.fu_berlin.ties.classify.Prediction
Serialization key for the pR.
KEY_PRECISION - Static variable in class de.fu_berlin.ties.eval.FMetrics
Serialization key for the precision.
KEY_PROB - Static variable in class de.fu_berlin.ties.classify.Prediction
Serialization key for the probability.
KEY_RAW - Static variable in class de.fu_berlin.ties.classify.winnow.WinnowPrediction
Serialization key for the predicted class.
KEY_RECALL - Static variable in class de.fu_berlin.ties.eval.FMetrics
Serialization key for the recall.
KEY_RUN - Static variable in class de.fu_berlin.ties.extract.TrainEval
Serialization key for the number of the run.
KEY_SIGMOID - Static variable in class de.fu_berlin.ties.classify.winnow.WinnowPrediction
Serialization key for the source identifier.
KEY_SOURCE - Static variable in class de.fu_berlin.ties.classify.Prediction
Serialization key for the source identifier.
KEY_TEXT - Static variable in class de.fu_berlin.ties.extract.Extraction
Serialization key for the extracted text.
KEY_TRUE_COUNT - Static variable in class de.fu_berlin.ties.eval.Accuracy
Serialization key for the number of true items.
KEY_TRUE_POS - Static variable in class de.fu_berlin.ties.eval.FMetrics
Serialization key for true positives.
KEY_TYPE - Static variable in class de.fu_berlin.ties.classify.Prediction
Serialization key for the predicted class.
KEY_TYPE - Static variable in class de.fu_berlin.ties.eval.MultiFMetrics
Serialization key for the type.
KEY_TYPE - Static variable in class de.fu_berlin.ties.extract.TrainEval
Serialization key for the type (either "Train" or "Eval").
KEY_UNIQUE_FEATURES - Static variable in class de.fu_berlin.ties.eval.FeatureCount
Serialization key for unique features.
KEY_URL - Static variable in class de.fu_berlin.ties.TextProcessor
Context key referring to the URL of the processed document, if loaded from an URL.
keyCount() - Method in class de.fu_berlin.ties.io.FieldContainer
Returns the number of keys in this container.
keyIterator() - Method in class de.fu_berlin.ties.io.FieldContainer
Returns an iterator over the set of all keys used in contained field maps.
keySet() - Method in class de.fu_berlin.ties.util.MultiValueMap
Returns a set view of the keys contained in this map.

L

last() - Method in class de.fu_berlin.ties.extract.ExtractionContainer
Returns the last extraction added to this container.
lastContains(String) - Method in class de.fu_berlin.ties.text.TokenContainer
Whether the text added by the last TokenContainer.add(String) operation contains the specified token.
lastIterator() - Method in class de.fu_berlin.ties.text.TokenContainer
Returns an iterator over the word and number tokens added by the last TokenContainer.add(String) operation.
lastN(int) - Method in class de.fu_berlin.ties.extract.ExtractionContainer
Returns a list of the last n extractions added to this container.
lastN(String, int) - Method in class de.fu_berlin.ties.extract.ExtractionContainer
Returns a list of the last n extractions of a specified class added to this container.
lastN(List<? extends T>, int) - Static method in class de.fu_berlin.ties.util.CollectionUtils
Copied the last n elements from a list into a new list (or all elements, if the size of the input list is smaller or equal to n).
LAX_START_OR_EMPTY_TAG - Static variable in class de.fu_berlin.ties.xml.XMLAdjuster
Pattern specifying of a "lax" XML start or empty tag that can contain unquoted (invalid) attributes (combined into a single pattern to avoid unnecessary backtracking).
leftText() - Method in class de.fu_berlin.ties.text.TextTokenizer
Returns the complete text to the left (preceding) the token returned by the last call to TextTokenizer.nextToken().
LINE_SEPARATOR - Static variable in class de.fu_berlin.ties.text.TextUtils
The line separator on the current operating system ("\n" on Unix).
LINE_SEPARATOR_CLASS - Static variable in class de.fu_berlin.ties.xml.TagIsolator
The character class of line separators.
ListEntry - Class in de.fu_berlin.ties.util
An entry in a doubly linked list.
ListEntry() - Constructor for class de.fu_berlin.ties.util.ListEntry
Creates a new instance, without setting references to surrounding entries.
ListEntry(ListEntry, ListEntry) - Constructor for class de.fu_berlin.ties.util.ListEntry
Creates a new instance.
listIsEmpty(List) - Static method in class de.fu_berlin.ties.TiesConfiguration
Utility method that thorougly checks whether a list is empty.
ListSensor - Class in de.fu_berlin.ties.context.sensor
A list sensor uses a one or several gazetteers to look up semantic information.
ListSensor(TiesConfiguration) - Constructor for class de.fu_berlin.ties.context.sensor.ListSensor
Creates a new instance.
load(String) - Method in class de.fu_berlin.ties.TiesConfiguration
Loads configuration in PropertiesConfiguration or XML format.
LocalFeature - Class in de.fu_berlin.ties.context
An immutable representation of the local part of a feature, useful to cache and re-use the parts of feature representation that do not depend on the relative position of the element to classify.
LocalFeature(String, FeatureType, String, String, String) - Constructor for class de.fu_berlin.ties.context.LocalFeature
Non-public constructor.
localizeKey(String) - Method in class de.fu_berlin.ties.TiesConfiguration
Creates a localized (language-specific) key by joining a base name with the configured language suffix (value mapped to the TiesConfiguration.CONFIG_LANG} key (if this key doesn't exist, the language of default locale used by the Java Virtual Machine is used).
LOG - Static variable in class de.fu_berlin.ties.util.Util
The logger used in the TIE system.
logEvent(String, String) - Method in class de.fu_berlin.ties.xml.XMLAdjuster
Logs the occurance of an event necessary for fixing a document.
logEvent(String, TagConstituent) - Method in class de.fu_berlin.ties.xml.XMLAdjuster
Logs the occurance of an event necessary for fixing a document.
lookup(String) - Method in class de.fu_berlin.ties.context.sensor.ListSensor
Looks up a token and returns the information as an array of key/value pairs (values can be null, but null are not allowed).
lookup(String) - Method in interface de.fu_berlin.ties.context.sensor.Sensor
Looks up a token and returns the information as an array of key/value pairs (values can be null, but null are not allowed).
lookupType(String) - Method in class de.fu_berlin.ties.eval.MultiFMetrics
Looks up and if necessary initializes the metrics for a given type.

M

Main - Class in de.fu_berlin.ties
A static class that provides a main external entry point to TIE.
main(String[]) - Static method in class de.fu_berlin.ties.Main
Main method: execute the given goal (first argument) on the given arguments.
main(String[]) - Static method in class de.fu_berlin.ties.util.VelocityService
Main methods: prints the list of config parameters and goals from the standard configuration to specified files (or standard out).
majorThreshold(float, float) - Method in class de.fu_berlin.ties.classify.winnow.Winnow
Calculates the major theshold (theta+) to use for classification with the "thick threshold" heuristic.
MARKER - Static variable in class de.fu_berlin.ties.context.FeatureType
Constant for features marking a special position or situation.
MARKER_START - Static variable in class de.fu_berlin.ties.context.LocalFeature
Constant for a marker feature that marks the start of an axis.
markRelevant(String) - Method in class de.fu_berlin.ties.extract.ExtractorBase
Marks a punctuation token as relevant for classification ((because it did occur as the first or last token of an extraction).
MARKUP_DECL - Static variable in class de.fu_berlin.ties.xml.XMLTokenizerFactory
Pattern string specifying an markup declaration within a doctype declaration.
matches(Element) - Method in interface de.fu_berlin.ties.filter.ElementFilter
Decides whether an element is accepted by this filter.
matches(Element) - Method in class de.fu_berlin.ties.filter.RepresentationFilter
Decides whether an element is accepted by this filter.
matches(Element) - Method in class de.fu_berlin.ties.filter.TrainableFilter
Decides whether an element is accepted by this filter.
matches(Node) - Method in class de.fu_berlin.ties.xml.dom.ElementNameFilter
Tests whether a node is accected by this filter.
MathUtils - Class in de.fu_berlin.ties.util
A static class that provides some mathematical utility methods.
maxSize() - Method in class de.fu_berlin.ties.classify.winnow.WinnowStore
Returns the maximum number of features that can be stored by this instance.
mean(double[]) - Static method in class de.fu_berlin.ties.util.MathUtils
Returns the arithmetic mean of an array of values.
META_CLASSIFIER - Static variable in class de.fu_berlin.ties.classify.TrainableClassifier
Flag used to load the MetaClassifier.
MetaClassifier - Class in de.fu_berlin.ties.classify
A meta classifier combines several layers of classifiers.
MetaClassifier(Set<String>, FeatureTransformer, File, String[], TiesConfiguration) - Constructor for class de.fu_berlin.ties.classify.MetaClassifier
Creates a new instance.
MetaClassifier(Set<String>, FeatureTransformer, File, String[], int, String[], TiesConfiguration) - Constructor for class de.fu_berlin.ties.classify.MetaClassifier
Creates a new instance.
MIME_HTML - Static variable in class de.fu_berlin.ties.io.ContentType
MIME type for HTML.
MIME_PLAIN - Static variable in class de.fu_berlin.ties.io.ContentType
MIME type for plain text.
minorThreshold(float, float) - Method in class de.fu_berlin.ties.classify.winnow.Winnow
Calculates the minor theshold (theta-) to use for classification with the "thick threshold" heuristic.
MISSING - Static variable in class de.fu_berlin.ties.eval.EvalStatus
Constant for missing answer keys (false negatives).
modifyProperty(String, boolean) - Method in class de.fu_berlin.ties.TiesConfiguration
Modifies a configuration property, parsing a key[=value] pair.
MULTI_CLASSIFIER - Static variable in class de.fu_berlin.ties.classify.TrainableClassifier
Flag used to load the MultiBinaryClassifier.
MultiBinaryClassifier - Class in de.fu_berlin.ties.classify
This classifier converts an multi-class classification task into a several binary (two-class) classification task.
MultiBinaryClassifier(Set<String>, FeatureTransformer, File, String[], TiesConfiguration) - Constructor for class de.fu_berlin.ties.classify.MultiBinaryClassifier
Creates a new instance.
MultiFMetrics - Class in de.fu_berlin.ties.eval
Instances of this class manage multiple FMetrics for different types.
MultiFMetrics() - Constructor for class de.fu_berlin.ties.eval.MultiFMetrics
Creates a new instance, without calculating summaries.
MultiFMetrics(boolean) - Constructor for class de.fu_berlin.ties.eval.MultiFMetrics
Creates a new instance.
MultiFMetricsView - Interface in de.fu_berlin.ties.eval
A read-only view of multiple FMetrics and the sums and averages calculated over them.
multipleReplaceAll(CharSequence, Map) - Static method in class de.fu_berlin.ties.text.TextUtils
Performs multiple replace-all operations on a text.
MultiValueMap<K,V> - Class in de.fu_berlin.ties.util
A MultiValueMap allows storing multiple values for each key.
MultiValueMap() - Constructor for class de.fu_berlin.ties.util.MultiValueMap
Creates a new instance, using a HashMap as storage.
MultiValueMap(Map<K, Collection<V>>) - Constructor for class de.fu_berlin.ties.util.MultiValueMap
Creates a new instance.
MutableInt - Class in de.fu_berlin.ties.util
Small wrapper class for an int whose value can be changed.
MutableInt() - Constructor for class de.fu_berlin.ties.util.MutableInt
Creates a new instance, setting the initial value to 0.
MutableInt(int) - Constructor for class de.fu_berlin.ties.util.MutableInt
Creates a new instance.

N

NAME - Static variable in class de.fu_berlin.ties.Main
The name of this software.
name(Attribute) - Static method in class de.fu_berlin.ties.xml.dom.DOMUtils
Static method that returns a String representing the name of an attribute in an XML document.
name(Element) - Static method in class de.fu_berlin.ties.xml.dom.DOMUtils
Static method that returns a String representing the name of an element in an XML document.
NAN - Static variable in class de.fu_berlin.ties.util.Util
String recognized as Not-a-Number when parsing floating-point numbers: "NaN" (ignoring case).
nestedCount() - Method in class de.fu_berlin.ties.io.FieldContainer
Returns the number of nested containers managed by container.
nestedIterator() - Method in class de.fu_berlin.ties.io.FieldContainer
Returns an iterator over the names of nested containers managed by this container.
NEWLINE_ALTERNATIVES - Static variable in class de.fu_berlin.ties.text.TextUtils
Regex fragment listing the newline alternatives used by differents systems: "\r\n" (Windows), "\n" (Unix) or "\r" (Mac).
NEWLINE_PATTERN - Static variable in class de.fu_berlin.ties.text.TextUtils
A regular expression matching a single newlines (build by enclosing TextUtils.NEWLINE_ALTERNATIVES in a non-capturing group).
NEWLINES_PATTERN - Static variable in class de.fu_berlin.ties.text.TextUtils
A regular expression matching newlines, including surrounding whitespace.
next() - Method in class de.fu_berlin.ties.util.ListEntry
Returns the next element in the list, if any.
nextConstituent() - Method in class de.fu_berlin.ties.xml.XMLConstituent
Returns the next constituent in the list, if any.
nextToken() - Method in class de.fu_berlin.ties.text.TextTokenizer
Returns the next token, or null if there are no more tokens left in the provided text.
normalize(String) - Static method in class de.fu_berlin.ties.text.TextUtils
Normalizes the whitespace in a string, replacing all internal whitespace sequences with a single space character and trimming any leading and trailing whitespace.

O

OAR_CLASSIFIER - Static variable in class de.fu_berlin.ties.classify.TrainableClassifier
Flag used to load the OneAgainstTheRestClassifier.
ObjectElement - Class in de.fu_berlin.ties.xml.io
An XML element that is targeted at storing an Java object.
ObjectElement(String, Class) - Constructor for class de.fu_berlin.ties.xml.io.ObjectElement
Creates a new instance.
ObjectElement(QName, Class) - Constructor for class de.fu_berlin.ties.xml.io.ObjectElement
Creates a new instance.
ObjectElement(String, Namespace, Class) - Constructor for class de.fu_berlin.ties.xml.io.ObjectElement
Creates a new instance.
OneAgainstTheRestClassifier - Class in de.fu_berlin.ties.classify
This classifier converts an multi-class classification task into a several binary (two-class) classification task.
OneAgainstTheRestClassifier(Set<String>, FeatureTransformer, File, String[], TiesConfiguration) - Constructor for class de.fu_berlin.ties.classify.OneAgainstTheRestClassifier
Creates a new instance.
OPEN - Static variable in class de.fu_berlin.ties.context.LocalFeature
Character opening a quoted value.
OpenCloseStrategy - Class in de.fu_berlin.ties.combi
A combination strategy using open/close tagging (also called "BIE" tagging due to the prefixed used).
OpenCloseStrategy(Set<String>) - Constructor for class de.fu_berlin.ties.combi.OpenCloseStrategy
Creates a new instance.
openCompressableInStream(InputStream) - Static method in class de.fu_berlin.ties.io.IOUtils
Opens an input stream that might have been compressed in gzip format.
openCompressableOutStream(OutputStream, boolean) - Static method in class de.fu_berlin.ties.io.IOUtils
Transparently opens an input stream that may use compression to store the data (in gzip format).
openCompressableOutStream(OutputStream, TiesConfiguration) - Static method in class de.fu_berlin.ties.io.IOUtils
Transparently opens an input stream that may use compression to store the data (in gzip format).
openReader(File, Configuration) - Static method in class de.fu_berlin.ties.io.IOUtils
Opens an reader on a local file.
openReader(File, String) - Static method in class de.fu_berlin.ties.io.IOUtils
Opens an reader on a local file, using a given charset.
OpenTags - Class in de.fu_berlin.ties.xml
A container used to manage the currently open start tags during XML adjustment.
OpenTags() - Constructor for class de.fu_berlin.ties.xml.OpenTags
Creates a new instance.
openUnicodeReader(InputStream) - Static method in class de.fu_berlin.ties.io.IOUtils
Opens a reader on an input stream that uses a Unicode character set (UTF-8, UTF-16, or UTF-32) and optionally a BOM (byte order mark) to identify the used charset.
openUnicodeWriter(OutputStream) - Static method in class de.fu_berlin.ties.io.IOUtils
Opens a writer that uses the standard Unicode character set UTF-8.
openWriter(File, Configuration) - Static method in class de.fu_berlin.ties.io.IOUtils
Opens an writer on a local file.
openWriter(File, String) - Static method in class de.fu_berlin.ties.io.IOUtils
Opens an writer on a local file, using a given charset.
Oracle - Interface in de.fu_berlin.ties.filter
An oracle decides whether ElementFilters should match elements.
OSBTransformer - Class in de.fu_berlin.ties.classify.feature
Transforms a feature vector using the orthogonal sparse bigrams (OSB) technique developed by Fidelis Assis.
OSBTransformer(FeatureTransformer, int, String, boolean, float[], float) - Constructor for class de.fu_berlin.ties.classify.feature.OSBTransformer
Creates a new instance.
OSBTransformer(FeatureTransformer, TiesConfiguration) - Constructor for class de.fu_berlin.ties.classify.feature.OSBTransformer
Creates a new instance.
OtherConstituent - Class in de.fu_berlin.ties.xml
A constituent in an XML document that is not a tag.
OtherConstituent(short, String) - Constructor for class de.fu_berlin.ties.xml.OtherConstituent
Creates a new instance, without setting a reference to a next constituent.
OtherConstituent(short, String, XMLConstituent, XMLConstituent) - Constructor for class de.fu_berlin.ties.xml.OtherConstituent
Creates a new instance.
OUTER_WHITESPACE - Static variable in class de.fu_berlin.ties.xml.OtherConstituent
Type constant: Whitespace between elements.
OUTSIDE - Static variable in class de.fu_berlin.ties.combi.CombinationState
Combination state to be used for outside/other (type is set to null).

P

parse(String) - Static method in class de.fu_berlin.ties.eval.EvalStatus
Parses a textual representation into an eval status, without using a default object.
parse(String, boolean) - Static method in class de.fu_berlin.ties.eval.EvalStatus
Parses a textual representation into an eval status.
ParsingException - Exception in de.fu_berlin.ties
Exception thrown when an input cannot be parsed, typically because it contains uncorrectable errors.
ParsingException(String) - Constructor for exception de.fu_berlin.ties.ParsingException
Constructs a new exception with the specified detail message.
ParsingException(Throwable) - Constructor for exception de.fu_berlin.ties.ParsingException
Constructs a new exception with the specified cause.
ParsingException(String, Throwable) - Constructor for exception de.fu_berlin.ties.ParsingException
Constructs a new exception with the specified detail message and cause.
PE_REFERENCE - Static variable in class de.fu_berlin.ties.xml.XMLTokenizerFactory
Pattern string specifying a PE reference within a doctype declaration.
peek() - Method in class de.fu_berlin.ties.xml.OpenTags
Returns the last tag appearance that was pushed into this container.
peekFirst() - Method in class de.fu_berlin.ties.xml.OpenTags
Returns the first tag appearance that was initially pushed into this container.
PI - Static variable in class de.fu_berlin.ties.xml.OtherConstituent
Type constant: a processing instruction.
pop() - Method in class de.fu_berlin.ties.xml.OpenTags
Removes and returns the last tag appearance that was pushed into this container.
popAndRegularize() - Method in class de.fu_berlin.ties.xml.OpenTags
Removes and returns the last tag appearance that was pushed into this container.
preAddHook(Feature) - Method in class de.fu_berlin.ties.classify.feature.FeatureSet
Implementation of the hook provided by the superclass to ensure that the summed strength is updated correctly.
preAddHook(Feature) - Method in class de.fu_berlin.ties.classify.feature.FeatureVector
Empty method that can by implemented by child classes, for example if they calculate strength values in some special way.
precedingWhitespace() - Method in class de.fu_berlin.ties.text.TextTokenizer
Returns the whitespace (i.e., text not matched by any token) preceding the token returned by the last call to TextTokenizer.nextToken().
precedingWhitespaceIsValid() - Method in class de.fu_berlin.ties.text.TextTokenizer
Checks whether the whitespace (i.e., text not matched by any token) preceding the token returned by the last call to TextTokenizer.nextToken() matches the defined whitespace pattern.
Prediction - Class in de.fu_berlin.ties.classify
A prediction, wrapping the predicted class and the probability of the prediction.
Prediction(FieldMap) - Constructor for class de.fu_berlin.ties.classify.Prediction
Creates a new instance from a field map, fulfilling the Storable contract.
Prediction(String, Probability) - Constructor for class de.fu_berlin.ties.classify.Prediction
Creates a new instance, setting the evaluation status to EvalStatus.UNKNOWN.
Prediction(String, Probability, EvalStatus) - Constructor for class de.fu_berlin.ties.classify.Prediction
Creates a new instance.
Prediction(String, String, Probability, EvalStatus) - Constructor for class de.fu_berlin.ties.classify.Prediction
Creates a new instance.
PredictionComparator - Class in de.fu_berlin.ties.classify
A comparison function that compares Predictions based on their probabilities.
PredictionComparator() - Constructor for class de.fu_berlin.ties.classify.PredictionComparator
Creates a new instance.
PredictionDistribution - Class in de.fu_berlin.ties.classify
A distribution over the classes predicted by a classifier.
PredictionDistribution() - Constructor for class de.fu_berlin.ties.classify.PredictionDistribution
Creates a new instance.
PredictionDistribution(Prediction) - Constructor for class de.fu_berlin.ties.classify.PredictionDistribution
Creates a new instance, adding a prediction.
prefers(Element) - Method in interface de.fu_berlin.ties.filter.ElementFilter
Whether the filter would prefer to filter an element.
prefers(Element) - Method in class de.fu_berlin.ties.filter.TrainableFilter
Whether the filter would prefer to filter an element.
PREFIX_GLOBAL_ACC - Static variable in class de.fu_berlin.ties.extract.Trainer
Prefix used for serializing the global (overall) accuracy.
PREFIX_LOCAL_ACC - Static variable in class de.fu_berlin.ties.extract.Trainer
Prefix used for serializing the local (document-specific) accuracy.
PREFIX_STD_DEVIATION - Static variable in class de.fu_berlin.ties.eval.SummaryFMetrics
Prefix of serialization keys for standard deviations.
PreProcessor - Class in de.fu_berlin.ties.preprocess
Preprocesses documents by converting them a suitable XML format and adding lingustic information.
PreProcessor() - Constructor for class de.fu_berlin.ties.preprocess.PreProcessor
Creates and configured a new instance, using a default extension and the standard configuration.
PreProcessor(String) - Constructor for class de.fu_berlin.ties.preprocess.PreProcessor
Creates and configured a new instance, using the standard configuration.
PreProcessor(String, TiesConfiguration) - Constructor for class de.fu_berlin.ties.preprocess.PreProcessor
Creates and configured a new instance.
previous() - Method in class de.fu_berlin.ties.util.ListEntry
Returns the previous element in the list, if any.
previousConstituent() - Method in class de.fu_berlin.ties.xml.XMLConstituent
Returns the previous constituent in the list, if any.
printConfigProperties(TiesConfiguration, String, String, Map<String, Object>, Writer) - Static method in class de.fu_berlin.ties.util.VelocityService
Prints all properties contained in a configuration, using the specified template for rendering.
printConfigProperties(TiesConfiguration, Writer) - Static method in class de.fu_berlin.ties.util.VelocityService
Prints all properties contained in a configuration, using the default template for printing them.
printGoals(TiesConfiguration, String, String, Map<String, Object>, Writer) - Static method in class de.fu_berlin.ties.util.VelocityService
Prints the goals contained in a configuration, using the default template for printing them.
printGoals(TiesConfiguration, Writer) - Static method in class de.fu_berlin.ties.util.VelocityService
Prints the goals contained in a configuration, using the default template for printing them.
PriorRecognitions - Class in de.fu_berlin.ties.context
A buffer of prior Recognitions that should be considered in the context representation.
PriorRecognitions(int) - Constructor for class de.fu_berlin.ties.context.PriorRecognitions
Creates a new instance.
PriorRecognitions.Pair - Class in de.fu_berlin.ties.context
An inner class wrapping a recognition and optionally a feature vector representing the recognition.
PriorRecognitions.Pair(Recognition) - Constructor for class de.fu_berlin.ties.context.PriorRecognitions.Pair
Creates a new instance.
Probability - Class in de.fu_berlin.ties.classify
Wraps a probability.
Probability(double) - Constructor for class de.fu_berlin.ties.classify.Probability
Creates a new instance, setting the pR to Double.NaN (unknown).
Probability(double, double) - Constructor for class de.fu_berlin.ties.classify.Probability
Creates a new instance.
probCount() - Method in class de.fu_berlin.ties.classify.Prediction
Returns the number of probabilities combined in this prediction.
process(String) - Method in class de.fu_berlin.ties.CollectingProcessor
Processes an input argument.
process(List<String>, ContextMap) - Method in class de.fu_berlin.ties.CollectingProcessor
Processes the collected input arguments.
process(File[], ContextMap) - Method in class de.fu_berlin.ties.DirectoryProcessor
Processes an array of files.
process(File) - Method in class de.fu_berlin.ties.DirectoryProcessor
Processes the contents of a directory.
process(String) - Method in class de.fu_berlin.ties.DirectoryProcessor
Processes an input argument that must specify a directory.
process(Document, ContextMap) - Method in class de.fu_berlin.ties.DocumentProcessor
Processes an XML document.
process(Document, Writer, ContextMap) - Method in class de.fu_berlin.ties.DocumentProcessor
Processes an XML document, writing a modified version to a writer.
process(Document, Writer, ContextMap) - Method in class de.fu_berlin.ties.DocumentReader
Reads an XML document and stores the results of processing in a writer.
process(List<String>, ContextMap) - Method in class de.fu_berlin.ties.eval.ShuffleGenerator
Processes the collected input arguments.
process(Document, Writer, ContextMap) - Method in class de.fu_berlin.ties.extract.AnswerBuilder
Buildings an ExtractionContainer of answer keys from from an annotated XML document.
process(Document, Writer, ContextMap) - Method in class de.fu_berlin.ties.extract.Extractor
Extracts items of interest from the contents of an XML document and serializes the extractions.
process(Document, Writer, ContextMap) - Method in class de.fu_berlin.ties.extract.Trainer
Trains the local classifier with the correct extractions of an XML document, using the provided context representation.
process(String) - Method in interface de.fu_berlin.ties.Processor
Processes an input argument.
process(Reader, Writer, ContextMap) - Method in class de.fu_berlin.ties.TextProcessor
Delegates to the abstract TextProcessor.doProcess(Reader, Writer, ContextMap) method and invokes a post-processor, if configured.
process(File, Writer) - Method in class de.fu_berlin.ties.TextProcessor
Processes the contents of a file, delegating to the TextProcessor.process(File, Writer, ContextMap) method.
process(File, Writer, ContextMap) - Method in class de.fu_berlin.ties.TextProcessor
Processes the contents of a file, delegating to the TextProcessor.process(Reader, Writer, ContextMap) method.
process(String) - Method in class de.fu_berlin.ties.TextProcessor
Processes a file or URL given as input argument, delegating to the appropriate process method.
process(URLConnection, Writer) - Method in class de.fu_berlin.ties.TextProcessor
Processes the contents of an URL connection, delegating to the TextProcessor.process(URLConnection, Writer, ContextMap) method.
process(URLConnection, Writer, ContextMap) - Method in class de.fu_berlin.ties.TextProcessor
Processes the contents of an URL connection, delegating to the TextProcessor.process(Reader, Writer, ContextMap) method.
process(Document, Writer, ContextMap) - Method in class de.fu_berlin.ties.xml.dom.XMLStripper
Strips all markup from an XML document and stores the resulting plain text.
processCollectedText(Element, CharSequence, TokenCounter, TextTokenizer, ContextMap) - Method in class de.fu_berlin.ties.xml.dom.TokenWalker
Helper method that tokenizes the collected textual contents of an element and delegates to the token processor for each of them.
processElement(Element, TokenContainer, ContextMap) - Method in class de.fu_berlin.ties.extract.AnswerBuilder
Classifies an element in an XML document, building features and delegating to the classifier.
processElement(Element, TokenContainer, ContextMap) - Method in interface de.fu_berlin.ties.xml.dom.ElementProcessor
Processes an element in an XML document, optionally modifying the element or the document it is part of.
ProcessingException - Exception in de.fu_berlin.ties
Exception thrown when an error occurs during processing.
ProcessingException(String) - Constructor for exception de.fu_berlin.ties.ProcessingException
Constructs a new exception with the specified detail message.
ProcessingException(Throwable) - Constructor for exception de.fu_berlin.ties.ProcessingException
Constructs a new exception with the specified cause.
ProcessingException(String, Throwable) - Constructor for exception de.fu_berlin.ties.ProcessingException
Constructs a new exception with the specified detail message and cause.
Processor - Interface in de.fu_berlin.ties
Interface for a processor that operates on an input argument.
processToken(Element, String, TokenDetails, String, ContextMap) - Method in class de.fu_berlin.ties.extract.Extractor
Processes a token in an XML element, optionally modifying the element or the document it is part of.
processToken(Element, String, TokenDetails, String, ContextMap) - Method in class de.fu_berlin.ties.extract.Trainer
Processes a token in an XML element, optionally modifying the element or the document it is part of.
processToken(Element, String, TokenDetails, String, ContextMap) - Method in class de.fu_berlin.ties.filter.EmbeddingElements
Processes a token in an XML element, optionally modifying the element or the document it is part of.
processToken(Element, String, TokenDetails, String, ContextMap) - Method in class de.fu_berlin.ties.filter.FilteringTokenWalker
Processes a token in an XML element by delegating to the configured TokenProcessor.
processToken(Element, String, TokenDetails, String, ContextMap) - Method in interface de.fu_berlin.ties.xml.dom.TokenProcessor
Processes a token in an XML element, optionally modifying the element or the document it is part of.
processToken(Element, String, TokenDetails, String, ContextMap) - Method in class de.fu_berlin.ties.xml.dom.TokenWalker
Processes a token in an XML element by delegating to the configured TokenProcessor.
propertyIsEmpty(Object) - Static method in class de.fu_berlin.ties.TiesConfiguration
Utility method that thorougly checks whether a property (as returned by Configuration.getProperty(String) is empty.
Pruner - Interface in de.fu_berlin.ties.util
A pruner decidedes which map entries should be pruned (deleted), choosing among several candidate entries.
PSEUDO_AMP - Static variable in class de.fu_berlin.ties.xml.XMLAdjuster
A "&" that is not the start of an predefined entity reference or a character reference and thus should be escaped if XMLAdjuster.isEscapingPseudoEntities() is true.
punctuation(CharSequence) - Static method in class de.fu_berlin.ties.text.TextUtils
Checks whether a string contains only punctuation characters.
PUNCTUATION_PATTERN - Static variable in class de.fu_berlin.ties.text.TextUtils
A simple regular expression for strings that contain only punctuation characters.
PUNCTUATION_SYMBOL_PATTERN - Static variable in class de.fu_berlin.ties.text.TextUtils
A simple regular expression for strings that contain only punctuation and symbol characters.
punctuationOrSymbol(CharSequence) - Static method in class de.fu_berlin.ties.text.TextUtils
Checks whether a string contains only punctuation and symbol characters.
push(TagConstituent, boolean) - Method in class de.fu_berlin.ties.xml.OpenTags
Inserts an appearance of a tag into the container.
push(TagConstituent) - Method in class de.fu_berlin.ties.xml.TagContainer
Inserts an appearance of a tag into the container.
push(TagConstituent, boolean) - Method in class de.fu_berlin.ties.xml.TagContainer
Inserts an appearance of a tag into the container.
push(TagConstituent, boolean) - Method in class de.fu_berlin.ties.xml.UnprocessedTags
Inserts an appearance of a tag into the container.
put(K, V) - Method in class de.fu_berlin.ties.util.MultiValueMap
Adds the value to the collection associated with the specified key.
putAll(K, Collection<? extends V>) - Method in class de.fu_berlin.ties.util.MultiValueMap
Adds a collection of values to the collection associated with the specified key.
putWeights(Integer, float[]) - Method in class de.fu_berlin.ties.classify.winnow.WinnowStore
Stores new weights for a feature.

Q

quote(String) - Static method in class de.fu_berlin.ties.context.LocalFeature
Quotes a value text (by enclosing it between LocalFeature.OPEN and LocalFeature.CLOSE character).

R

rawConstituents(CharSequence, boolean) - Method in class de.fu_berlin.ties.xml.XMLAdjuster
Returns the raw constituents of an XML-like document.
rawConstituents(CharSequence, boolean, UnprocessedTags) - Method in class de.fu_berlin.ties.xml.XMLAdjuster
Returns the raw constituents of an XML-like document.
rawThreshold(FeatureSet) - Method in class de.fu_berlin.ties.classify.winnow.Winnow
Calculates the theshold (theta) to use for classification, based on the number of active features.
reachedEndOfDocument() - Method in class de.fu_berlin.ties.extract.ExtractionLocator
This method must be called at the end of the current document.
readAnswerKeys(TargetStructure, File, Configuration) - Static method in class de.fu_berlin.ties.extract.AnswerBuilder
Reads back answer keys stored by the AnswerBuilder.process(Document, Writer, ContextMap) method of an instance of this class.
readCorrespondingAnswerKeys(TargetStructure, File, Configuration) - Static method in class de.fu_berlin.ties.extract.AnswerBuilder
Reads the answer keys corresponding to a file.
readDocument(File, Configuration) - Static method in class de.fu_berlin.ties.xml.dom.DOMUtils
Reads an XML document from a local file, using a configured charset.
readDocument(File, String) - Static method in class de.fu_berlin.ties.xml.dom.DOMUtils
Reads an XML document from a local file, using a given charset.
readDocument(InputStream) - Static method in class de.fu_berlin.ties.xml.dom.DOMUtils
Reads an XML document from a given stream.
readDocument(Reader) - Static method in class de.fu_berlin.ties.xml.dom.DOMUtils
Reads an XML document from a given reader.
readDocument(Reader) - Method in interface de.fu_berlin.ties.xml.io.DocumentSerializer
Reads an XML document from a given reader.
readDocument(InputStream) - Method in interface de.fu_berlin.ties.xml.io.DocumentSerializer
Reads an XML document from a given stream.
readToString(Reader) - Static method in class de.fu_berlin.ties.io.IOUtils
Reads the contents of a reader into a string.
readToWriter(Reader, Writer) - Static method in class de.fu_berlin.ties.io.IOUtils
Reads the contents of a reader into a writer.
readURIList(CharSequence) - Static method in class de.fu_berlin.ties.io.IOUtils
Converts an URI list as defined in RFC 2483 (MIME type text/uri-list) into an array of strings.
readURIList(Reader) - Static method in class de.fu_berlin.ties.io.IOUtils
Converts an URI list as defined in RFC 2483 (MIME type text/uri-list) into an array of strings.
Recognition - Interface in de.fu_berlin.ties.context
A recognition has a type (class) and contains text.
recommendedExtension() - Static method in class de.fu_berlin.ties.io.FieldContainer
Returns the file extension recommended for FieldContainers.
ReEvaluator - Class in de.fu_berlin.ties.eval
A processor that can be used to re-evaluate the contents of an EvaluatedExtractionContainer.
ReEvaluator() - Constructor for class de.fu_berlin.ties.eval.ReEvaluator
Creates a new instance, using a default extension and the standard configuration.
ReEvaluator(String) - Constructor for class de.fu_berlin.ties.eval.ReEvaluator
Creates a new instance, using the standard configuration.
ReEvaluator(String, TiesConfiguration) - Constructor for class de.fu_berlin.ties.eval.ReEvaluator
Creates a new instance.
reEvalulate(ExtractionContainer, EvaluatedExtractionContainer) - Method in class de.fu_berlin.ties.eval.ReEvaluator
Re-evaluates the contents of an extraction container.
reEvalulate(Reader) - Method in class de.fu_berlin.ties.eval.ReEvaluator
Re-evaluates the serialized contents of an extraction container, delegating to ReEvaluator.reEvalulate(ExtractionContainer, EvaluatedExtractionContainer).
reextract() - Method in interface de.fu_berlin.ties.extract.amend.FinalReextractor
Re-extract all extractions in a document at the end of the document, considering the complete states of all tokens.
registerInterest() - Static method in class de.fu_berlin.ties.util.TaskRunner
Registers interest to use the default runner.
REGULAR - Static variable in class de.fu_berlin.ties.xml.TagVariety
Constant for the most common tag variety, marking all tags that pre-existed in the input data.
remove(Extraction) - Method in class de.fu_berlin.ties.extract.ExtractionContainer
Removes an extraction from this container, if it is present.
remove() - Method in class de.fu_berlin.ties.util.ListEntry
Removes this entry from this list.
remove(K) - Method in class de.fu_berlin.ties.util.MultiValueMap
Removes all mappings for this key from this map if any are present.
remove(K, V) - Method in class de.fu_berlin.ties.util.MultiValueMap
Removes a specific value from map.
remove(TagConstituent) - Method in class de.fu_berlin.ties.xml.OpenTags
Removes a single appearance of a tag from the container.
remove(TagConstituent) - Method in class de.fu_berlin.ties.xml.TagContainer
Removes a single appearance of a tag from the container.
remove(TagConstituent) - Method in class de.fu_berlin.ties.xml.UnprocessedTags
Removes a single appearance of a tag from the container.
removeExtraMarkers(List) - Method in class de.fu_berlin.ties.context.DefaultRepresentation
Modifies a list of GlobalFeatures to remove extraneous FeatureType.MARKER features.
removeLast() - Method in class de.fu_berlin.ties.context.PriorRecognitions
Removes the most recently inserted recognition from this buffer.
removeLast() - Method in class de.fu_berlin.ties.extract.ExtractionContainer
Removes the last extraction added to this container.
removeLRU(AbstractLinkedMap.LinkEntry) - Method in class de.fu_berlin.ties.util.AdaptableLRUMap
Controls removal of one of the least recently used entries from the map.
removeProb(boolean) - Method in class de.fu_berlin.ties.classify.Prediction
Deletes one of the probabilities used for the prediction.
removeToken(boolean) - Method in class de.fu_berlin.ties.extract.Extraction
Deletes one of the tokens from this prediction.
renderTemplate(String, Map, Writer) - Static method in class de.fu_berlin.ties.util.VelocityService
Renders a template using the given context objects.
renderTemplate(String, String, Map, Writer) - Static method in class de.fu_berlin.ties.util.VelocityService
Renders a template using the given context objects.
replaceAll(String, Matcher, String) - Static method in class de.fu_berlin.ties.text.TextUtils
Replaces each substring of the input matched by the given pattern matcher with the given replacement.
replaceAll(String, Pattern, String) - Static method in class de.fu_berlin.ties.text.TextUtils
Replaces each substring of the input that matches the given Pattern with the given replacement.
Representation - Class in de.fu_berlin.ties.context
Abstract class that manages context representations for entity recognition and information extraction.
Representation(int) - Constructor for class de.fu_berlin.ties.context.Representation
Creates a new instance.
RepresentationFilter - Class in de.fu_berlin.ties.filter
A trainable filter that uses a Representationto convert elements into feature vectors.
RepresentationFilter(TiesConfiguration, NodeFilter, NodeFilter, Reranker, Representation, String) - Constructor for class de.fu_berlin.ties.filter.RepresentationFilter
Creates a new instance.
reproducibleRandom() - Static method in class de.fu_berlin.ties.util.Util
Returns an instance of a pseudo-random number generator that uses a fixed seed, so the same sequence of calls to each object created by this method will yield the same sequence of pseudo-random numbers.
rerank(PredictionDistribution) - Method in class de.fu_berlin.ties.classify.Reranker
Reranks the predictions in a distribution.
Reranker - Class in de.fu_berlin.ties.classify
Reranks the predictions in a distribution by multiplying the probabilities of each of them with a bias, if specified for the type of the prediction.
Reranker(Configuration) - Constructor for class de.fu_berlin.ties.classify.Reranker
Creates a new instance.
Reranker(Map<String, Double>) - Constructor for class de.fu_berlin.ties.classify.Reranker
Creates a new instance.
reset() - Method in class de.fu_berlin.ties.classify.ExternalClassifier
Resets the classifer, completely deleting the prediction model.
reset() - Method in class de.fu_berlin.ties.classify.MetaClassifier
Resets the classifer, completely deleting the prediction model.
reset() - Method in class de.fu_berlin.ties.classify.MultiBinaryClassifier
Resets the classifer, completely deleting the prediction model.
reset() - Method in class de.fu_berlin.ties.classify.OneAgainstTheRestClassifier
Resets the classifer, completely deleting the prediction model.
reset() - Method in class de.fu_berlin.ties.classify.TrainableClassifier
Resets the classifer, completely deleting the prediction model.
reset() - Method in class de.fu_berlin.ties.classify.winnow.Winnow
Resets the classifer, completely deleting the prediction model.
reset() - Method in class de.fu_berlin.ties.classify.winnow.WinnowStore
Resets the store, completely deleting the prediction model.
reset() - Method in class de.fu_berlin.ties.combi.CombinationStrategy
Resets the CombinationStrategy.state() of this instance to the initial value CombinationState.OUTSIDE.
reset() - Method in class de.fu_berlin.ties.extract.Trainer
Resets the internal classifer, completely deleting the prediction model.
reset() - Method in class de.fu_berlin.ties.text.TextTokenizer
Resets this tokenizer, so it will restart at the begin of the current text.
reset(CharSequence) - Method in class de.fu_berlin.ties.text.TextTokenizer
Resets this tokenizer, so it will restart at the begin of the provided text.
resetGlobalAccuracy() - Method in class de.fu_berlin.ties.extract.Trainer
Resets the global (overall) accuracies measured so far by each classifier.
resetHook() - Method in class de.fu_berlin.ties.combi.BeginEndStrategy
Hook called by the CombinationStrategy.reset() method to query whether the last extraction should be discarded, analogously to CombinationState.isDiscardPreceding().
resetHook() - Method in class de.fu_berlin.ties.combi.CombinationStrategy
Hook called by the CombinationStrategy.reset() method to query whether the last extraction should be discarded, analogously to CombinationState.isDiscardPreceding().
resetStrategy() - Method in class de.fu_berlin.ties.extract.Extractor
Reset strategy and discard last prediction extraction if requested.
resetStrategy() - Method in class de.fu_berlin.ties.extract.ExtractorBase
Reset the combination strategy, handling the boolean result value in an appropriate way.
resetStrategy() - Method in class de.fu_berlin.ties.extract.Trainer
Reset the combination strategy, logging a warning if it tells me to discard the last extraction.
RestorableContainer - Interface in de.fu_berlin.ties.io
Classes implementing this interface contain multiple Storable objects and support serialization and deserialization of these objects in a human-readable format.
restoreEntries(FieldContainer) - Method in class de.fu_berlin.ties.eval.MultiFMetrics
Restores items stored in a field container and adds them to this instance.
restoreEntries(FieldContainer) - Method in class de.fu_berlin.ties.extract.ExtractionContainer
Restores extractions stored in a field container and adds them to this instance.
restoreEntries(FieldContainer) - Method in interface de.fu_berlin.ties.io.RestorableContainer
Restores items stored in a field container and adds them to this instance.
rightText() - Method in class de.fu_berlin.ties.text.TextTokenizer
Returns the complete text to the right (following) the token returned by the last call to TextTokenizer.nextToken().

S

save(File) - Method in class de.fu_berlin.ties.TiesConfiguration
Saves the contents of this configuration in a file, storing them in PropertiesConfiguration format.
save(String) - Method in class de.fu_berlin.ties.TiesConfiguration
Saves the contents of this configuration in a file, storing them in PropertiesConfiguration format.
SBPHTransformer - Class in de.fu_berlin.ties.classify.feature
Transforms a feature vector using a simple implementation of the sparse binary polynomial hashing (SBPH) technique introduced by CRM114.
SBPHTransformer(FeatureTransformer, int, String) - Constructor for class de.fu_berlin.ties.classify.feature.SBPHTransformer
Creates a new instance.
SBPHTransformer(FeatureTransformer, TiesConfiguration) - Constructor for class de.fu_berlin.ties.classify.feature.SBPHTransformer
Creates a new instance.
selectFollowingSiblings(Element, LinkedList<Element>, int) - Method in class de.fu_berlin.ties.context.DefaultRepresentation
Selects the siblings to keep among all following siblings.
selectPrecedingSiblings(Element, LinkedList<Element>, int) - Method in class de.fu_berlin.ties.context.DefaultRepresentation
Selects the siblings to keep among all preceding siblings.
Sensor - Interface in de.fu_berlin.ties.context.sensor
A sensor is an object that looks up information for a token, for example semantic information from gazetteers or thesauri.
SEP - Static variable in class de.fu_berlin.ties.context.GlobalFeature
Separator string between axis and element names, used if neither or them is empty.
SEPARATOR - Static variable in class de.fu_berlin.ties.util.CollectionUtils
Default separator used to flatten array if no other separator is specified: " " (a single space).
setAttribute(String, Object) - Method in class de.fu_berlin.ties.io.FieldContainer
Sets name and value of an attribute.
setCachedFeatures(LinkedList<Feature>) - Method in class de.fu_berlin.ties.context.PriorRecognitions.Pair
Sets the list of features representing the recognition.
setDefaultDirectory(File) - Static method in class de.fu_berlin.ties.io.IOUtils
Modifies the default directory that can be given as "parent" when constructing file objects.
setEvalStatus(EvalStatus) - Method in class de.fu_berlin.ties.classify.Prediction
Modifies the evaluation status of this instance.
setFirstTokenRepIgnored(boolean) - Method in class de.fu_berlin.ties.extract.Extraction
Specified whether the repetition of the first token should be ignored, comparing only the text but not the position of extractions.
setMarkupSeriesNo(int) - Method in class de.fu_berlin.ties.xml.TagConstituent
Sets the number of the markup series this tag is part of.
setNormalizedWhitespace(String) - Method in class de.fu_berlin.ties.text.TextTokenizer
Changes the normalized whitespace representation prepended if TextTokenizer.isNormalizedWhitespacePrepended() is true.
setNormalizedWhitespacePrepended(boolean) - Method in class de.fu_berlin.ties.text.TextTokenizer
Changes whether whitespace is prepended in a normalized form (TextTokenizer.getNormalizedWhitespace()) to those tokens where TextTokenizer.hasPrecedingWhitespace() would return true.
setRepresentantion(String) - Method in class de.fu_berlin.ties.xml.XMLConstituent
Sets the representation of this tag within the XML document.
setSealed(boolean) - Method in class de.fu_berlin.ties.extract.Extraction
Seals or unseals this extraction.
setSource(String) - Method in class de.fu_berlin.ties.classify.Prediction
Sets the identifier of the source of this prediction (e.g., the file name).
setSummedStrength(double) - Method in class de.fu_berlin.ties.classify.feature.FeatureVector
Modifies the summed strength of all features contained in this instance.
setValue(int) - Method in class de.fu_berlin.ties.util.MutableInt
Modifies the wrapped int value.
setVariety(TagVariety) - Method in class de.fu_berlin.ties.xml.TagConstituent
Specifies the TagVariety of this tag.
setWhitespacePatternEnsured(boolean) - Method in class de.fu_berlin.ties.text.TextTokenizer
Specifies whether whitespace (the text between patterns) is checked to ensure that the defined whitespace pattern is matched.
shorten(String, int, int) - Static method in class de.fu_berlin.ties.text.TextUtils
Shortens a string, inserting an ellipsis ("...") in the middle if the string is too long.
shorten(String, int) - Static method in class de.fu_berlin.ties.text.TextUtils
Delegates to TextUtils.shorten(String, int, int), using the same number of characters at the start and the end of the shortened string.
shorten(String) - Static method in class de.fu_berlin.ties.text.TextUtils
Delegates to TextUtils.shorten(String, int, int), showing up to 24 characters at the start and the end of the shortened string.
shouldMatch(Element) - Method in class de.fu_berlin.ties.extract.Trainer
Decides whether an element should be accepted by filters.
shouldMatch(Element) - Method in interface de.fu_berlin.ties.filter.Oracle
Decides whether an element should be accepted by filters.
shouldTrain(String, PredictionDistribution, ContextMap) - Method in class de.fu_berlin.ties.classify.ExternalClassifier
This implementation uses reinforcement training, if a thick threshold is configured.
shouldTrain(String, PredictionDistribution, ContextMap) - Method in class de.fu_berlin.ties.classify.TrainableClassifier
Invoked by TrainableClassifier.trainOnError(FeatureVector, String, Set) to decide whether to train an instance.
showDuration(long) - Static method in class de.fu_berlin.ties.util.Util
Calculated and prints the time passed since a given start time (in English).
showElement(Element) - Static method in class de.fu_berlin.ties.xml.dom.DOMUtils
Builds a simple partial representation of an element, containing the name of the element and its normalized and shortened textual content.
showToken(Element, String) - Static method in class de.fu_berlin.ties.xml.dom.DOMUtils
Builds a simple partial representation of a textual token in an element, containing the name of the element and the normalized and shortened text of the token.
shuffle(List<T>) - Static method in class de.fu_berlin.ties.eval.ShuffleGenerator
Static utility method that "shuffles" a list by re-arranging its elements in random order.
ShuffleGenerator - Class in de.fu_berlin.ties.eval
Arranges all input arguments (for example, files or URLs) in random "shuffles", so they can subsequently processed in random (but fixed) order.
ShuffleGenerator() - Constructor for class de.fu_berlin.ties.eval.ShuffleGenerator
Creates a new instance from the standard configuration.
ShuffleGenerator(TiesConfiguration) - Constructor for class de.fu_berlin.ties.eval.ShuffleGenerator
Creates a new instance from the provided configuration.
ShuffleGenerator(int, int, String, String, TiesConfiguration) - Constructor for class de.fu_berlin.ties.eval.ShuffleGenerator
Creates a new instance from the provided configuration.
sigmoid(float, float, float) - Method in class de.fu_berlin.ties.classify.winnow.Winnow
Converts the raw score (activation value) to a value in the range from 0 to 1 via a sigmoid function depending on the threshold theta.
SimpleRepresentation - Class in de.fu_berlin.ties.context
A simple representation of an text in an element in an XML document.
SimpleRepresentation() - Constructor for class de.fu_berlin.ties.context.SimpleRepresentation
Creates a new instance based on the standard configuration.
SimpleRepresentation(TiesConfiguration) - Constructor for class de.fu_berlin.ties.context.SimpleRepresentation
Creates a new instance based on the provided configuration.
SimpleRepresentation(TiesConfiguration, String) - Constructor for class de.fu_berlin.ties.context.SimpleRepresentation
Creates a new instance based on the provided configuration.
SimpleRepresentation(int, int, int, String, TextTokenizer) - Constructor for class de.fu_berlin.ties.context.SimpleRepresentation
Creates a new instance.
SINGLE_LINE_WHITESPACE - Static variable in class de.fu_berlin.ties.xml.TagIsolator
The class of whitespace characters that are not line separators.
SINGLE_LINE_WS - Static variable in class de.fu_berlin.ties.text.TextUtils
A regular expression matching a non-line-breaking whitespace character (character class containing space and tab).
size() - Method in class de.fu_berlin.ties.classify.feature.FeatureSet
Returns the number of features stored in this vector.
size() - Method in class de.fu_berlin.ties.classify.feature.FeatureVector
Returns the number of features stored in this vector.
size() - Method in class de.fu_berlin.ties.classify.PredictionDistribution
Returns the number of predictions (classes) in this distribution.
size() - Method in class de.fu_berlin.ties.classify.winnow.WinnowStore
Returns the number of features currently stored by this instance.
size() - Method in class de.fu_berlin.ties.context.PriorRecognitions
Returns the number of recognitions stored in the buffer.
size() - Method in class de.fu_berlin.ties.io.FieldContainer
Returns the number of entries stored in this container.
size() - Method in class de.fu_berlin.ties.text.TokenContainer
Returns the token number of tokens counted by this instances (including duplicates).
size() - Method in class de.fu_berlin.ties.text.TokenCounter
Returns the token number of tokens counted by this instances (including duplicates).
size() - Method in class de.fu_berlin.ties.util.MultiValueMap
Returns the number of key-value mappings in this map.
size(K) - Method in class de.fu_berlin.ties.util.MultiValueMap
Gets the size of the collection mapped to the specified key.
skip() - Method in class de.fu_berlin.ties.extract.ExtractorBase
This method is called by FilteringTokenWalker whenever some tokens are skipped.
skip() - Method in interface de.fu_berlin.ties.filter.SkipHandler
This method is called by FilteringTokenWalker whenever some tokens are skipped.
SkipHandler - Interface in de.fu_berlin.ties.filter
A handler that is called by FilteringTokenWalker whenever some tokens are skipped.
sortedKeys() - Method in class de.fu_berlin.ties.TiesConfiguration
Returns the list of keys contained in this configuration, sorted in alphabetic order.
sortedKeys(boolean) - Method in class de.fu_berlin.ties.TiesConfiguration
Returns the list of keys contained in this configuration, sorted in alphabetic order.
SortedMultiValueMap<K,V> - Class in de.fu_berlin.ties.util
A MultiValueMap that sorts the values stored for each key, discarding duplicates.
SortedMultiValueMap() - Constructor for class de.fu_berlin.ties.util.SortedMultiValueMap
Creates a new instance, wrapping a TreeMap.
SortedMultiValueMap(Map<K, Collection<V>>) - Constructor for class de.fu_berlin.ties.util.SortedMultiValueMap
Creates a new instance.
sortForPruning(Map.Entry[]) - Method in class de.fu_berlin.ties.classify.winnow.WinnowStore
Sorts an array of map entries that are candidates for being pruned (deleted). This implementation sorts the candidate by the deviation of their weights from the initial weights, so candidates with lower deviation will be pruned first.
sortForPruning(Map.Entry[]) - Method in interface de.fu_berlin.ties.util.Pruner
Sorts an array of map entries that are candidates for being pruned (deleted).
splitLines(CharSequence) - Static method in class de.fu_berlin.ties.text.TextUtils
Splits a text into an array of lines.
splitLinesExact(CharSequence) - Static method in class de.fu_berlin.ties.text.TextUtils
Splits a text into an array of lines, without trimming lines and discarding empty lines.
splitString(String) - Static method in class de.fu_berlin.ties.text.TextUtils
Splits a string around whitespace.
splitString(String, int) - Static method in class de.fu_berlin.ties.text.TextUtils
Splits a string around whitespace.
splitString(String, Pattern, int) - Static method in class de.fu_berlin.ties.text.TextUtils
Splits a string around whitespace.
SPURIOUS - Static variable in class de.fu_berlin.ties.eval.EvalStatus
Constant for spurious predictions (false positives).
SPURIOUS_AMP - Static variable in class de.fu_berlin.ties.xml.XMLAdjuster
A "&" that is not the start of an entity and thus must be escaped.
STANDARD_BLOCK_SIZE - Static variable in class de.fu_berlin.ties.io.IOUtils
The standard block size recommended for I/O.
STANDARD_HTTP_CHARSET - Static variable in class de.fu_berlin.ties.io.IOUtils
The standard character set defined by the HTTP standard.
STANDARD_UNICODE_CHARSET - Static variable in class de.fu_berlin.ties.io.IOUtils
The standard Unicode-compatible character set.
START_TAG - Static variable in class de.fu_berlin.ties.xml.TagConstituent
Type constant: a start tag.
startOfExtraction(String, int) - Method in class de.fu_berlin.ties.extract.ExtractionLocator
Whether the current token starts a new extraction.
startTagCount(String) - Method in class de.fu_berlin.ties.xml.UnprocessedTags
Returns the number of start tags with the specified name contained in this container.
state() - Method in class de.fu_berlin.ties.combi.CombinationStrategy
The current state (state of the last item).
Storable - Interface in de.fu_berlin.ties.io
Classes implementing this interface support serialization in a human-readable format, by storing all relevant fields in a FieldMap.
StorableContainer - Interface in de.fu_berlin.ties.io
Classes implementing this interface contain multiple Storable objects and support serialization of these objects in a human-readable format, by storing them in a FieldContainer.
store() - Method in class de.fu_berlin.ties.classify.feature.DefaultFeatureVector
Returns the collection used for storing the features.
store() - Method in class de.fu_berlin.ties.classify.feature.FeatureSet
Returns the collection used for storing the features.
store() - Method in class de.fu_berlin.ties.classify.feature.FeatureVector
Returns the collection used for storing the features.
store(Writer) - Method in class de.fu_berlin.ties.io.DelimSepValues
Serializes contents as delimiter-separated values.
store(OutputStream) - Method in class de.fu_berlin.ties.io.FieldContainer
Serializes contents by wrapping the stream in a writer with UTF-8 character set and delegating to FieldContainer.store(Writer).
store(Writer) - Method in class de.fu_berlin.ties.io.FieldContainer
Subclasses can overwrite this method to serialize their contents in a class-specific format.
storeEntries(FieldContainer) - Method in class de.fu_berlin.ties.eval.MultiFMetrics
Adds all relevant Storable items in this object to a field container for serialization.
storeEntries(FieldContainer) - Method in class de.fu_berlin.ties.extract.ExtractionContainer
Adds all extractions stored in this instance to a field container for serialization.
storeEntries(FieldContainer) - Method in interface de.fu_berlin.ties.io.StorableContainer
Adds all relevant Storable items in this object to a field container for serialization.
storeFields() - Method in class de.fu_berlin.ties.classify.Prediction
Stores all relevant fields of this object in a field map for serialization.
storeFields() - Method in class de.fu_berlin.ties.classify.winnow.WinnowPrediction
Stores all relevant fields of this object in a field map for serialization.
storeFields() - Method in class de.fu_berlin.ties.eval.Accuracy
Stores all relevant fields of this object in a field map for serialization.
storeFields() - Method in class de.fu_berlin.ties.eval.FeatureCount
Stores all relevant fields of this object in a field map for serialization.
storeFields() - Method in class de.fu_berlin.ties.eval.FMetrics
Stores all relevant fields of this object in a field map for serialization.
storeFields() - Method in class de.fu_berlin.ties.eval.SummaryFMetrics
Stores all relevant fields of this object in a field map for serialization.
storeFields() - Method in class de.fu_berlin.ties.extract.Extraction
Stores all relevant fields of this object in a field map for serialization.
storeFields() - Method in interface de.fu_berlin.ties.io.Storable
Stores all relevant fields of this object in a field map for serialization.
strength(Feature) - Method in class de.fu_berlin.ties.classify.feature.FeatureSet
Returns a strength value for a feature contained in this vector.
strength(Feature) - Method in class de.fu_berlin.ties.classify.feature.FeatureVector
Returns a strength value for a feature contained in this vector.
STRENGTH_CONSTANT - Static variable in class de.fu_berlin.ties.classify.feature.FeatureSet
Constants specifying that feature frequencies are not considered when calculating strength values.
STRENGTH_LINEAR - Static variable in class de.fu_berlin.ties.classify.feature.FeatureSet
Constants specifying that feature frequencies are considered linear (as is) when calculating strength values (a feature occurring twice as frequently will be twice as strong).
STRENGTH_LOG - Static variable in class de.fu_berlin.ties.classify.feature.FeatureSet
Constants specifying that the logarithm of feature frequencies is considered when calculating strength values (1.0 is added to log(f) to avoid the result becoming 0).
STRENGTH_SQUARE_ROOT - Static variable in class de.fu_berlin.ties.classify.feature.FeatureSet
Constants specifying that the square root of feature frequencies is considered when calculating strength values.
subset(String) - Method in class de.fu_berlin.ties.TiesConfiguration
Create an Configuration object that is a subset of this one.
SummaryFMetrics - Class in de.fu_berlin.ties.eval
FMetrics extension that additionally calculates a StatisticalSummary of the intermediate precision, recall, and F1 metrics resulting from different update operations.
SummaryFMetrics() - Constructor for class de.fu_berlin.ties.eval.SummaryFMetrics
Creates a new empty instance.
SummaryFMetrics(FieldMap) - Constructor for class de.fu_berlin.ties.eval.SummaryFMetrics
Creates a new instance from a field map, fulfilling the Storable contract.
switchToNextExtraction() - Method in class de.fu_berlin.ties.extract.ExtractionLocator
Switches to the next extraction, updating the current extraction and related fields.

T

TagConstituent - Class in de.fu_berlin.ties.xml
A tag in an XML document.
TagConstituent(short, String) - Constructor for class de.fu_berlin.ties.xml.TagConstituent
Creates a new instance, automatically generating a suitable representation without attributes (i.e.
TagConstituent(short, String, int) - Constructor for class de.fu_berlin.ties.xml.TagConstituent
Creates a new instance, automatically generating a suitable representation without attributes (i.e.
TagConstituent(short, String, String) - Constructor for class de.fu_berlin.ties.xml.TagConstituent
Creates a new instance, without storing a markup series number.
TagConstituent(short, String, String, int) - Constructor for class de.fu_berlin.ties.xml.TagConstituent
Creates a new instance, without setting a reference to a next constituent.
TagConstituent(short, String, String, int, XMLConstituent, XMLConstituent) - Constructor for class de.fu_berlin.ties.xml.TagConstituent
Creates a new instance.
TagContainer - Class in de.fu_berlin.ties.xml
This class manages tag sequences by keeping track of names and appearances of tags.
TagContainer() - Constructor for class de.fu_berlin.ties.xml.TagContainer
Creates a new instance.
TagIsolator - Class in de.fu_berlin.ties.xml
"Isolates" XML tags and textual contents in an XML/HTML document by printing each XML/HTML tag on a single line.
TagIsolator() - Constructor for class de.fu_berlin.ties.xml.TagIsolator
Creates a new instance.
tagSentences(String) - Method in class de.fu_berlin.ties.preprocess.TreeTagger
Adds tags to mark the sentences in a document.
TagVariety - Class in de.fu_berlin.ties.xml
Type-safe enumeration of the three variaties of tags employed for XML adjustment.
TargetClass - Class in de.fu_berlin.ties.extract
A class (type) of entities to recognize.
TargetClass(String) - Constructor for class de.fu_berlin.ties.extract.TargetClass
Non-public constructor -- should only be used by subclasses and related classes.
TargetStructure - Class in de.fu_berlin.ties.extract
A target structure manages the classes of entities to recognize.
TargetStructure() - Constructor for class de.fu_berlin.ties.extract.TargetStructure
Creates a new instance using the standard configuration.
TargetStructure(Configuration) - Constructor for class de.fu_berlin.ties.extract.TargetStructure
Creates a new instance using the provided configuration.
TaskRunner - Class in de.fu_berlin.ties.util
Asynchronously executes any number of Runnable tasks.
TaskRunner(String) - Constructor for class de.fu_berlin.ties.util.TaskRunner
Creates a new instance, using the default priority (Thread.NORM_PRIORITY) for threads.
TaskRunner(String, int) - Constructor for class de.fu_berlin.ties.util.TaskRunner
Creates a new instance.
TEMPLATE_DIR - Static variable in class de.fu_berlin.ties.util.VelocityService
The template directory.
TEMPLATE_EXT - Static variable in class de.fu_berlin.ties.util.VelocityService
The extension of template files.
TENTATIVE - Static variable in class de.fu_berlin.ties.xml.TagVariety
Constant for tentative start tags that were created when an end tag is followed by another end tag of the same type without a start tag of this type between them.
TEXT - Static variable in class de.fu_berlin.ties.context.FeatureType
Constant for features representing textual content (marker is empty just as for elements).
TEXT - Static variable in class de.fu_berlin.ties.xml.OtherConstituent
Type constant: textual content.
TextProcessor - Class in de.fu_berlin.ties
Abstract base class for a Processor that operates on text documents.
TextProcessor(String, TiesConfiguration) - Constructor for class de.fu_berlin.ties.TextProcessor
Creates a new instance.
TextTokenizer - Class in de.fu_berlin.ties.text
Splits a text into a sequence of tokens.
TextTokenizer(String[], String, CharSequence) - Constructor for class de.fu_berlin.ties.text.TextTokenizer
Creates a new instance.
TextUtils - Class in de.fu_berlin.ties.text
A static class that provides utility constants and methods for working with texts and regular expressions.
threshold(float) - Method in class de.fu_berlin.ties.classify.winnow.Winnow
Calculates the theshold (theta) to use for classification.
TiesConfiguration - Class in de.fu_berlin.ties
A composite configuration that allows to localize of keys and to querying types and descriptions of entries.
TiesConfiguration() - Constructor for class de.fu_berlin.ties.TiesConfiguration
Creates a new empty instance.
TiesConfiguration(Configuration, Configuration) - Constructor for class de.fu_berlin.ties.TiesConfiguration
Creates a new instance, delegating to TiesConfiguration.addConfiguration(Configuration, Configuration).
TiesConfiguration(String) - Constructor for class de.fu_berlin.ties.TiesConfiguration
Creates a new instance, delegating to TiesConfiguration.load(String).
TiesConfiguration.EntryDescriptor - Class in de.fu_berlin.ties
An inner class wrapping descriptor information on an entry: type of the entry, whether it is optional or a list, a description of the entry.
toElement() - Method in class de.fu_berlin.ties.classify.TrainableClassifier
Stores all relevant fields of this object in an XML element for serialization. Subclasses of TrainableClassifier should extend this method and the corresponding constructor from Element to ensure (de)serialization works as expected.
toElement() - Method in interface de.fu_berlin.ties.xml.io.XMLStorable
Stores all relevant fields of this object in an XML element for serialization.
TOKEN_TYPE_PATTERNS - Static variable in class de.fu_berlin.ties.context.DefaultRepresentation
A sequence map mapping used by DefaultRepresentation.calculateValuesFromText(String, String, List) to determine the "tokenType" value.
TokenContainer - Class in de.fu_berlin.ties.text
A container that keeps track of the tokens in a document.
TokenContainer(TokenizerFactory) - Constructor for class de.fu_berlin.ties.text.TokenContainer
Creates a new instance.
tokenCount() - Method in class de.fu_berlin.ties.extract.Extraction
Returns the number of tokens in this extraction.
TokenCounter - Class in de.fu_berlin.ties.text
A simple container that keeps track of the tokens in a document.
TokenCounter() - Constructor for class de.fu_berlin.ties.text.TokenCounter
Creates a new instance.
TokenDetails - Class in de.fu_berlin.ties.text
Stores details on a token in a document.
TokenDetails(String, int, int, boolean) - Constructor for class de.fu_berlin.ties.text.TokenDetails
Creates a new instance.
TokenizerFactory - Class in de.fu_berlin.ties.text
Factory for creating TextTokenizers of different types.
TokenizerFactory(TiesConfiguration) - Constructor for class de.fu_berlin.ties.text.TokenizerFactory
Creates a new instance from the TokenizerFactory.CONFIG_TOKEN_PATTERNS and TokenizerFactory.CONFIG_WHITESPACE_PATTERN keys of the provided configuration.
TokenizerFactory(TiesConfiguration, String) - Constructor for class de.fu_berlin.ties.text.TokenizerFactory
Creates a new instance from the TokenizerFactory.CONFIG_TOKEN_PATTERNS and TokenizerFactory.CONFIG_WHITESPACE_PATTERN keys of the provided configuration, adapted by appending the suffix.
TokenProcessor - Interface in de.fu_berlin.ties.xml.dom
Interface for a processor that operates on tokens in XML documents.
TokenWalker - Class in de.fu_berlin.ties.xml.dom
Walks through a document, handing all textual tokens over to a TokenProcessor.
TokenWalker(TokenProcessor, TokenizerFactory) - Constructor for class de.fu_berlin.ties.xml.dom.TokenWalker
Creates a new instance.
toString() - Method in class de.fu_berlin.ties.classify.ExternalClassifier
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.classify.feature.DefaultFeature
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.classify.feature.Feature
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.classify.feature.FeatureTransformer
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.classify.feature.FeatureVector
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.classify.feature.OSBTransformer
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.classify.feature.SBPHTransformer
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.classify.MetaClassifier
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.classify.MultiBinaryClassifier
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.classify.OneAgainstTheRestClassifier
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.classify.PredictionComparator
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.classify.PredictionDistribution
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.classify.Probability
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.classify.Reranker
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.classify.TrainableClassifier
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.classify.winnow.Winnow
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.classify.winnow.WinnowDistribution
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.classify.winnow.WinnowStore
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.combi.CombinationState
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.combi.CombinationStrategy
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.combi.InsideOutsideStrategy
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.context.AbstractRepresentation
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.context.DefaultRepresentation
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.context.ElementPosition
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.context.FeatureType
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.context.GlobalFeature
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.context.LocalFeature
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.context.PriorRecognitions.Pair
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.context.PriorRecognitions
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.context.Representation
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.context.sensor.ListSensor
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.context.SimpleRepresentation
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.DirectoryProcessor
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.eval.EvalStatus
Returns a string representation, printing the name of this instance.
toString() - Method in class de.fu_berlin.ties.eval.MultiFMetrics
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.eval.ShuffleGenerator
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.extract.AnswerBuilder
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.extract.EvaluatedExtractionContainer
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.extract.ExtractionContainer
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.extract.ExtractionLocator
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.extract.Extractor
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.extract.ExtractorBase
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.extract.TargetClass
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.extract.TargetStructure
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.extract.Trainer
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.extract.TrainEval.Results
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.extract.TrainEval
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.filter.EmbeddingElements
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.filter.FilteringTokenWalker
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.filter.RepresentationFilter
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.filter.TrainableFilter
Returns a string representation of this object.
toString(Storable) - Static method in class de.fu_berlin.ties.io.BaseStorable
Returns a String representation of a Storable object, printing all field name/value pairs in the order used to insert them into the FieldMap.
toString() - Method in class de.fu_berlin.ties.io.BaseStorable
Returns a String representation of this object, printing all field name/value pairs in the order used to insert them into the FieldMap.
toString() - Method in class de.fu_berlin.ties.io.ContentType
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.io.ExtensionFilter
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.io.FieldContainer
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.preprocess.PreProcessor
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.preprocess.TreeTagger
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.text.TextTokenizer
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.text.TokenContainer
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.text.TokenCounter
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.text.TokenDetails
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.text.TokenizerFactory
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.TextProcessor
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.TiesConfiguration.EntryDescriptor
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.util.ExternalCommand
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.util.InvertedComparator
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.util.ListEntry
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.util.MutableInt
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.util.TaskRunner
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.xml.dom.DocumentWalker
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.xml.dom.ElementNameFilter
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.xml.dom.TokenWalker
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.xml.OtherConstituent
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.xml.TagConstituent
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.xml.TagContainer
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.xml.TagIsolator
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.xml.TagVariety
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.xml.XMLAdjuster
Returns a string representation of this object.
toString() - Method in class de.fu_berlin.ties.xml.XMLConstituent
Returns a string representation of this object.
totalSize() - Method in class de.fu_berlin.ties.util.MultiValueMap
Gets the total size of the map by counting all the values.
trailingWhitespaceCount(String) - Method in class de.fu_berlin.ties.text.TextTokenizer
Convenience method that counts the number of whitespace characters at the end of a string, according to the defined whitespace pattern.
train(FeatureVector, String) - Method in class de.fu_berlin.ties.classify.TrainableClassifier
Incorporates an item that is represented by a feature vector into the classification model.
train(Document, ExtractionContainer) - Method in class de.fu_berlin.ties.extract.Trainer
Trains the local classifier with the correct extractions of an XML document, using the provided context representation.
TrainableClassifier - Class in de.fu_berlin.ties.classify
Classifiers extending this abstract class must provide a training mechanism by implementing the TrainableClassifier.doTrain(FeatureVector, String, ContextMap) method.
TrainableClassifier(Set<String>, FeatureTransformer, TiesConfiguration) - Constructor for class de.fu_berlin.ties.classify.TrainableClassifier
Creates a new instance.
TrainableFilter - Class in de.fu_berlin.ties.filter
An abstract filter that uses a trainable classifier for training.
TrainableFilter(TiesConfiguration, NodeFilter, NodeFilter, Reranker) - Constructor for class de.fu_berlin.ties.filter.TrainableFilter
Creates a new instance.
TrainableFilteringTokenWalker - Class in de.fu_berlin.ties.filter
A filtering token walker that can be trained.
TrainableFilteringTokenWalker(TokenProcessor, TokenizerFactory, TrainableFilter, SkipHandler, Oracle) - Constructor for class de.fu_berlin.ties.filter.TrainableFilteringTokenWalker
Creates a new instance, enabling training the filter.
TrainableFilteringTokenWalker(TokenProcessor, TokenizerFactory, TrainableFilter, SkipHandler, Oracle, boolean) - Constructor for class de.fu_berlin.ties.filter.TrainableFilteringTokenWalker
Creates a new instance.
trainAndEval(String[], File, File, String, Writer) - Method in class de.fu_berlin.ties.extract.TrainEval
Processes an array of files.
Trainer - Class in de.fu_berlin.ties.extract
A trainer trains a local Classifier to be used for extraction.
Trainer() - Constructor for class de.fu_berlin.ties.extract.Trainer
Creates a new instance without specifying an output extension (which isn't needed anyway, because this class doesn't produce output).
Trainer(String) - Constructor for class de.fu_berlin.ties.extract.Trainer
Creates a new instance.
Trainer(String, TiesConfiguration) - Constructor for class de.fu_berlin.ties.extract.Trainer
Creates a new instance.
Trainer(String, File, TiesConfiguration) - Constructor for class de.fu_berlin.ties.extract.Trainer
Creates a new instance.
Trainer(String, TargetStructure, TrainableClassifier[], Representation, CombinationStrategy, TokenizerFactory, TrainableFilter) - Constructor for class de.fu_berlin.ties.extract.Trainer
Creates a new instance, using the standard configuration to configure the training mode and the superclasses.
Trainer(String, TargetStructure, TrainableClassifier[], Representation, CombinationStrategy, TokenizerFactory, TrainableFilter, Set<String>, boolean, boolean, TiesConfiguration) - Constructor for class de.fu_berlin.ties.extract.Trainer
Creates a new instance.
TrainEval - Class in de.fu_berlin.ties.extract
Trains an extractor and evaluates extraction quality.
TrainEval() - Constructor for class de.fu_berlin.ties.extract.TrainEval
Creates a new instance, using a default extension and the standard configuration.
TrainEval(String) - Constructor for class de.fu_berlin.ties.extract.TrainEval
Creates a new instance, using the standard configuration.
TrainEval(String, TiesConfiguration) - Constructor for class de.fu_berlin.ties.extract.TrainEval
Creates a new instance.
TrainEval(String, float, float, int, int, boolean, int, List, int, boolean, TiesConfiguration) - Constructor for class de.fu_berlin.ties.extract.TrainEval
Creates a new instance.
TrainEval.Results - Class in de.fu_berlin.ties.extract
An inner class wrapping the results of a training + evaluation run.
TrainEval.Results() - Constructor for class de.fu_berlin.ties.extract.TrainEval.Results
Creates a new instance.
trainIfNecessary(Element, boolean) - Method in class de.fu_berlin.ties.filter.TrainableFilter
Trains the correct decision for an element by calling the TrainableClassifier.trainOnError(FeatureVector, String, java.util.Set) method on the stored trainable classifier.
TrainingContextDetails - Class in de.fu_berlin.ties.extract.amend
Extends the ContextDetails class by also storing the expected (true) and predicted state on of a token.
TrainingContextDetails(String, int, int, boolean, FeatureVector, CombinationState, CombinationState) - Constructor for class de.fu_berlin.ties.extract.amend.TrainingContextDetails
Creates a new instance.
TrainingContextDetails(TokenDetails, FeatureVector, CombinationState, CombinationState) - Constructor for class de.fu_berlin.ties.extract.amend.TrainingContextDetails
Creates a new instance, re-using field values from a TokenDetails instance.
trainOnError(FeatureVector, String, Set) - Method in class de.fu_berlin.ties.classify.TrainableClassifier
Handles error-driven learning ("train only errors"): the specified feature vector is trained into the model only if the predicted class for the feature vector differs from the specified target class.
trainOnErrorHook(PredictionDistribution, FeatureVector, String, Set, ContextMap) - Method in class de.fu_berlin.ties.classify.MetaClassifier
Subclasses can implement this hook for more refined error-driven learning.
trainOnErrorHook(PredictionDistribution, FeatureVector, String, Set, ContextMap) - Method in class de.fu_berlin.ties.classify.MultiBinaryClassifier
Subclasses can implement this hook for more refined error-driven learning.
trainOnErrorHook(PredictionDistribution, FeatureVector, String, Set, ContextMap) - Method in class de.fu_berlin.ties.classify.OneAgainstTheRestClassifier
Subclasses can implement this hook for more refined error-driven learning.
trainOnErrorHook(PredictionDistribution, FeatureVector, String, Set, ContextMap) - Method in class de.fu_berlin.ties.classify.TrainableClassifier
Subclasses can implement this hook for more refined error-driven learning.
trainOnErrorHook(PredictionDistribution, FeatureVector, String, Set, ContextMap) - Method in class de.fu_berlin.ties.classify.winnow.Winnow
Hook implementing error-driven learning, promoting and demoting weights as required.
transform(FeatureVector) - Method in class de.fu_berlin.ties.classify.feature.FeatureTransformer
Transforms a feature vector.
translateCurrentState(CombinationState) - Method in class de.fu_berlin.ties.combi.BeginAfterStrategy
Translates an expected class to a class name to feed to the classifiers in training mode.
translateCurrentState(CombinationState) - Method in class de.fu_berlin.ties.combi.BeginEndStrategy
Translates an expected class to a class name to feed to the classifiers in training mode.
translateCurrentState(CombinationState) - Method in class de.fu_berlin.ties.combi.CombinationStrategy
Translates an expected class to a class name to feed to the classifiers in training mode.
translateCurrentState(CombinationState) - Method in class de.fu_berlin.ties.combi.InsideOutsideStrategy
Translates an expected class to a class name to feed to the classifiers in training mode.
translateCurrentState(CombinationState) - Method in class de.fu_berlin.ties.combi.OpenCloseStrategy
Translates an expected class to a class name to feed to the classifiers in training mode.
translateCurrentState(CombinationState) - Method in class de.fu_berlin.ties.combi.TrivialStrategy
Translates an expected class to a class name to feed to the classifiers in training mode.
translateResult(PredictionDistribution[]) - Method in class de.fu_berlin.ties.combi.BeginAfterStrategy
Translates the classes predicted by the classifiers in a decision for the caller.
translateResult(PredictionDistribution[]) - Method in class de.fu_berlin.ties.combi.BeginEndStrategy
Translates the classes predicted by the classifiers in a decision for the caller.
translateResult(PredictionDistribution[]) - Method in class de.fu_berlin.ties.combi.CombinationStrategy
Translates the classes predicted by the classifiers in a decision for the caller.
translateResult(PredictionDistribution[]) - Method in class de.fu_berlin.ties.combi.InsideOutsideStrategy
Translates the classes predicted by the classifiers in a decision for the caller.
translateResult(PredictionDistribution[]) - Method in class de.fu_berlin.ties.combi.OpenCloseStrategy
Translates the classes predicted by the classifiers in a decision for the caller.
translateResult(PredictionDistribution[]) - Method in class de.fu_berlin.ties.combi.TrivialStrategy
Translates the classes predicted by the classifiers in a decision for the caller.
TreeTagger - Class in de.fu_berlin.ties.preprocess
Integrates the TreeTagger, a linguistic tool for part-of-speech tagging and chunk parsing.
TreeTagger(String) - Constructor for class de.fu_berlin.ties.preprocess.TreeTagger
Creates a new instance, using the standard configuration.
TreeTagger(String, TiesConfiguration) - Constructor for class de.fu_berlin.ties.preprocess.TreeTagger
Creates a new instance.
TrivialStrategy - Class in de.fu_berlin.ties.combi
The trivial combination strategy simply uses a single tag for each class; no prefixes are used.
TrivialStrategy(Set<String>) - Constructor for class de.fu_berlin.ties.combi.TrivialStrategy
Creates a new instance.
TRUTH - Static variable in class de.fu_berlin.ties.eval.EvalStatus
Constant for items defined to be true (answer keys, "gold standard").
tryToClose(InputStream) - Static method in class de.fu_berlin.ties.io.IOUtils
Convenience method for closing an input stream.
tryToClose(OutputStream) - Static method in class de.fu_berlin.ties.io.IOUtils
Convenience method for closing an output stream.
tryToClose(Reader) - Static method in class de.fu_berlin.ties.io.IOUtils
Convenience method for closing a reader.
tryToClose(Writer) - Static method in class de.fu_berlin.ties.io.IOUtils
Convenience method for closing a writer.
TYPE_EVAL - Static variable in class de.fu_berlin.ties.extract.TrainEval
Serialization value for the "Eval" type.
TYPE_TRAIN - Static variable in class de.fu_berlin.ties.extract.TrainEval
Serialization value for the "Train" type.
types() - Method in class de.fu_berlin.ties.eval.MultiFMetrics
Returns the set of all types (Strings) currently stored in this instance.
types() - Method in interface de.fu_berlin.ties.eval.MultiFMetricsView
Returns the set of all types (Strings) currently stored in this instance.

U

UC_SUFFIX - Static variable in class de.fu_berlin.ties.classify.winnow.UltraconservativeWinnow
Optional prefix used to give values for the configuration parameters of this classifier that differ from the values used by Winnow.
UltraconservativeWinnow - Class in de.fu_berlin.ties.classify.winnow
A combination of Winnow with the "ultraconservative" approach proposed by Koby Crammer and Yoram Singer.
UltraconservativeWinnow(Set<String>) - Constructor for class de.fu_berlin.ties.classify.winnow.UltraconservativeWinnow
Creates a new instance by delegating to the corresponding super constructor.
UltraconservativeWinnow(Set<String>, TiesConfiguration) - Constructor for class de.fu_berlin.ties.classify.winnow.UltraconservativeWinnow
Creates a new instance by delegating to the corresponding super constructor.
UltraconservativeWinnow(Set<String>, FeatureTransformer, TiesConfiguration) - Constructor for class de.fu_berlin.ties.classify.winnow.UltraconservativeWinnow
Creates a new instance by delegating to the corresponding super constructor.
UltraconservativeWinnow(Set<String>, FeatureTransformer, boolean, float, float, float, TiesConfiguration, String) - Constructor for class de.fu_berlin.ties.classify.winnow.UltraconservativeWinnow
Creates a new instance by delegating to the corresponding super constructor.
UNKNOWN - Static variable in class de.fu_berlin.ties.eval.EvalStatus
Constant for predictions whose evaluation status is unknown (not yet evaluated).
UnprocessedTags - Class in de.fu_berlin.ties.xml
A container that keeps track of unprocessed start and end tags during XML adjustment.
UnprocessedTags() - Constructor for class de.fu_berlin.ties.xml.UnprocessedTags
Creates a new instance.
UNQUOTED_ATTRIB_CHARS - Static variable in class de.fu_berlin.ties.xml.XMLAdjuster
Pattern string specifying characters that can occur at the start of end of an unquoted attribute value: everything except '<', '>', '=' and whitespace (whitespace is also allowed, but only in the middle of a value).
UNQUOTED_ATTRIBUTE - Static variable in class de.fu_berlin.ties.xml.XMLAdjuster
Pattern string specifying an XML attribute without proper quotes.
unsetJavaClassAttrib() - Method in class de.fu_berlin.ties.xml.io.ObjectElement
Unsets the attribute representing the Class of the stored object.
unsetPositions() - Method in class de.fu_berlin.ties.extract.ExtractionContainer
Unsets the positions of all stored extractions.
update(FeatureVector) - Method in class de.fu_berlin.ties.eval.FeatureCount
Evaluates a feature vector and updates the statistics accordingly.
update(EvalInput) - Method in class de.fu_berlin.ties.eval.FMetrics
Updates the statistics, increasing the stored values as specified.
update(long, long, long) - Method in class de.fu_berlin.ties.eval.FMetrics
Updates the statistics, increasing the stored values as specified.
update(MultiFMetricsView) - Method in class de.fu_berlin.ties.eval.MultiFMetrics
Updates the statistics by adding the contents of the given multi-metrics.
update(String, EvalInput) - Method in class de.fu_berlin.ties.eval.MultiFMetrics
Updates the statistics for a specified type, increasing the stored values as specified.
update(String, long, long, long) - Method in class de.fu_berlin.ties.eval.MultiFMetrics
Updates the statistics for a specified type, increasing the stored values as specified.
update(long, long, long) - Method in class de.fu_berlin.ties.eval.SummaryFMetrics
Updates the statistics, increasing the stored values as specified and updating the statistical summaries.
updateExtraction(String, int) - Method in class de.fu_berlin.ties.extract.ExtractionLocator
Updates the currently processed extraction.
updateScores(Feature, double, float[]) - Method in class de.fu_berlin.ties.classify.winnow.Winnow
Updates the score (activation values) for all classes by adding the weights of a feature.
updateState(CombinationState) - Method in class de.fu_berlin.ties.combi.CombinationStrategy
Updates the current state (the class of the last item).
updateState(Element, String, String, String) - Method in class de.fu_berlin.ties.extract.ExtractorBase
Helper that build the featuresand determines the active classesfor an element.
Util - Class in de.fu_berlin.ties.util
A static class that provides general utility methods.
Util() - Constructor for class de.fu_berlin.ties.util.Util
Util instances should NOT be constructed in standard programming.

V

values() - Method in class de.fu_berlin.ties.util.MultiValueMap
Gets a collection containing all the values in the map.
VelocityService - Class in de.fu_berlin.ties.util
A static class that provides a convenience interface to a Velocity singletons.
VERSION_NO - Static variable in class de.fu_berlin.ties.Main
The current version number of this software.
view(String) - Method in class de.fu_berlin.ties.eval.MultiFMetrics
Returns a read-only view of the FMetrics of the specified type.
view(String) - Method in interface de.fu_berlin.ties.eval.MultiFMetricsView
Returns a read-only view of the FMetrics of the specified type.
viewAll() - Method in class de.fu_berlin.ties.eval.MultiFMetrics
Returns a read-only view of the FMetrics containing the sums and averages over all types.
viewAll() - Method in interface de.fu_berlin.ties.eval.MultiFMetricsView
Returns a read-only view of the FMetrics containing the sums and averages over all types.
viewAllSummary() - Method in class de.fu_berlin.ties.eval.MultiFMetrics
Optional operation that shows statistical summaries of precision, recall, and F1 metrics over all types, if calculated.
viewAllSummary() - Method in interface de.fu_berlin.ties.eval.MultiFMetricsView
Optional operation that shows statistical summaries of precision, recall, and F1 metrics over all types, if calculated by the used implementation.
viewF1Summary() - Method in interface de.fu_berlin.ties.eval.FMetricsSummary
Returns a summary view on the F1 values.
viewF1Summary() - Method in class de.fu_berlin.ties.eval.SummaryFMetrics
Returns a summary view on the F1 values.
viewFeatureCount() - Method in class de.fu_berlin.ties.extract.ExtractorBase
Returns a read-only view on the counted documents, contexts, and features and the calculated averages.
viewGlobalAccuracy() - Method in class de.fu_berlin.ties.extract.Trainer
Returns a view on the global (overall) accuracies measured so far (or after the last call to Trainer.resetGlobalAccuracy()) by each classifier.
viewMetrics() - Method in class de.fu_berlin.ties.extract.EvaluatedExtractionContainer
Returns a read-only view of the counts of true positives (correct extractions), false negatives (missing extractions) and false positives (spurious extractions) stored in this container of each type as well as for all types; and the metrics that can be calculated from these values.
viewPrecisionSummary() - Method in interface de.fu_berlin.ties.eval.FMetricsSummary
Returns a summary view on the precision values.
viewPrecisionSummary() - Method in class de.fu_berlin.ties.eval.SummaryFMetrics
Returns a summary view on the precision values.
viewRecallSummary() - Method in interface de.fu_berlin.ties.eval.FMetricsSummary
Returns a summary view on the recall values.
viewRecallSummary() - Method in class de.fu_berlin.ties.eval.SummaryFMetrics
Returns a summary view on the recall values.
viewRelevantPunctuation() - Method in class de.fu_berlin.ties.extract.ExtractorBase
Returns a read-only view on the set of punctuation tokens that have been found to be relevant for token classification (because they sometimes occur as the first or last token of an extraction).
viewSummary(String) - Method in class de.fu_berlin.ties.eval.MultiFMetrics
Optional operation that shows statistical summaries of precision, recall, and F1 metrics of the specified type, if calculated.
viewSummary(String) - Method in interface de.fu_berlin.ties.eval.MultiFMetricsView
Optional operation that shows statistical summaries of precision, recall, and F1 metrics of the specified type, if calculated by the used implementation.

W

walk(Document, ContextMap) - Method in class de.fu_berlin.ties.filter.FilteringTokenWalker
Walks through the contents of an XML document, tokenizing the textual contents.
walk(Document, ContextMap) - Method in class de.fu_berlin.ties.xml.dom.DocumentWalker
Walks through the contents of an XML document, tokenizing the textual contents.
walk(Element, TokenContainer, ContextMap) - Method in class de.fu_berlin.ties.xml.dom.DocumentWalker
Walks through the contents of a node, tokenizing textual contents and recursing through nested elements.
walk(Document, ContextMap) - Method in class de.fu_berlin.ties.xml.dom.TokenWalker
Walks through the contents of an XML document, tokenizing the textual contents.
walk(Element, TokenCounter, TextTokenizer, ContextMap) - Method in class de.fu_berlin.ties.xml.dom.TokenWalker
Walks through the contents of a node, tokenizing textual contents and recursing through nested elements.
WHITESPACE_CONTROL_OTHER - Static variable in class de.fu_berlin.ties.text.TokenizerFactory
Pattern string capturing whitespace and control/other characters.
WHITESPACE_PATTERN - Static variable in class de.fu_berlin.ties.text.TextUtils
A simple regular expression for whitespace.
Winnow - Class in de.fu_berlin.ties.classify.winnow
Classifier implementing the Winnow algorithm (Nick Littlestone).
Winnow(Set<String>) - Constructor for class de.fu_berlin.ties.classify.winnow.Winnow
Creates a new instance based on the standard configuration.
Winnow(Set<String>, String) - Constructor for class de.fu_berlin.ties.classify.winnow.Winnow
Creates a new instance based on the standard configuration.
Winnow(Set<String>, TiesConfiguration) - Constructor for class de.fu_berlin.ties.classify.winnow.Winnow
Creates a new instance based on the provided configuration.
Winnow(Set<String>, TiesConfiguration, String) - Constructor for class de.fu_berlin.ties.classify.winnow.Winnow
Creates a new instance based on the provided configuration.
Winnow(Set<String>, FeatureTransformer, TiesConfiguration) - Constructor for class de.fu_berlin.ties.classify.winnow.Winnow
Creates a new instance based on the provided configuration.
Winnow(Set<String>, FeatureTransformer, TiesConfiguration, String) - Constructor for class de.fu_berlin.ties.classify.winnow.Winnow
Creates a new instance based on the provided configuration.
Winnow(Set<String>, FeatureTransformer, boolean, float, float, float, TiesConfiguration, String) - Constructor for class de.fu_berlin.ties.classify.winnow.Winnow
Creates a new instance.
WinnowDistribution - Class in de.fu_berlin.ties.classify.winnow
A distribution over the classes predicted by a Winnow classifier.
WinnowDistribution(float, float) - Constructor for class de.fu_berlin.ties.classify.winnow.WinnowDistribution
Creates a new instance.
WinnowPrediction - Class in de.fu_berlin.ties.classify.winnow
A prediction that also stores a raw score (activation value) and a sigmoid score, as calculated by the Winnow algorithm.
WinnowPrediction(FieldMap) - Constructor for class de.fu_berlin.ties.classify.winnow.WinnowPrediction
Creates a new instance from a field map, fulfilling the Storable contract.
WinnowPrediction(String, double, float, float) - Constructor for class de.fu_berlin.ties.classify.winnow.WinnowPrediction
Creates a new instance, setting the evaluation status to EvalStatus.UNKNOWN.
WinnowPrediction(String, double, float, float, EvalStatus) - Constructor for class de.fu_berlin.ties.classify.winnow.WinnowPrediction
Creates a new instance, without setting a source ID.
WinnowPrediction(String, String, double, float, float, EvalStatus) - Constructor for class de.fu_berlin.ties.classify.winnow.WinnowPrediction
Creates a new instance.
WinnowStore - Class in de.fu_berlin.ties.classify.winnow
Feature store used by Winnow.
WinnowStore(float, TiesConfiguration, String) - Constructor for class de.fu_berlin.ties.classify.winnow.WinnowStore
Creates a new instance.
WinnowStore(float, int, int, int) - Constructor for class de.fu_berlin.ties.classify.winnow.WinnowStore
Creates a new instance.
writeDocument(Document, OutputStream) - Static method in class de.fu_berlin.ties.xml.dom.DOMUtils
Writes an XML document to a given stream.
writeDocument(Document, OutputStreamWriter) - Static method in class de.fu_berlin.ties.xml.dom.DOMUtils
Writes an XML document to a given writer, using the character set of the underlying output stream.
writeDocument(Document, Writer, String) - Static method in class de.fu_berlin.ties.xml.dom.DOMUtils
Writes an XML document to a given writer, using the given character set.
writeDocument(Document, Writer) - Method in interface de.fu_berlin.ties.xml.io.DocumentSerializer
Writes an XML document to a given writer.
writeDocument(Document, OutputStream) - Method in interface de.fu_berlin.ties.xml.io.DocumentSerializer
Writes an XML document to a given stream .
writeToWriter(CharSequence, Writer) - Static method in class de.fu_berlin.ties.io.IOUtils
Writes the contents of a character sequence to a writer.

X

XML_ATTRIBUTE - Static variable in class de.fu_berlin.ties.xml.XMLTokenizerFactory
Pattern string specifying an XML attribute (name = quoted-value pair).
XML_CDATA_SECTION - Static variable in class de.fu_berlin.ties.xml.XMLTokenizerFactory
Pattern string specifying a CDATA section in an XML document.
XML_CDATA_TOKEN - Static variable in class de.fu_berlin.ties.xml.XMLTokenizerFactory
Pattern string for a visible textual token in XML documents (contains neither whitespace nor markup).
XML_COMMENT - Static variable in class de.fu_berlin.ties.xml.XMLTokenizerFactory
Pattern string specifying an XML comment.
XML_DOCTYPE - Static variable in class de.fu_berlin.ties.xml.XMLTokenizerFactory
Pattern string specifying an XML document type declaration.
XML_END_TAG - Static variable in class de.fu_berlin.ties.xml.XMLTokenizerFactory
Pattern string specifying an XML end tag.
XML_EQUAL_SIGN - Static variable in class de.fu_berlin.ties.xml.XMLTokenizerFactory
Pattern string for the '=' sign, optionally surrounded by whitespace.
XML_EXTENSION - Static variable in class de.fu_berlin.ties.TiesConfiguration
The extension of config files in XML format.
XML_NAME - Static variable in class de.fu_berlin.ties.xml.XMLTokenizerFactory
Pattern string for XML names (according to XML 1.1).
XML_NAME_START_CHAR - Static variable in class de.fu_berlin.ties.xml.XMLTokenizerFactory
Pattern string specifying the class of valid start characters of XML names.
XML_OPT_WHITESPACE - Static variable in class de.fu_berlin.ties.xml.XMLTokenizerFactory
Pattern specifying optional whitespace in an XML document (zero or more whitespace characters).
XML_PATTERNS - Static variable in class de.fu_berlin.ties.xml.XMLTokenizerFactory
The array of patterns used for shallow XML parsing.
XML_PROLOG - Static variable in class de.fu_berlin.ties.xml.OtherConstituent
Type constant: the XML prolog.
XML_PROLOG_OR_PI - Static variable in class de.fu_berlin.ties.xml.XMLTokenizerFactory
Pattern string specifying an XML prolog or processing instruction.
XML_QUOTED_STRING - Static variable in class de.fu_berlin.ties.xml.XMLTokenizerFactory
Pattern string for strings enclosed in full or half quotes, e.g.
XML_START_OR_EMPTY_TAG - Static variable in class de.fu_berlin.ties.xml.XMLTokenizerFactory
Pattern string specifying an XML start or empty tag (combined into a single pattern to avoid unnecessary backtracking).
XML_TEXTUAL_CONTENT - Static variable in class de.fu_berlin.ties.xml.XMLTokenizerFactory
Pattern string specifying textual content (character data) in an XML document.
XML_WHITESPACE - Static variable in class de.fu_berlin.ties.xml.XMLTokenizerFactory
Pattern specifying whitespace in an XML document (one or more whitespace characters).
XML_WHITESPACE_CHARS - Static variable in class de.fu_berlin.ties.xml.XMLTokenizerFactory
Pattern fragment listing allowed whitespace characters in an XML document.
XMLAdjuster - Class in de.fu_berlin.ties.xml
This class tries to fix corrupt XML documents, especially documents containing nesting errors.
XMLAdjuster() - Constructor for class de.fu_berlin.ties.xml.XMLAdjuster
Creates a new instance using a default extension and the standard configuration.
XMLAdjuster(String) - Constructor for class de.fu_berlin.ties.xml.XMLAdjuster
Creates a new instance, configured from the standard configuration.
XMLAdjuster(String, TiesConfiguration) - Constructor for class de.fu_berlin.ties.xml.XMLAdjuster
Creates a new instance from the provided configuration.
XMLAdjuster(String, String, Set<String>, boolean, boolean, boolean, TiesConfiguration) - Constructor for class de.fu_berlin.ties.xml.XMLAdjuster
Creates a new instance.
XMLConstituent - Class in de.fu_berlin.ties.xml
A constituent in an XML document.
XMLConstituent(short, String) - Constructor for class de.fu_berlin.ties.xml.XMLConstituent
Creates a new instance, without setting a reference to a next constituent.
XMLConstituent(short, String, XMLConstituent, XMLConstituent) - Constructor for class de.fu_berlin.ties.xml.XMLConstituent
Creates a new instance.
XMLStorable - Interface in de.fu_berlin.ties.xml.io
Classes implementing this interface support serialization in a human-readable XML-based format.
XMLStripper - Class in de.fu_berlin.ties.xml.dom
An XML stripper converts a XML document to plain text, removing all markup.
XMLStripper() - Constructor for class de.fu_berlin.ties.xml.dom.XMLStripper
Creates a new instance, using a default extension and the standard configuration.
XMLStripper(String) - Constructor for class de.fu_berlin.ties.xml.dom.XMLStripper
Creates a new instance, using the standard configuration.
XMLStripper(String, TiesConfiguration) - Constructor for class de.fu_berlin.ties.xml.dom.XMLStripper
Creates a new instance.
XMLTokenizerFactory - Class in de.fu_berlin.ties.xml
Static factory for creating a TextTokenizers for XML-like input.

A B C D E F G H I J K L M N O P Q R S T U V W X

Copyright © 2003-2004 Christian Siefkes. All Rights Reserved.