Name | Class | Output Extension | Further Arguments | Description |
---|---|---|---|---|
adjust | de.fu_berlin.ties.xml.XMLAdjuster | xml | | Tries to fix corrupt XML documents, especially documents containing nesting errors |
answers | de.fu_berlin.ties.extract.AnswerBuilder | ans | | Builds answer keys from an annotated text (in XML format) |
class-train | de.fu_berlin.ties.classify.ClassTrain | cls | | Classifies a list of files, training the text classifier on each error |
extract | de.fu_berlin.ties.extract.Extractor | ext | | Extracts relevant information from texts |
preprocess | de.fu_berlin.ties.preprocess.PreProcessor | aug | | Preprocesses documents by converting them to a suitable XML format and adding linguistic information |
re-eval | de.fu_berlin.ties.eval.ReEvaluator | ext | | Re-evaluates evaluated extractions (useful for switching the match mode -- eval.match.all) |
shuffle | de.fu_berlin.ties.eval.ShuffleGenerator | | | Creates random "shuffles" of input arguments (e.g. files or URLs) |
strip | de.fu_berlin.ties.xml.dom.XMLStripper | txt | | Strips all markup from an XML document and stores the resulting plain text |
train | de.fu_berlin.ties.extract.Trainer | | | Trains the classifier used to extract information |
train-eval | de.fu_berlin.ties.extract.TrainEval | metrics | | Trains an extractor and evaluates extraction quality |
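
The table can be read as a lookup from goal name to implementing class and output extension. The following is a minimal sketch of that lookup, not code from the project: the `GoalTable` class and its `classFor` and `extensionFor` methods are hypothetical names introduced only for illustration, and only the entries themselves are taken from the table. An empty string stands for an empty Output Extension cell.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper (not part of the project): mirrors the table rows as data.
class GoalTable {
    // goal name -> { implementing class, output extension ("" if the cell is empty) }
    private static final Map<String, String[]> GOALS = new LinkedHashMap<>();

    static {
        GOALS.put("adjust", new String[] {"de.fu_berlin.ties.xml.XMLAdjuster", "xml"});
        GOALS.put("answers", new String[] {"de.fu_berlin.ties.extract.AnswerBuilder", "ans"});
        GOALS.put("class-train", new String[] {"de.fu_berlin.ties.classify.ClassTrain", "cls"});
        GOALS.put("extract", new String[] {"de.fu_berlin.ties.extract.Extractor", "ext"});
        GOALS.put("preprocess", new String[] {"de.fu_berlin.ties.preprocess.PreProcessor", "aug"});
        GOALS.put("re-eval", new String[] {"de.fu_berlin.ties.eval.ReEvaluator", "ext"});
        GOALS.put("shuffle", new String[] {"de.fu_berlin.ties.eval.ShuffleGenerator", ""});
        GOALS.put("strip", new String[] {"de.fu_berlin.ties.xml.dom.XMLStripper", "txt"});
        GOALS.put("train", new String[] {"de.fu_berlin.ties.extract.Trainer", ""});
        GOALS.put("train-eval", new String[] {"de.fu_berlin.ties.extract.TrainEval", "metrics"});
    }

    // Returns the fully qualified class name listed for the goal.
    static String classFor(String goal) {
        return lookup(goal)[0];
    }

    // Returns the output extension listed for the goal, or "" if none is listed.
    static String extensionFor(String goal) {
        return lookup(goal)[1];
    }

    private static String[] lookup(String goal) {
        String[] row = GOALS.get(goal);
        if (row == null) {
            throw new IllegalArgumentException("Unknown goal: " + goal);
        }
        return row;
    }

    public static void main(String[] args) {
        // Print every goal with its class and extension, in table order.
        for (String goal : GOALS.keySet()) {
            String ext = extensionFor(goal);
            System.out.println(goal + " -> " + classFor(goal)
                    + (ext.isEmpty() ? "" : " (." + ext + ")"));
        }
    }
}
```

Keeping the rows in a `LinkedHashMap` preserves the table order when iterating, which makes the printed listing easy to compare against the table.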