gate.creole.splitter
Class SentenceSplitter

java.lang.Object
  |
  +--gate.util.AbstractFeatureBearer
        |
        +--gate.creole.AbstractResource
              |
              +--gate.creole.AbstractProcessingResource
                    |
                    +--gate.creole.AbstractLanguageAnalyser
                          |
                          +--gate.creole.splitter.SentenceSplitter
All Implemented Interfaces:
ANNIEConstants, Executable, FeatureBearer, LanguageAnalyser, NameBearer, ProcessingResource, Resource, Serializable

public class SentenceSplitter
extends AbstractLanguageAnalyser

A sentence splitter. This is module similar to a gate.creole.nerc.Nerc in the fact that it conatins a tokeniser, a gazetteer and a Jape grammar. This class is used so we can have a different entry in the creole.xml file describing the default resources and to add some minor processing after running the components in order to extract the results in a usable form.

See Also:
Serialized Form

Nested Class Summary
 
Nested classes inherited from class gate.creole.AbstractProcessingResource
AbstractProcessingResource.InternalStatusListener, AbstractProcessingResource.IntervalProgressListener
 
Field Summary
private static boolean DEBUG
           
private  String encoding
           
(package private)  DefaultGazetteer gazetteer
           
private  URL gazetteerListsURL
           
private  String inputASName
           
private  String outputASName
           
static String SPLIT_DOCUMENT_PARAMETER_NAME
           
static String SPLIT_ENCODING_PARAMETER_NAME
           
static String SPLIT_GAZ_URL_PARAMETER_NAME
           
static String SPLIT_INPUT_AS_PARAMETER_NAME
           
static String SPLIT_OUTPUT_AS_PARAMETER_NAME
           
static String SPLIT_TRANSD_URL_PARAMETER_NAME
           
(package private)  Transducer transducer
           
private  URL transducerURL
           
 
Fields inherited from class gate.creole.AbstractLanguageAnalyser
corpus, document
 
Fields inherited from class gate.creole.AbstractProcessingResource
interrupted
 
Fields inherited from class gate.creole.AbstractResource
name
 
Fields inherited from class gate.util.AbstractFeatureBearer
features
 
Fields inherited from interface gate.creole.ANNIEConstants
ANNOTATION_COREF_FEATURE_NAME, DATE_ANNOTATION_TYPE, DOCUMENT_COREF_FEATURE_NAME, LOCATION_ANNOTATION_TYPE, LOOKUP_ANNOTATION_TYPE, LOOKUP_CLASS_FEATURE_NAME, LOOKUP_MAJOR_TYPE_FEATURE_NAME, LOOKUP_MINOR_TYPE_FEATURE_NAME, LOOKUP_ONTOLOGY_FEATURE_NAME, MONEY_ANNOTATION_TYPE, ORGANIZATION_ANNOTATION_TYPE, PERSON_ANNOTATION_TYPE, PERSON_GENDER_FEATURE_NAME, PR_NAMES, SENTENCE_ANNOTATION_TYPE, SPACE_TOKEN_ANNOTATION_TYPE, TOKEN_ANNOTATION_TYPE, TOKEN_CATEGORY_FEATURE_NAME, TOKEN_KIND_FEATURE_NAME, TOKEN_LENGTH_FEATURE_NAME, TOKEN_ORTH_FEATURE_NAME, TOKEN_STRING_FEATURE_NAME
 
Constructor Summary
SentenceSplitter()
           
 
Method Summary
 void execute()
          Starts the execution of this executable
 String getEncoding()
           
 URL getGazetteerListsURL()
           
 String getInputASName()
           
 String getOutputASName()
           
 URL getTransducerURL()
           
 Resource init()
          Initialise this resource, and return it.
 void interrupt()
          Notifies all the PRs in this controller that they should stop their execution as soon as possible.
 void setEncoding(String newEncoding)
           
 void setGazetteerListsURL(URL newGazetteerListsURL)
           
 void setInputASName(String newInputASName)
           
 void setOutputASName(String newOutputASName)
           
 void setTransducerURL(URL newTransducerURL)
           
 
Methods inherited from class gate.creole.AbstractLanguageAnalyser
getCorpus, getDocument, setCorpus, setDocument
 
Methods inherited from class gate.creole.AbstractProcessingResource
addProgressListener, addStatusListener, cleanup, fireProcessFinished, fireProgressChanged, fireStatusChanged, isInterrupted, reInit, removeProgressListener, removeStatusListener
 
Methods inherited from class gate.creole.AbstractResource
checkParameterValues, getName, getParameterValue, getParameterValue, removeResourceListeners, setName, setParameterValue, setParameterValue, setParameterValues, setParameterValues, setResourceListeners
 
Methods inherited from class gate.util.AbstractFeatureBearer
getFeatures, setFeatures
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 
Methods inherited from interface gate.ProcessingResource
reInit
 
Methods inherited from interface gate.Resource
cleanup, getParameterValue, setParameterValue, setParameterValues
 
Methods inherited from interface gate.util.FeatureBearer
getFeatures, setFeatures
 
Methods inherited from interface gate.util.NameBearer
getName, setName
 
Methods inherited from interface gate.Executable
isInterrupted
 

Field Detail

SPLIT_DOCUMENT_PARAMETER_NAME

public static final String SPLIT_DOCUMENT_PARAMETER_NAME
See Also:
Constant Field Values

SPLIT_INPUT_AS_PARAMETER_NAME

public static final String SPLIT_INPUT_AS_PARAMETER_NAME
See Also:
Constant Field Values

SPLIT_OUTPUT_AS_PARAMETER_NAME

public static final String SPLIT_OUTPUT_AS_PARAMETER_NAME
See Also:
Constant Field Values

SPLIT_ENCODING_PARAMETER_NAME

public static final String SPLIT_ENCODING_PARAMETER_NAME
See Also:
Constant Field Values

SPLIT_GAZ_URL_PARAMETER_NAME

public static final String SPLIT_GAZ_URL_PARAMETER_NAME
See Also:
Constant Field Values

SPLIT_TRANSD_URL_PARAMETER_NAME

public static final String SPLIT_TRANSD_URL_PARAMETER_NAME
See Also:
Constant Field Values

gazetteer

DefaultGazetteer gazetteer

transducer

Transducer transducer

transducerURL

private URL transducerURL

encoding

private String encoding

gazetteerListsURL

private URL gazetteerListsURL

DEBUG

private static final boolean DEBUG
See Also:
Constant Field Values

inputASName

private String inputASName

outputASName

private String outputASName
Constructor Detail

SentenceSplitter

public SentenceSplitter()
Method Detail

init

public Resource init()
              throws ResourceInstantiationException
Description copied from interface: Resource
Initialise this resource, and return it.

Specified by:
init in interface Resource
Overrides:
init in class AbstractProcessingResource
ResourceInstantiationException

execute

public void execute()
             throws ExecutionException
Description copied from interface: Executable
Starts the execution of this executable

Specified by:
execute in interface Executable
Overrides:
execute in class AbstractProcessingResource
ExecutionException

interrupt

public void interrupt()
Notifies all the PRs in this controller that they should stop their execution as soon as possible.

Specified by:
interrupt in interface Executable
Overrides:
interrupt in class AbstractProcessingResource

setTransducerURL

public void setTransducerURL(URL newTransducerURL)

getTransducerURL

public URL getTransducerURL()

setEncoding

public void setEncoding(String newEncoding)

getEncoding

public String getEncoding()

setGazetteerListsURL

public void setGazetteerListsURL(URL newGazetteerListsURL)

getGazetteerListsURL

public URL getGazetteerListsURL()

setInputASName

public void setInputASName(String newInputASName)

getInputASName

public String getInputASName()

setOutputASName

public void setOutputASName(String newOutputASName)

getOutputASName

public String getOutputASName()