|
|||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Object | +--javax.swing.text.html.HTMLEditorKit.ParserCallback | +--gate.util.HtmlLinksExtractor
This class extracts links from HTML files. It has been hacked to build the contents of http://gate.ac.uk/sitemap.html; you probably don't want to use it for anything else!
Implements the behaviour of the HTML reader. Methods of an object of this class are called by the HTML parser when events will appear.
Field Summary | |
private HTML.Tag |
currentTag
The tag currently being processed |
(package private) static String |
currFile
Name of the file we're currently processing |
(package private) static String |
currPath
Path to the file we're currently processing |
private static boolean |
DEBUG
Debug flag |
(package private) static String |
endUl
will contain </UL> after first title |
(package private) static boolean |
firstTitle
whether we've done a title before |
Fields inherited from class javax.swing.text.html.HTMLEditorKit.ParserCallback |
IMPLIED |
Constructor Summary | |
HtmlLinksExtractor()
|
Method Summary | |
void |
flush()
This method is called once, when the HTML parser reaches the end of its input streamin order to notify the parserCallback that there is nothing more to parse. |
void |
handleComment(char[] text,
int pos)
This method is called when the HTML parser encounts a comment |
void |
handleEndTag(HTML.Tag t,
int pos)
This method is called when the HTML parser encounts the end of a tag that means that the tag is paired by a beginning tag |
void |
handleError(String errorMsg,
int pos)
This method is called when the HTML parser encounts an error it depends on the programmer if he wants to deal with that error |
void |
handleSimpleTag(HTML.Tag t,
MutableAttributeSet a,
int pos)
This method is called when the HTML parser encounts an empty tag |
void |
handleStartTag(HTML.Tag t,
MutableAttributeSet a,
int pos)
This method is called when the HTML parser encounts the beginning of a tag that means that the tag is paired by an end tag and it's not an empty one. |
void |
handleText(char[] text,
int pos)
This method is called when the HTML parser encounts text (PCDATA) |
private static List |
listAllFiles(File aFile,
Set foldersToIgnore)
Given a certain folder it lists recursively all the files contained in that folder. |
private static void |
listFilesRec(File aFile,
List fileNames,
List foldersToExplore,
Set foldersToIgnore)
Helper method for listAllFiles |
static void |
main(String[] args)
Extract links from all .html files below a directory |
private void |
printAttributes(MutableAttributeSet a)
|
Methods inherited from class javax.swing.text.html.HTMLEditorKit.ParserCallback |
handleEndOfLineString |
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
private static final boolean DEBUG
private HTML.Tag currentTag
static boolean firstTitle
static String endUl
static String currFile
static String currPath
Constructor Detail |
public HtmlLinksExtractor()
Method Detail |
public void handleStartTag(HTML.Tag t, MutableAttributeSet a, int pos)
handleStartTag
in class HTMLEditorKit.ParserCallback
private void printAttributes(MutableAttributeSet a)
public void handleEndTag(HTML.Tag t, int pos)
handleEndTag
in class HTMLEditorKit.ParserCallback
public void handleSimpleTag(HTML.Tag t, MutableAttributeSet a, int pos)
handleSimpleTag
in class HTMLEditorKit.ParserCallback
public void handleText(char[] text, int pos)
handleText
in class HTMLEditorKit.ParserCallback
public void handleError(String errorMsg, int pos)
handleError
in class HTMLEditorKit.ParserCallback
public void flush() throws BadLocationException
flush
in class HTMLEditorKit.ParserCallback
BadLocationException
public void handleComment(char[] text, int pos)
handleComment
in class HTMLEditorKit.ParserCallback
private static List listAllFiles(File aFile, Set foldersToIgnore)
private static void listFilesRec(File aFile, List fileNames, List foldersToExplore, Set foldersToIgnore)
public static void main(String[] args)
|
|||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |