eTextReader.search
Class TextParser

java.lang.Object
  extended by eTextReader.NotationEventAdapter
      extended by eTextReader.search.TextParser
All Implemented Interfaces:
NotationEventListener, java.io.Serializable, java.util.EventListener

public class TextParser
extends NotationEventAdapter
implements java.io.Serializable

See Also:
Serialized Form

Field Summary
private  ContentPane cp
           
private  java.lang.String currentURL
           
private  int currentWordIndex
           
protected static java.lang.String delimiters
           
private  java.util.HashMap<java.lang.String,java.lang.Integer> inverseWordMap
          Allows quick look up of the words in the wordList structure
private  java.lang.Object loadingLock
           
private  eTextReader.loggingInterface.Logger log
           
private  org.apache.log4j.Logger logger
           
private  java.util.ArrayList<WordWithAddress> parsedDocument
           
private static java.util.Set<java.lang.String> stopWords
          A set of words that shouldn't be indexed
private  java.util.ArrayList<java.lang.String> wordList
          Contains a list of the unique words in the document
 
Constructor Summary
TextParser(java.net.URL url)
           
 
Method Summary
 java.util.ArrayList getParsedDocument()
           
 java.util.List<java.lang.String> getWordList()
           
 void pageChanged(NotationEvent ne)
          Process a page change event.
 
Methods inherited from class eTextReader.NotationEventAdapter
errorOccurred, notationEventOccurred
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

stopWords

private static java.util.Set<java.lang.String> stopWords
A set of words that shouldn't be indexed


currentURL

private java.lang.String currentURL

cp

private ContentPane cp

parsedDocument

private java.util.ArrayList<WordWithAddress> parsedDocument

log

private eTextReader.loggingInterface.Logger log

loadingLock

private java.lang.Object loadingLock

wordList

private java.util.ArrayList<java.lang.String> wordList
Contains a list of the unique words in the document


inverseWordMap

private java.util.HashMap<java.lang.String,java.lang.Integer> inverseWordMap
Allows quick look up of the words in the wordList structure


currentWordIndex

private int currentWordIndex

logger

private org.apache.log4j.Logger logger

delimiters

protected static final java.lang.String delimiters
See Also:
Constant Field Values
Constructor Detail

TextParser

public TextParser(java.net.URL url)
Method Detail

pageChanged

public void pageChanged(NotationEvent ne)
Description copied from interface: NotationEventListener
Process a page change event.

Specified by:
pageChanged in interface NotationEventListener
Overrides:
pageChanged in class NotationEventAdapter

getParsedDocument

public java.util.ArrayList getParsedDocument()

getWordList

public java.util.List<java.lang.String> getWordList()