org.niocchi.gc
Class GenericCrawler
java.lang.Object
org.niocchi.gc.GenericCrawler
- All Implemented Interfaces:
- Monitorable
public class GenericCrawler
- extends java.lang.Object
- implements Monitorable
A simple crawler given as an implementation example.
It uses a SimpleFileURLPool
which reads URL from a file, one URL per line.
Methods inherited from class java.lang.Object |
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
GenericCrawler
public GenericCrawler()
init
public void init(java.lang.String seedFile)
throws java.io.IOException
- Throws:
java.io.IOException
crawl
public void crawl()
throws java.io.IOException,
java.lang.InterruptedException,
ResourceException,
URLPoolException
- Throws:
java.io.IOException
java.lang.InterruptedException
ResourceException
URLPoolException
printMonitoredState
public void printMonitoredState(java.io.PrintStream out_)
- Specified by:
printMonitoredState
in interface Monitorable
dump
public void dump()
- Specified by:
dump
in interface Monitorable
execute
public void execute(java.lang.String[] argv_)
throws java.lang.Exception
- Throws:
java.lang.Exception
main
public static void main(java.lang.String[] argv_)
throws java.lang.Exception
- Throws:
java.lang.Exception