org.niocchi.gc
Class GenericCrawler

java.lang.Object
  extended by org.niocchi.gc.GenericCrawler
All Implemented Interfaces:
Monitorable

public class GenericCrawler
extends java.lang.Object
implements Monitorable

A simple crawler provided as an example implementation.
It uses a SimpleFileURLPool, which reads URLs from a file, one URL per line.
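
The seed file format described above (one URL per line) can be illustrated with a minimal sketch. This is not the SimpleFileURLPool implementation; the class name SeedFileReader and the method readSeeds are hypothetical, and trimming whitespace and skipping blank lines are assumptions that the Niocchi documentation does not confirm.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Niocchi: reads a seed file that
// holds one URL per line and returns the URLs in file order.
class SeedFileReader {

    static List<String> readSeeds(Path seedFile) throws IOException {
        List<String> urls = new ArrayList<>();
        try (BufferedReader reader =
                 Files.newBufferedReader(seedFile, StandardCharsets.UTF_8)) {
            String line;
            while ((line = reader.readLine()) != null) {
                // Assumption: surrounding whitespace is not significant
                // and blank lines carry no URL, so both are dropped.
                String url = line.trim();
                if (!url.isEmpty()) {
                    urls.add(url);
                }
            }
        }
        return urls;
    }
}
```

Calling SeedFileReader.readSeeds(Paths.get("seeds.txt")) would return the non-empty lines of a seed file as a list of strings. A file in this format is presumably what init(java.lang.String seedFile) expects as its argument.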


Constructor Summary
GenericCrawler()
           
 
Method Summary
 void crawl()
           
 void dump()
           
 void execute(java.lang.String[] argv_)
           
 void init(java.lang.String seedFile)
           
static void main(java.lang.String[] argv_)
           
 void printMonitoredState(java.io.PrintStream out_)
           
 
Methods inherited from class java.lang.Object
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

GenericCrawler

public GenericCrawler()
Method Detail

init

public void init(java.lang.String seedFile)
          throws java.io.IOException
Throws:
java.io.IOException

crawl

public void crawl()
           throws java.io.IOException,
                  java.lang.InterruptedException,
                  ResourceException,
                  URLPoolException
Throws:
java.io.IOException
java.lang.InterruptedException
ResourceException
URLPoolException

printMonitoredState

public void printMonitoredState(java.io.PrintStream out_)
Specified by:
printMonitoredState in interface Monitorable

dump

public void dump()
Specified by:
dump in interface Monitorable

execute

public void execute(java.lang.String[] argv_)
             throws java.lang.Exception
Throws:
java.lang.Exception

main

public static void main(java.lang.String[] argv_)
                 throws java.lang.Exception
Throws:
java.lang.Exception