org.erowid.sperowider.simple
Class SimplePageSpider
java.lang.Object
org.erowid.sperowider.ASpiderBase
org.erowid.sperowider.simple.SimplePageSpider
- public class SimplePageSpider
- extends ASpiderBase
Takes an HTML location, indexes it, grabs the list of URLs.
- Version:
- $Id: SimplePageSpider.java,v 1.4 2005/01/16 05:03:01 gurustu Exp $
- Author:
- sstatman@real.com
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
SimplePageSpider
public SimplePageSpider(ISimpleSpiderModel model,
ISimpleSpiderFilter urlFilter,
IndexWriter writer)
- Constructs a page handler with a given data store.
handleConnectionException
public void handleConnectionException(String sourceUrl,
Throwable e)
- Logs error.
- Specified by:
handleConnectionException
in class ASpiderBase
handleConnection
public int handleConnection(String sourceUrl,
HttpURLConnection connection)
- Loads HTML page, parses for links, indexes.
- Specified by:
handleConnection
in class ASpiderBase