|
|
||||||||||
PREV NEXT | FRAMES NO FRAMES |
ASpiderBase.spider(String)
to indicate that the download was
not done because this file was downloaded under another name.
MasterRepository.setSearchEngineType(String)
.
SperowiderModel.addFoundURL(String, String, boolean)
with an exclude flag of false.
AHandler
to the pool, mapped to a specific MIME type and file
extension.
AHandler
, mapped to a specific file extension.
AHandler
, mapped to a specific file extension.
AHandler
, mapped to a specific MIME type.
AHandler
, mapped to a specific MIME type.
Repository
.
ISperowiderModelReporter
to be delegated to.
ASpiderBase.spider(String)
to indicate that the download failed
because of an HTTP error (a 404, for example).
ISperowiderModel
.SperowiderContext.isCompressIndex()
.
SperoLog.DOWNLOAD_LOGNAME
SperoLog.DOWNLOAD_DOWNLOAD_LOGNAME
SperoLog.DOWNLOAD_ERROR_LOGNAME
SperoLog.DOWNLOAD_INVALIDURL_LOGNAME
SperoLog.DOWNLOAD_SPIDER_LOGNAME
SperoLog.DOWNLOAD_URLFOUND_LOGNAME
SperoLog.DOWNLOAD_URLMAP_LOGNAME
SperoLog.DOWNLOAD_URLPOPFROMQUEUE_LOGNAME
ASpiderBase.spider(String)
to indicate that the download failed
because of a thrown exception.
ASpiderBase.spider(String)
to indicate that the download was not
done because it was blocked by a filter (either robots.txt or the Sperowider
filter itself).
HTMLMetaTag
with the given name property.
MasterRepository
, this class will generate the documentation by delegating
to the other "generator" classes : GeneratorApi
,
GeneratorDocBase
, and GeneratorSearchBase
.MasterRepository
from an XML config file
SperowiderContext
.
AHandler
that will be used for
unrecognized MIME types and file extensions.
File
.
ASpiderBase.ALREADY_GRABBED
,
ASpiderBase.BAD_HTTP_RESPONSE
, ASpiderBase.EXCEPTION
,
ASpiderBase.FILTER_FAILURE
, ASpiderBase.SUCCESS
.
StringBuffer
.
FileNameManager.getRoot()
.
AHandler
for the passed in file extension.
AHandler
for the passed in MIME type.
List
of String
names that are required, but are not
filled in (either by defaults or by calling Configuration.set(String, Object)
or
Configuration.setUnlessSet(String, Object)
.
BasicTable.MEMORY
or BasicTable.CACHED
.
FileNameManager.getRoot()
.
ISperowiderModel.mapFoundURLToRealURL(String, String)
AHandler.getRequiredFilenameSuffix()
by providing a list of
suffixes that could simply be replaced with the value returned by
AHandler.getRequiredFilenameSuffix()
.
SearchResultEntry
objects in
this collection.
SearchResultEntry
at the index.
BasicTable.getMode()
to return either "create cached table" or "create memory table".
PatternMatchingMongler.setPattern(String)
that has the actual URL in it.
AHandler
objects, and a map from MIME types and
file extensions to those objects.TextHtmlHandler
text/css maps to TextCssHandler
application/x-javascript maps to PatternMatchingHandler
.html maps to TextHtmlHandler
.css maps to TextCssHandler
.js maps to PatternMatchingHandler
It also creates the default handler, which is used for unrecognized MIME
types and file extensions.
AHandler
.
ASpiderBase.spider(String)
when an exception is found
when an attempt to load the URL is hit.
SperoLog.INDEX_LOGNAME
SperoLog.INDEX_ERROR_LOGNAME
SperoLog.INDEX_INDEX_LOGNAME
SperowiderContext.getIndexLimit()
.
SimpleSpider
.SimpleSpider
uses.ISperowiderModel
.Indexer
.
Document
for the passed in file, and adds it to the index.
NoHopSimpleSperowiderFilter.init(Element)
IInitializableObject.init(Element)
called on it.
SperowiderConfigurator.instantiate(SperowiderConfiguration, String, String)
with a null
target, returning a SperowiderConfiguration
suitable for use in
Sperowider.Sperowider(SperowiderConfiguration)
.
SperowiderConfigurator.instantiate(SperowiderConfiguration, String, String)
with a null
target, returning a SperowiderConfiguration
suitable for use in
Sperowider.Sperowider(SperowiderConfiguration)
.
SperowiderConfigurator.instantiate(SperowiderConfiguration, String, String)
with a null
target, returning a SperowiderConfiguration
suitable for use in
Sperowider.Sperowider(SperowiderConfiguration)
.
SperowiderConfiguration
suitable for use in
Sperowider.Sperowider(SperowiderConfiguration)
, using the named file, with the named target.
instantiate(configDoc, null)
.
SperowiderConfiguration
based on the named target in the config document.
InputStream
.
SearcherApplet.search(String)
was called.
BasicSperowiderModel.getFoundURLs(String)
or BasicSperowiderModel.getSourceURLs(String)
.
ISperowiderModel.getSourceURLs(String)
and ISperowiderModel.getFoundURLs(String)
,
false otherwise.
SperowiderModel.getFoundURLs(String)
and SperowiderModel.getSourceURLs(String)
,
so this method can return true, if "support-spider-map" is set to true in the model
declaration of the config file.
ISimpleSpiderModel.getSourceURLs(String)
and ISimpleSpiderModel.getFoundURLs(String)
,
false otherwise.
SimpleSpiderModel.getSourceURLs(String)
and SimpleSpiderModel.getFoundURLs(String)
are not supported in this model.
Configuration.set(String, Object)
or
Configuration.setUnlessSet(String, Object)
, or have defaults.
MasterRepository
.MapTable.map(String)
, allowing this to be used as a
simple map.
AMongler.mongle(BufferedReader)
and/or AMongler.mongle(BufferedReader, Writer)
.
AMongler.mongle(BufferedReader)
and/or AMongler.mongle(BufferedReader, Writer)
.
AMongler.mongle(BufferedReader)
and/or AMongler.mongle(BufferedReader, Writer)
.
AMongler.mongle(BufferedReader)
and/or AMongler.mongle(BufferedReader, Writer)
.
AMongler.mongle(BufferedReader)
and/or AMongler.mongle(BufferedReader, Writer)
.
AMongler.mongle(BufferedReader)
and/or AMongler.mongle(BufferedReader, Writer)
.
AMongler.mongle(BufferedReader)
and/or AMongler.mongle(BufferedReader, Writer)
.
AMongler.mongle(BufferedReader)
and/or AMongler.mongle(BufferedReader, Writer)
.
AMongler.mongle(BufferedReader, Writer)
.
AMongler.mongle(BufferedReader)
.
URLMongler.mongle(HTMLShredder)
.
URLMongler.mongle(HTMLShredder)
.
URLMongler.mongle(HTMLShredder)
.
URLMongler.mongle(HTMLShredder)
.
URLMongler.mongle(HTMLShredder)
.
URLMongler.urlFound(String, MongledURLType)
whenever a URL is found, and replaces it with the
value returned by that method.
IThrottle
that does no throttling.
IThrottle
that does not
ever block.InputStream
that
pipes in the data for the requested file.
ISperowiderModel
.RegexFilter
.SperoLog.RECTIFY_LOGNAME
SperoLog.RECTIFY_DOCUMENT_LOGNAME
SperoLog.RECTIFY_ERROR_LOGNAME
AHandler
objects from the HandlerPool
to rectify a file.
Rectifier
objects.AIncludeExcludeFilter
.NoHopRegexSperowiderFilter
instead of this class.PatternMatchingHandler.download(HttpURLConnection, String, String)
to see how URLs are rectified using
this handler.
ISperowiderModelReporter
objects.
ISperowiderModelReporter
objects.
ISperowiderModelReporter
objects.
ISperowiderModelReporter
objects.
ISperowiderModelReporter
objects.
Rectifier
objects,
until no files are left.
Sperowider.setShouldDownload(boolean)
,
and Sperowider.setShouldIndex(boolean)
and Sperowider.setShouldRectify(boolean)
.
MasterRepository.setSearchEngineType(String)
.
ASpiderBase.spider(String)
to indicate that the download succeeded.
SearchResults
collection.
SearchResultEntry
objects, generated by a call
to SearchIndexReader.search(String)
.Configuration
if a
request setting has not been created.AIncludeExcludeFilter
that
uses the filter rules from SimpleMatcher
.SimpleFilter
.SimpleSpiderModel
.NoHopSimpleSperowiderFilter
instead of this class.SperowiderConfiguration
.
Sperowider
.ISperowiderModel
objects.Sperowider
class, and then delegating to that class.SplitIndexFileInputStream
for Lucene.SplitIndexFileInputStream
in a SplitIndexDirectory
.SplitIndexSegment
objects.search(term, true)
.
Sperowider.run()
is called.
Sperowider.run()
is called.
Sperowider.run()
is called.
PatternMatchingMongler.setPattern(String)
that has the actual URL in it.
Configuration.setUnlessSet(String, Object)
, unless the value is null.
IThrottle
, this class
is constructed with the minimum number of milliseconds that must pass between
consecutive times that Throttle.throttle()
will unblock.NoHopSimpleSperowiderFilter
instead.DownloadRunner.setLimit(int)
) is reached, or until no more URLs
are found.
SplitIndexDirectoryDescriptor
.WriteableSplitIndexFileDescriptor
.
|
|
|||||||||||
PREV NEXT | FRAMES NO FRAMES |