public class ExtractingDocumentLoader extends ContentStreamLoader
Modifier and Type | Class and Description |
---|---|
static class | ExtractingDocumentLoader.MostlyPassthroughHtmlMapper |
Modifier and Type | Field and Description |
---|---|
protected AutoDetectParser | autoDetectParser |
protected TikaConfig | config |
protected SolrContentHandlerFactory | factory |
protected ParseContextConfig | parseContextConfig |
static String | TEXT_FORMAT: Extract Only supported format. |
static String | XML_FORMAT: Extract Only supported format. |
Fields inherited from class ContentStreamLoader: pollQueueTime
Constructor and Description |
---|
ExtractingDocumentLoader(SolrQueryRequest req, UpdateRequestProcessor processor, TikaConfig config, ParseContextConfig parseContextConfig, SolrContentHandlerFactory factory) |
Modifier and Type | Method and Description |
---|---|
void | load(SolrQueryRequest req, SolrQueryResponse rsp, ContentStream stream, UpdateRequestProcessor processor): Loaders are responsible for closing the stream. |
Methods inherited from class ContentStreamLoader: getDefaultWT, init
public static final String TEXT_FORMAT
public static final String XML_FORMAT
protected AutoDetectParser autoDetectParser
protected TikaConfig config
protected ParseContextConfig parseContextConfig
protected SolrContentHandlerFactory factory
public ExtractingDocumentLoader(SolrQueryRequest req, UpdateRequestProcessor processor, TikaConfig config, ParseContextConfig parseContextConfig, SolrContentHandlerFactory factory)
public void load(SolrQueryRequest req, SolrQueryResponse rsp, ContentStream stream, UpdateRequestProcessor processor) throws Exception
Description copied from class: ContentStreamLoader
Loaders are responsible for closing the stream.
Specified by: load in class ContentStreamLoader
Parameters:
req - The input SolrQueryRequest
rsp - The response, in case the Loader wishes to add anything
stream - The ContentStream to add
processor - The UpdateRequestProcessor to use
Throws:
Exception
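The contract that loaders are responsible for closing the stream can be illustrated with a minimal, Solr-free sketch. Everything in it is a hypothetical stand-in introduced for illustration: `SimpleContentStream` is not the Solr `ContentStream` interface, and this `load` method is not the real `ExtractingDocumentLoader.load`. It only shows one common way to guarantee the close, using try-with-resources, under the assumption that the loader is the one that opens the underlying `InputStream`.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;

public class StreamClosingSketch {

    /** Hypothetical stand-in for a content stream; NOT the Solr ContentStream interface. */
    interface SimpleContentStream {
        InputStream getStream() throws IOException;
    }

    /** In-memory stream that records whether close() was called. */
    static final class TrackingInputStream extends ByteArrayInputStream {
        boolean closed = false;

        TrackingInputStream(byte[] data) {
            super(data);
        }

        @Override
        public void close() throws IOException {
            closed = true;
            super.close();
        }
    }

    /**
     * Hypothetical loader method: reads the whole stream as UTF-8 text.
     * try-with-resources closes the stream whether reading succeeds or throws.
     */
    static String load(SimpleContentStream stream) throws IOException {
        try (InputStream in = stream.getStream()) {
            return new String(in.readAllBytes(), StandardCharsets.UTF_8);
        }
    }

    public static void main(String[] args) throws IOException {
        TrackingInputStream raw =
                new TrackingInputStream("hello".getBytes(StandardCharsets.UTF_8));
        String text = load(() -> raw);
        System.out.println(text + " closed=" + raw.closed);
    }
}
```

A caller that follows this convention hands the stream to the loader and does not close it afterwards; closing happens exactly once, inside the loader.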