org.apache.solr.hadoop

Class SolrCloudPartitioner



  • public class SolrCloudPartitioner
    extends <any>
    MapReduce partitioner that partitions the Mapper output such that each SolrInputDocument gets sent to the SolrCloud shard that it would have been sent to if the document were ingested via the standard SolrCloud Near Real Time (NRT) API. In other words, this class implements the same partitioning semantics as the standard SolrCloud NRT API. This enables to mix batch updates from MapReduce ingestion with updates from standard NRT ingestion on the same SolrCloud cluster, using identical unique document keys.