lucene - SOLR | De-Duplication | Remove duplicate records based on their status
I am indexing two different models that hold the same data but have different statuses.
Scenario 1
Input:
{model: "aaaa", name: "abc", status: "t"}
{model: "bbbb", name: "abc", status: "a"}
Expected output:
{model: "bbbb", name: "abc", status: "a"}

Scenario 2
Input:
{model: "aaaa", name: "abc", status: "a"}
{model: "bbbb", name: "abc", status: "t"}
Expected output:
{model: "aaaa", name: "abc", status: "a"}

Scenario 3
Input:
{model: "aaaa", name: "abc", status: "a"}
{model: "bbbb", name: "abc", status: "a"}
Expected output (either record is fine):
{model: "aaaa", name: "abc", status: "a"}

Scenario 4
Input:
{model: "aaaa", name: "abc", status: "t"}
{model: "bbbb", name: "abc", status: "t"}
Expected output (either record is fine):
{model: "aaaa", name: "abc", status: "t"}
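Scenarios 3 and 4 work as expected with my current configuration, which is given below.
For scenarios 1 and 2, the surviving record should be chosen based on its status: the record with status "a" should be kept over the one with status "t".
Please help me fix scenarios 1 and 2.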
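To make the expected behaviour in the four scenarios concrete, here is a minimal Python sketch of the selection rule I am after. It is not Solr code and not part of my setup; the function name dedupe_by_name and the STATUS_RANK table are hypothetical names used only for illustration, and it assumes that records count as duplicates when their name values match.

```python
from typing import Dict, Iterable, List

# Assumption for illustration: status "a" is preferred over status "t".
# A lower rank wins; any other status is ranked last.
STATUS_RANK = {"a": 0, "t": 1}


def rank(record: Dict[str, str]) -> int:
    """Return the preference rank of a record based on its status."""
    return STATUS_RANK.get(record.get("status", ""), len(STATUS_RANK))


def dedupe_by_name(records: Iterable[Dict[str, str]]) -> List[Dict[str, str]]:
    """Keep one record per name, preferring the better-ranked status.

    On a tie (same status) the record seen first is kept, which matches
    the "either one" expectation of scenarios 3 and 4.
    """
    best: Dict[str, Dict[str, str]] = {}
    for rec in records:
        key = rec["name"]
        current = best.get(key)
        if current is None or rank(rec) < rank(current):
            best[key] = rec
    return list(best.values())


if __name__ == "__main__":
    scenario_1 = [
        {"model": "aaaa", "name": "abc", "status": "t"},
        {"model": "bbbb", "name": "abc", "status": "a"},
    ]
    print(dedupe_by_name(scenario_1))
```

For scenario 1 this keeps the "bbbb" record, and for scenario 2 it keeps the "aaaa" record, which is the behaviour I would like Solr to reproduce at index time.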
Solr version: 5.3
solrconfig.xml:
<requestHandler name="/update" class="solr.UpdateRequestHandler">
  <lst name="defaults">
    <str name="update.chain">dedupe</str>
  </lst>
</requestHandler>
<updateRequestProcessorChain name="dedupe">
  <processor class="solr.processor.SignatureUpdateProcessorFactory">
    <bool name="enabled">true</bool>
    <str name="signatureField">signature</str>
    <bool name="overwriteDupes">true</bool>
    <str name="fields">id</str>
    <str name="signatureClass">solr.processor.Lookup3Signature</str>
  </processor>
  <processor class="solr.LogUpdateProcessorFactory" />
  <processor class="solr.RunUpdateProcessorFactory" />
</updateRequestProcessorChain>