Apache Solr Facets and ACL Filters Using Tag and Exclusion

What happens with facets aggregations on fields when documents in the results have been filtered by Access Control Lists ?In such scenarios it is important to use the facet mincount parameter.That specifies the minimum count in the result set for a facet value to appear in the response: mincount=0, all the facet values present in … Continue reading Apache Solr Facets and ACL Filters Using Tag and Exclusion

Apache Solr Distributed Facets

Apache Solr distributed faceting feature has been introduced back in 2008 with the first versions of Solr (1.3 according to this jira[1]) . Until now, I always assumed it just worked, without diving too much into the details. Nowadays distributed search and faceting are extremely popular, you can find them pretty much everywhere (in the … Continue reading Apache Solr Distributed Facets

Apache Lucene BlendedInfixSuggester : How It Works, Bugs And Improvements

The Apache Lucene/Solr suggesters are important to Sease : we explored the topic in the past[1] and we strongly believe the autocomplete feature to be vital for a lot of search applications. This blog post explores in details the current status of the Lucene BlendedInfixSuggester, some bugs of the most recent version ( with the … Continue reading Apache Lucene BlendedInfixSuggester : How It Works, Bugs And Improvements

SolrCloud exceptions with Apache Zookeeper

At the time we speak ( Solr 7.3.1 ) SolrCloud is a reliable and stable distributed architecture for Apache Solr. But it is not perfect and failures happen. Apache Zookeeper[1] is the system responsible of managing the communications across the SolrCloud cluster. It contains the shared collections configurations and it has the view of the … Continue reading SolrCloud exceptions with Apache Zookeeper

SolrCloud Leader Election Failing

At the time we speak ( Solr 7.3.0 ) SolrCloud is a reliable and stable distributed architecture for Apache Solr. But it is not perfect and failures happen. This lightening blog post will present some practical tips to follow when a specific shard of a collection is down with no leader and the situation is … Continue reading SolrCloud Leader Election Failing

ECIR 2018 Experience

This blog is a quick summary of my (subjective) experience at ECIR 2018 : the 40th European Conference on Information Retrieval, hosted in Grenoble (France) from 26/03/2018 to 29/03/2018. Deep Learning and Explicability Eight long papers accepted were about Deep Learning. The topics "Neural Network" and "Word Embedding" were the most occurring in the accepted … Continue reading ECIR 2018 Experience

Distributed Search Tips for Apache Solr

Distributed search is the foundation for Apache Solr Scalability : It's possible to distributed search across different Apache Solr nodes of the same collection ( both in a  legacy[1] or SolrCloud[2] architecture), but it is also possible to distribute search across different collections in a SolrCloud cluster. Aggregating results from different collections may be useful … Continue reading Distributed Search Tips for Apache Solr

Solr Is Learning To Rank Better – Part 4 – Solr Integration

Last Stage Of The Journey This blog post is about the Apache Solr Learning To Rank ( LTR ) integration. We modelled our dataset, we collected the data and refined it in Part 1 . Trained the model in Part 2 . Analysed and evaluate the model and training set in Part 3 . We … Continue reading Solr Is Learning To Rank Better – Part 4 – Solr Integration

Solr Is Learning To Rank Better – Part 3 – Ltr tools

Apache Solr Learning to Rank - Things Get Serious This blog post is about the Apache Solr Learning to Rank Tools : a set of tools to ease the utilisation of the Apache Solr Learning To Rank integration. The model has been trained in Part 2, we are ready to deploy it to Solr, but … Continue reading Solr Is Learning To Rank Better – Part 3 – Ltr tools

Solr Is Learning To Rank Better – Part 2 – Model Training

Model Training For Apache Solr Learning To Rank  If you want to train a model for Apache Solr Learning To Rank , you are in the right place. This blog post is about the model training phase for the Apache Solr Learning To Rank integration. We modelled our dataset, we collected the data and refined it … Continue reading Solr Is Learning To Rank Better – Part 2 – Model Training