Tips and Tricks
Welcome to our Tips and Tricks Blog, your ultimate resource for mastering the nuances of information retrieval. This specialized section is designed to empower professionals, enthusiasts, and learners alike with actionable insights, innovative techniques, and best practices that amplify efficiency and effectiveness in information retrieval tasks.

Solr AutoGeneratePhraseQueries and Minimum Should Match with Synonyms
The post discusses the interaction between three Solr parameters: autoGeneratePhraseQueries, synonyms, and minimum should match.

How to Use Apache Solr Request Parameters API
How to set and view Apache Solr Request Parameters API. Including real world examples for the Ping functionality.

Apache Solr autoGeneratePhraseQueries and Schema
Explore how the Apache Solr autogeneratePhraseQueries parameter works and how its default is affected by the schema version.

Apache Solr Filter Queries: Integer or String Fields?
This blog post explores whether to index a field in Apache Solr as a string or integer for optimal filter query performance.

Neo4J Optimization Tips
Tips and tricks regarding Neo4J optimization (query optimization, memory and disk configuration).

How to Sort Apache Solr Results in Random Order
Let’s explore how to obtain a set of documents randomly sorted, regardless of their relevance, in Apache Solr.

How to calculate aggregations in Elasticsearch as percentages?
This blog post shows you how to calculate aggregations in Elasticsearch as percentages through the use of Bucket Script Aggregation

Online Search Quality Evaluation With Kibana – Queries in Common
This ‘tips and tricks’ describes the steps to follow to compare models on common queries in online search quality evaluation

How to Import Pandas in AWS Lambda
In this blog post, you will learn how to import the Pandas library in AWS Lambda in order to execute python scripts.

How to Deal with Too Many ‘object’ in Pandas from JSON Parsing
We propose and test a way to manage categorical data during the collection and store it directly as numeric types in the JSON.

Efficiently Manage Numeric Ids in JSON and Pandas
In this blog post we make an experimental analysis to identify the best data type to use when dealing with ids.

How to manage large JSON efficiently and quickly: multiple files
Tips and tricks to find out efficient and fast ways to read and parse a big JSON file in Python using real-world application

How to manage large JSON files in Python efficiently and quickly
Tips and tricks to find out efficient and fast ways to read and parse a big JSON file in Python using real-world application

Drop constant features: a real-world Learning To Rank scenario
Does removing constant features affect model performance? Find out with our real-world Learning to Rank application

The luceneMatchVersion Parameter in Apache Solr
This blog explores how the luceneMatchVersion parameter in the solrconfig.xml works in Apache Solr. Dos and don’ts, and anything in between.

Common Errors with Apache Solr Feature and Model Stores
Common errors and warnings in manipulating feature stores and models in Solr. Pay attention also to JVM Heap and Zookeeper.

Apache Solr Learning to Rank Feature Stores and Models
How to list, upload, delete feature stores and models necessary in Solr for learning to rank.

Elasticsearch Alias Field Type
In this blog post we will briefly recap things we can’t do when using the alias field type in Elasticsearch.

Digging in the Solr code: 5 minutes how to
Let’s quickly setup a Solr development environment for implementation and debugging purposes, with our 5 minutes how to!

The Request Handlers Jungle – handleSelect and qt Parameter
This blog post is about the handleSelect and qt parameter in Apache Solr and their relation with the Request Handler mechanism.

Apache Solr Facets and ACL Filters Using Tag and Exclusion
Tag and Exclusion for Apache Solr Faceting is often misused. Here’s a guide to optimal utilisation with a focus on ACL filtering scenarios.

Apache Solr Distributed Facets
How faceting is calculated in Apache Solr distributed architectures. It presents inner details explanation and practical examples.

Synonyms and Stopwords: Vademecum
In this post we’ll cover two additional synonyms scenarios and we’ll try to summarise all previous tips in a concise form.

Still Synonyms + Stopwords?? Mamma mia!
Synonyms and Stopwords: how to deal with missing synonym detection when the original text contains some “intruder” stopword?

Synonyms + Stopwords?? OMG!
How can we manage synonyms and stopwords at query time? In this article we will try to address this specific and very popular use case.

Apache Solr/Elasticsearch: How to Manage Multi-term Concepts out of the Box?
Approach for searching multi-term entities/concepts out-of-the-box, without installing any additional components.

SolrCloud exceptions with Apache Zookeeper
Practical tips to follow when your client application encounters some classic exceptions dealing with SolrCloud and Apache Zookeeper.