This blog post aims to give a better understanding of Docvalues and stored fields in Apache Solr for the operations in which they can be used interchangeably.
In this post we describe an approach to solve the problem of an application that requires both Full and Atomic Updates, using one of the powerful concepts in Object Oriented Programming: Polymorphism.
Scenario You’re working as a search engineer for XYZ Ltd, a company which sells electric components. XYZ provided you the application logs of the last six months, and some business requirements. Two kinds of customers, two kinds of requirements, two kinds of search The log analysis shows that XYZ has mainly two kinds of customers: … Continue reading Apache Solr: orchestrating Known item and Full-text search
Quantity detection? What is a quantity? And why do we need to detect it? A quantity, as described by Martin Fowler in his "Analysis Patterns"  is defined as a pair which combines an amount and unit (such as 30 litres, 0.25 cl, or 140 cm). In search-based applications, there are many cases where you may … Continue reading Give the height the right weight: quantities detection in Apache Solr
Introduction This blog post is about the Lucene Inverted Index and how Apache Solr internally works. When playing with Solr systems, understanding and properly configuring the underline Lucene Index is fundamental to deeply control your search. With a better knowledge of how the index looks like and how each component is used, you … Continue reading Exploring Solr Internals : The Lucene Inverted Index
Introduction This blog post is about the Solr classification module and the way Lucene classification has been integrated at indexing time. In the previous blog  we have explored the world of Lucene Classification and the extension to use it for Document Classification . It comes natural to integrate Solr with the Classification module and … Continue reading Solr Document Classification – Part 1 – Indexing Time