Intended Audience: Software Engineers, Technical Leads, Technical Managers
Pre-Requisites: Basic understanding of search engines
Skills you’ll gain
– Deep understanding of how Matching works
– Deep understanding of how Ranking works
– Ability to customize and tune your system's relevance
– Better debugging/troubleshooting ability
– Ability to measure the search quality of your engine
Duration: 3 days

Apache Lucene/Solr

  • Introduction to Information Retrieval and Lucene/Solr technologies
  • The Apache Lucene/Solr Index
  • Apache Lucene/Solr field analysis: the schema.xml
  • Apache Lucene scoring and document similarity
  • Debug Query deep dive
  • Apache Solr query parsers (with a focus on the dismax and edismax)
  • Field-centric vs term-centric approach
  • Disjunction max vs Boolean approach (and everything in the middle)
  • Function Queries
  • Additive and Multiplicative boost functions and queries
  • Distributed search relevance considerations
  • Search Quality Evaluation: how to test relevance
  • Learning to Rank: a machine learning approach for relevance
  • Relevance War Stories

Elasticsearch

  • Introduction to Information Retrieval
  • Apache Lucene
  • Elasticsearch
  • The Inverted Index
  • Mapping and Text Analysis
  • Apache Lucene scoring and document similarity
  • Validate API
  • Explain API
  • Query DSL
  • Leaf query clauses
  • Emphasis on Multi Match Query Clause
  • Field-centric vs term-centric approach
  • Compound Query Clauses
  • Emphasis on Disjunction Max
  • Emphasis on Function Score
  • Additive and Multiplicative boost functions and queries
  • Distributed search relevance considerations
  • Search Quality Evaluation: how to test relevance
  • Learning to Rank: a machine learning approach for relevance
  • Relevance War Stories