Intended Audience: Software Engineers, Technical Leads, Technical Managers
Pre-Requisites: Basic understanding of search engines
Skills you’ll gain
– Deep understanding of how Matching works
– Deep understanding of how Ranking works
– Capacity of customizing and tuning your system relevance
– Better debugging/troubleshooting ability
– How to measure the search quality of your engine
Duration: 3 days
Apache Lucene/Solr
- Introduction on Information Retrieval and Lucene/Solr technologies
- The Apache Lucene/Solr Index
- Apache Lucene/Solr field analysis : the schema.xml
- Apache Lucene scoring and document similarity
- Debug Query deep dive
- Apache Solr query parsers (with a focus on the dismax and edismax)
- Field centric vs term centric approach
- Disjunction max vs Boolean approach (and everything in the middle)
- Function Queries
- Additive and Multiplicative boost functions and queries
- Distributed search relevance considerations
- Search Quality Evaluation : how to test relevance
- Learning to Rank : a machine learning approach for relevance
- Relevance War Stories:
Elasticsearch
- Introduction to Information Retrieval
- Apache Lucene
- Elasticsearch
- The Inverted Index
- Mapping and Text Analysis
- Apache Lucene scoring and document similarity
- Validate API
- Explain API
- Query DSL
- Leaf query clauses
- Emphasis on Multi Match Query Clause
- Field centric vs term centric approach
- Compound Query Clauses
- Emphasis on Disjunction Max
- Emphasis on Function Score
- Additive and Multiplicative boost functions and queries
- Distributed search relevance considerations
- Search Quality Evaluation : how to test relevance
- Learning to Rank : a machine learning approach for relevance
- Relevance War Stories: