Sease at ApacheCon 2016

November 21, 2016

ApacheCon presents the latest innovations from dozens of Apache projects and their communities in a collaborative, vendor-neutral environment.

Location: Seville (Spain)

Date: 16-18 November 2016

our talk

This presentation starts by introducing how Apache Lucene can classify documents using data structures that already exist in your index, instead of requiring you to generate and supply external training sets. Building on that introduction, the focus moves to the extensions of the Lucene Classification module introduced in Lucene 6.0 and to the module's integration into Solr 6.1. These extensions allow you to classify at document level, with individual field weighting, numeric field support, lat/lon fields and more. The talk then explores the Solr ClassificationUpdateProcessor: how it works and how to use it, including basic and advanced features such as multi-class support and classification context filtering. The presentation includes practical examples and real-world use cases.
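To make the idea concrete, here is a minimal Python sketch of k-nearest-neighbour classification that uses already-indexed documents as the training set, with per-field weights. It is an illustration only and not the Lucene or Solr API: the in-memory list standing in for the index, the term-overlap score, and the names `classify`, `process_update`, `field_weights` and `cat` are all assumptions made for this example. The last function loosely mimics what an update processor could do at indexing time, assuming it assigns a class only when the class field is missing.

```python
from collections import Counter


def tokenize(text):
    """Very rough analysis step: lowercase and split on whitespace."""
    return text.lower().split()


def similarity(doc, indexed_doc, field_weights):
    """Weighted term overlap across fields (a stand-in for real index scoring)."""
    score = 0.0
    for field, weight in field_weights.items():
        doc_terms = Counter(tokenize(doc.get(field, "")))
        indexed_terms = Counter(tokenize(indexed_doc.get(field, "")))
        overlap = sum(min(count, indexed_terms[term]) for term, count in doc_terms.items())
        score += weight * overlap
    return score


def classify(doc, index, class_field, field_weights, k=3):
    """Return the class with the highest summed score among the k nearest labelled documents."""
    scored = []
    for indexed_doc in index:
        if class_field not in indexed_doc:
            continue  # only documents that already carry a class can act as training data
        score = similarity(doc, indexed_doc, field_weights)
        if score > 0:
            scored.append((score, indexed_doc[class_field]))
    scored.sort(key=lambda pair: pair[0], reverse=True)
    votes = Counter()
    for score, label in scored[:k]:
        votes[label] += score
    return votes.most_common(1)[0][0] if votes else None


def process_update(doc, index, class_field, field_weights, k=3):
    """Assumed update-time behaviour: classify only when the class field is missing, then index."""
    if class_field not in doc:
        label = classify(doc, index, class_field, field_weights, k)
        if label is not None:
            doc = {**doc, class_field: label}
    index.append(doc)
    return doc


# Hypothetical index content: the labelled documents double as the training set.
index = [
    {"title": "solr relevance tuning", "body": "boost query fields in solr", "cat": "search"},
    {"title": "lucene index internals", "body": "segments and postings in lucene", "cat": "search"},
    {"title": "neural network training", "body": "gradient descent for deep models", "cat": "ml"},
    {"title": "gradient boosting trees", "body": "training models with boosting", "cat": "ml"},
]
field_weights = {"title": 2.0, "body": 1.0}  # title matches count twice as much as body matches

new_doc = {"title": "tuning lucene postings", "body": "index segments in solr"}
result = process_update(new_doc, index, "cat", field_weights, k=3)
print(result["cat"])
```

In this sketch the field weights play the role of the individual field weighting mentioned above: raising the weight of `title` makes title matches dominate the neighbour ranking, which in turn changes which classes get to vote.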

our speaker

Alessandro Benedetti

FOUNDER @ SEASE

APACHE LUCENE/SOLR COMMITTER
APACHE SOLR PMC MEMBER

Alessandro is a Senior Search Software Engineer whose focus is on R&D in Information Retrieval, Information Extraction, Natural Language Processing, and Machine Learning.
He firmly believes in Open Source as a way to build a bridge between Academia and Industry and facilitate the progress of applied research.

slides

Apache Lucene/Solr Document Classification from Sease


We are Sease, an Information Retrieval Company based in London, focused on providing R&D project guidance and implementation, Search consulting services, Training, and Search solutions using open source software like Apache Lucene/Solr, Elasticsearch, OpenSearch and Vespa.
