This Week in Elasticsearch - June 18, 2014

Welcome to This Week in Elasticsearch. In this roundup, we try to inform you about the latest and greatest changes in Elasticsearch. We cover what happened in the GitHub repositories, as well as many Elasticsearch events happening worldwide, and give you a small peek into the future of the project.

David Pilato explains Elasticsearch in just 90 seconds ....

Elasticsearch Core

  • Analysis: Improvements to StemmerTokenFilter (#6452, master and 1.3)
  • Build: Generate source jars for tests (#6125, master and 1.3)
  • More Like This Query: Create only one MLT query per field for all queried items (#6404, master and 1.3)
  • Allocation: Enable DiskThresholdDecider by default (#6204, master and 1.3)
  • Cluster: Wait till node is part of cluster state for join process until returning to node (#6480, master and 1.3)
  • Query API: Added caching support to geohash_filter (#6478, master and 1.3)
  • Facets: Marked facets as deprecated in favor of aggregations (#6485, master and 1.3)
  • Packaging: Remove java-6 directories from debian init script (#6350, master and 1.3)
  • Java API: Make sure afterBulk is always called in BulkProcessor (#5038, master and 1.3)
  • Aggregations: Fix reducing of range aggregations (#7484, master, 1.3 and 1.2)
  • Aggregations: Remove ordinals execution hint (#6499, master and 1.3)
  • Fielddata: Don't expose hashes anymore (#6500, master and 1.3)
  • Aliases: Save memory when initializing maps (#6504, master, 1.3, 1.2 and 1.1)
  • Aggregations: Moved BucketsAggregator#docCounts field to IntArray (#6529, master and 1.3)
  • Aggregations: GlobalOrdinalsStringTermsAggregator is inefficient for high-cardinality fields (#6518, master and 1.3)
  • Percolator: Fix handling of nested documents (#6540, master and 1.3)
  • Aggregations: Added percentile rank aggregation (#6386, master and 1.3)

Shay Banon on all things Elasticsearch from the recent Bay Area Search Meetup

Elasticsearch Ecosystem

Here's some more information about what is happening in the ecosystem we are maintaining around the ELK stack - that's Elasticsearch plus Logstash and Kibana - including plugin and driver releases.

Slides & Videos

Michael Kaisser on Geospatial Analysis of Social Media posts with Elasticsearch from Berlin Buzzwords 2014

Matthew Britt from the University of Michigan showcases how to better handle HPC logs using Logstash

From June's Elasticsearch London Meetup

With excellent and useful information for non-hipsters, too!

Jens Kohl's overview of the Elasticsearch PHP client Itamar Syn-Hershko shares the Ultimate Guide to Elasticsearch plugins at Berlin Buzzwords 2014

Where to Find Us

We'd love to feature all the great Elasticsearch, Logstash, and Kibana presentations and meetups happening worldwide in this section. If you're speaking or hosting a meetup, let our Community Manager, Leslie Hawthorn, know!

Israel

  • The IGT Cloud Meetup Group is hosting a workshop on Applied Cloud Computing with Google Cloud Platform at Google Tel Aviv. Topics will include BigQuery with Logstash as Application Log Analysis Platform. The workshop runs from 9:30-13:30 on July 7th.
  • The Second Elasticsearch Tel Aviv Meetup will convene on July 28th, focusing on real world use cases. The agenda is still being finalized, but registration is already open.

The Netherlands

We've got not one but two awesome conferences going on in Amsterdam later this week:

  • DevOps Days Amsterdam: Join us June 19-21st for an Elasticsearch workshop, a little on DevOps for Happier Humans and all the news you can use at the Elasticsearch table. Swing by to say hello to Leslie Hawthorn and Kurt Hurtado! Lee Hinman will also join us for the workshop. Yay Lee!
  • GOTO Amsterdam: Our CTO, Shay Banon, will treat the crowd to a deep dive on Elasticsearch's APIs, plus we'll have lots of knowledge and swag on offer at our booth. See you there!

Plus, we're hosting Friday drinks on June 27th at 6:00 PM in Amsterdam. You'll have the chance to meet tons of our core developers, as the whole company will be visiting our EU HQ that week for our annual all hands gathering.

United States

  • New Meetup: The San Francisco Ruby on Rails meetup group will get together tomorrow night, Thursday June 19th, to discuss Elasticsearch and Logging. Doors open at 6:30 PM and registration is still open.
  • New Meetup: The Boston Puppet User Group will get together on June 24th to discuss how you can build a modern infrastructure in just 45 minutes. They'll be talking Logstash as part of the festivities. Doors open at 7:00 PM and registration is still open.
  • The Philadelphia Elasticsearch Meetup group will be getting together for their second meetup on June 26th. Details are still in the works, but sign up now so you know where to go for ELK treats on the 26th.
  • The SYN/(st)ACK NYC User Group will convene on June 30th at 6:30 PM. Squarespace is hosting the festivities, and Franklin Angulo, one of their Software Engineers, will be presenting. Franklin will cover building a scalable monitoring and alerting system using Graphite, Grafana, Collectd, Nagios, Logstash, Elasticsearch, and Kibana, among others.

Where to Find You

Our Community Manager, Leslie Hawthorn, is hard at work to help folks create more Elasticsearch meetup groups and to help meetup organizers find more speakers. If you are interested in either effort, take a moment to let her know.

Oh yeah, we're also hiring. If you'd like us to find you for employment purposes, just drop us a note.  We care more about your skill set and passion for Elasticearch, Kibana, and Logstash than where you rest your head.

Trainings

If you are interested in Elasticsearch training we have courses taught by our core developers coming up in:

  • Paris - July 08, 2014 (core Elasticsearch training)
  • Cologne - July 09, 2014 (core Elasticsearch training)
  • Seoul - July 10, 2014 (core Elasticsearch training)
  • London - July 14, 2014 (core Elasticsearch training)
  • Tokyo - July 14, 2014 (core Elasticsearch training)
  • Berlin - July 15, 2014 (ELK workshop)
  • Milan - July 16, 2014 (core Elasticsearch training)