Elasticsearch

Amazon ES – Secure your cluster from anonymous users! #2

In the previous blog, we have learned how to create a domain on Amazon ES and also how to create an index using Curl on the cluster. Now, let’s just look how we can control access to Amazon ES Domain. One of the key benefits of using Amazon ES is that you can utilize AWS Identity and Access Management (IAM) to control access to your Continue Reading

Amazon ES – setting up the cluster! #1

Amazon Web Services (AWS) is a cloud services platform, providing compute power, database storage, content delivery, security options and other functionality to allow businesses to build sophisticated applications with increased flexibility, scalability and reliability. Amazon Elasticsearch is one of the services provided by AWS. Amazon ES Amazon Elasticsearch Service, also called Amazon ES, is a managed service that makes it easy to create a domain, Continue Reading

Exploring JEST: Java HTTP REST Client

Elasticsearch is a real-time distributed and open source full-text search and analytics engine. To integrate Elasticsearch to our application, we need to use an API. Elasticsearch gives us two ways, REST APIs, and Native clients. It’s easy to get confused about all the different ways to connect to Elasticsearch and why one of them should be preferred over the other. Available Elasticsearch clients are: Node Continue Reading

Java High-Level REST Client – Elasticsearch

Elasticsearch is an open-source, highly scalable full-text search and analytics engine. Using this, you can easily store, search, and analyze a large amount of data in real time. Java REST client is the official client for Elasticsearch which comes in 2 flavors: Java Low-Level REST client – It allows communicating with an Elasticsearch cluster through HTTP and leaves requests marshalling & responses un-marshalling to users. Continue Reading

Deploying a 2 node Elasticsearch cluster on ec2 instance.

In this blog we will focus on two major things : 1). Steps required to create a two node elasticsearch (v5.2 released on 31.Jan.2017) cluster on Linux instances (with CentOs as the default OS). 2). Attaching additional volume to the instances and making changes in elasticsearch configurations so that all the elasticsearch related data will be stored on the mounted volumes, since the default storage Continue Reading

Neo4j vs ElasticSearch & Full Text Search In Neo4j

Hello Graphistas, Are you missing this series šŸ™‚ ? Welcome back again in the series of Neo4j with Scala šŸ˜‰ . Let’s start our journey again. Till now we have talked and learnt about the use of Neo4j with Scala and how easily we can integrated both two amazing technologies. Before starting the blog here is recap : Getting Started Neo4j with Scala : An Continue Reading

Autocomplete using Elasticsearch

You would have seen in a movie data store like IMDB, Whenever a user enters ā€˜gā€™, the search bar suggests him that you might be looking for gone girl or all the movies that have ā€˜gā€™ in them. This is what an Autocomplete or word completion is and it has become an essential part of any application. Autocomplete speeds up human-computer interaction by predicting the Continue Reading

Meetup: Stream Processing Using Spark & Kafka

Knoldus organized a Meetup on Friday, 9 September 2016. Topics which were covered in this meetup are: Overview of Spark Streaming. Fault-tolerance Semantics & Performance Tuning. Spark Streaming Integration with Ā Kafka. Meetup code sample available here Real time stream processing engineĀ application code available here

Building Analytics Engine Using Akka, Kafka & ElasticSearch

In this blog , I will share my experience on building scalable, distributed and fault-tolerantĀ  Analytics engine using Scala, Akka, Play, Kafka and ElasticSearch. I would like to take you through the journey of Ā building an analytics engineĀ which was primarily used for text analysis. The inputs were structured, unstructured and semi-structured data and we were doing a lot of data crunching using it. The Analytics Continue Reading

Introduction to Elasticsearch in Scala

ElasticSearch is a real-time distributed search and analytics engine built on top of Apache Lucene. It is used for full-text search, structured search and analytics. Lucene is just a library and to leverage its power you need to use Java. Integrating Lucene directly with your application is a very complex task. Elastic Search uses the indexing and searching capabilities of Lucene but hides the complexities Continue Reading

How to tokenize your search by N-Grams using Elastic Search in Scala?

N–Grams can be used to search big data with compound words. German language is famous and referred for combining several small words into one massive compound word in order to capture precise or complex meanings. N-Grams are the fragments in which a word is broken, and as more number of fragments relevant to data, the more fragments will match.N-Grams has its length of fragment as Continue Reading

Implementing full text search with Couchbase and harnessing the power of Couchbase full text search (CBFT)

Hey Folks.! In this blog we are going to be introduced to the Couchbase Full text search. In my recent blog ,we talked about how we can user ElasticSearch for the full text search and how we can connect it with Couchbase so that our data gets copied in real time and we can search on it too. But what if we do not want Continue Reading

Working with Nested Aggregation of Elasticsearch

First of all we need to understand aggregationĀ in ElasticSearch.In Elasticsearch an aggregation can be seen as a unit of work that builds analytic information over a set of documents.It is a powerful tool for build complex summaries of the data. There are many different types of aggregations, each with its own purpose and output. To better understand these types, it is often easier to break Continue Reading

%d bloggers like this: