elasticsearch

Spark Structured Streaming with Elasticsearch

Reading Time: 3 minutes There’s been a lot of time we have been working on streaming data. Using Apache Spark for that can be much convenient. Spark provides two APIs for streaming data one is Spark Streaming which is a separate library provided by Spark. Another one is Structured Streaming which is built upon the Spark-SQL library. We will discuss the trade-offs and differences between these two libraries in Continue Reading

Amazon ES – Secure your cluster from anonymous users! #2

Reading Time: 5 minutes In the previous blog, we have learned how to create a domain on Amazon ES and also how to create an index using Curl on the cluster. Now, let’s just look how we can control access to Amazon ES Domain. One of the key benefits of using Amazon ES is that you can utilize AWS Identity and Access Management (IAM) to control access to your Continue Reading

Amazon ES – setting up the cluster! #1

Reading Time: 4 minutes Amazon Web Services (AWS) is a cloud services platform, providing compute power, database storage, content delivery, security options and other functionality to allow businesses to build sophisticated applications with increased flexibility, scalability and reliability. Amazon Elasticsearch is one of the services provided by AWS. Amazon ES Amazon Elasticsearch Service, also called Amazon ES, is a managed service that makes it easy to create a domain, Continue Reading

Elasticsearch – Pulling Socks!

Reading Time: 3 minutes Well I must say, during my internship I have come across new technologies. But I am quite fascinated by this search engine – Elasticsearch! I want to explore this tool. Reasons are – Used by big websites like Github (to search huge number of projects) Works quite like Google (at first look) Will enhance my knowledge – I’ll learn the “actual Java” – the practical Continue Reading

Autocomplete using Elasticsearch

Reading Time: 2 minutes You would have seen in a movie data store like IMDB, Whenever a user enters ‘g’, the search bar suggests him that you might be looking for gone girl or all the movies that have ‘g’ in them. This is what an Autocomplete or word completion is and it has become an essential part of any application. Autocomplete speeds up human-computer interaction by predicting the Continue Reading

Introduction to Elasticsearch in Scala

Reading Time: 2 minutes ElasticSearch is a real-time distributed search and analytics engine built on top of Apache Lucene. It is used for full-text search, structured search and analytics. Lucene is just a library and to leverage its power you need to use Java. Integrating Lucene directly with your application is a very complex task. Elastic Search uses the indexing and searching capabilities of Lucene but hides the complexities Continue Reading

How to tokenize your search by N-Grams using Elastic Search in Scala?

Reading Time: 2 minutes N–Grams can be used to search big data with compound words. German language is famous and referred for combining several small words into one massive compound word in order to capture precise or complex meanings. N-Grams are the fragments in which a word is broken, and as more number of fragments relevant to data, the more fragments will match.N-Grams has its length of fragment as Continue Reading

Implementing full text search with Couchbase and harnessing the power of Couchbase full text search (CBFT)

Reading Time: 5 minutes Hey Folks.! In this blog we are going to be introduced to the Couchbase Full text search. In my recent blog ,we talked about how we can user ElasticSearch for the full text search and how we can connect it with Couchbase so that our data gets copied in real time and we can search on it too. But what if we do not want Continue Reading

Working with Nested Aggregation of Elasticsearch

Reading Time: 2 minutes First of all we need to understand aggregation in ElasticSearch.In Elasticsearch an aggregation can be seen as a unit of work that builds analytic information over a set of documents.It is a powerful tool for build complex summaries of the data. There are many different types of aggregations, each with its own purpose and output. To better understand these types, it is often easier to break Continue Reading

Shield: Protect your Elasticsearch Data

Reading Time: 2 minutes What is shield? Shield allows you to easily protect Elasticsearch cluster from unintentional modification or unauthorized access with a username and password. Shield also gives security features like encryption, role-based access control, IP filtering, and auditing are also available when you need them. In this article, we will explore how to create basic and role-based access control authorization with shield. Installation : Shield installs as Continue Reading

Working With Elasticsearch In Scala

Reading Time: 3 minutes Elasticsearch is an open-source, restful, distributed, search engine built on top of apache-lucene. You can read it more on their website. We have an application built in scala with sbt as our build tool. Now we required to have a search capability on the output created by the application. We chose to use elasticsearch for our application. In this post, we will learn to use Continue Reading