apache

Apache Solr with Java: Result Grouping with Solrj

Reading Time: 4 minutes This blog is a detailed, step-by-step guide on implementing group by field in Apache Solr using Solrj. Note: Grouping is different from Faceting in Apache Solr. While grouping returns the documents grouped by the specified field, faceting returns the count of documents for each of the different values for the specified field. However you can combine grouping and faceting in Solr. This blog talks about grouping without the Continue Reading

Solr with Java: A basic hands-on with SolrJ

Reading Time: 2 minutes What is Apache Solr: Apache Solr is a search sever that includes the full-text search engine called Apache Lucene. It takes the piece of information (called documents) that are indexed according to the cores. When a query is performed, solr goes through the index and return the matching documents. Now let’s start the hands-on. Step 1: Install Solr from the following link. Step 2: Start Continue Reading

Introduction to Kafka Connect

Reading Time: < 1 minute Knoldus organized a half an hour session on 29 July 2016 at 4:00 PM. It covers a brief introduction to Apache Kafka Connect, giving insights about the benefits of kafka connect, its use cases. It also covers the motivation behind building Kafka Connect and an introduction to its architecture. Here is the video for the same.

Apache spark + cassandra: Basic steps to install and configure cassandra and use it with apache spark with example

Reading Time: 3 minutes To build an application using apache spark and cassandra you can use the datastax spark-cassandra-connector to communicate with spark. Before we are going to communicate with spark using connector we should know how to configure cassandra. So following are prerequisite to run example smoothly. Following steps to install and configure cassandra If you are new to cassandra first we nee to install cassandra on our Continue Reading

How to setup and use zookeeper in scala using Apache Curator

Reading Time: 2 minutes In order to use Zookeeper to manage your project’s configurations across the cluster, first we will setup the zookeeper ensemble on our local machine (setup is for testing on a single machine) by following these steps: 1) Download a stable zookeeper release 2) Unpack it at three places and rename it to: /home/user/Desktop/zookeeper1, /home/user/Desktop/zookeeper2, and /home/user/Desktop/zookeeper3 3) In order to use zookeeper we will need Continue Reading