Tag Archives: apache

Simple Things You Can Learn From Cassandra Nodetool (Monitor/Manage) For DC/OS


Cassandra's native tool, nodetool, is used for monitoring and managing a Cassandra cluster on DC/OS. Continue reading

Posted in Best Practices, big data, Cassandra, cluster, NoSql | Leave a comment

Join Semantics in Kafka Streams


Introduction to core concepts: Apache Kafka is a distributed streaming platform that lets you publish and subscribe to streams of records and process those records as they occur. Kafka Streams is a client … Continue reading

Posted in Apache Kafka, Scala | 2 Comments

What’s new in Apache Spark 2.2


Apache recently released a new version of Spark, Apache Spark 2.2. The new version brings improvements as well as new functionality. The major addition to this release is Structured Streaming. It has been marked as production … Continue reading

Posted in apache spark, big data, Scala, Spark, Streaming | 2 Comments

Apache Solr with Java: Result Grouping with Solrj


This blog is a detailed, step-by-step guide to implementing grouping by field in Apache Solr using SolrJ. Note: Grouping is different from faceting in Apache Solr. While grouping returns the documents grouped by the specified field, faceting returns the count of documents for … Continue reading

Posted in Java | 1 Comment

Solr with Java: A basic hands-on with SolrJ


What is Apache Solr: Apache Solr is a search server that includes the full-text search engine Apache Lucene. It takes pieces of information (called documents), which are indexed according to the cores. When a query is performed, Solr … Continue reading

Posted in Java | 3 Comments

Introduction to Kafka Connect


Knoldus organized a half-hour session on 29 July 2016 at 4:00 PM. It gives a brief introduction to Apache Kafka Connect, with insights into the benefits of Kafka Connect and its use cases. It also covers the motivation behind … Continue reading

Posted in Apache Kafka, Scala | Leave a comment

Apache spark + cassandra: Basic steps to install and configure cassandra and use it with apache spark with example


To build an application using Apache Spark and Cassandra, you can use the DataStax spark-cassandra-connector to communicate with Spark. Before communicating with Spark through the connector, we should know how to configure Cassandra. So following are prerequisite … Continue reading

Posted in Scala, Spark | 7 Comments

How to setup and use zookeeper in scala using Apache Curator


To use ZooKeeper to manage your project’s configurations across the cluster, we will first set up the ZooKeeper ensemble on our local machine (this setup is for testing on a single machine) by following these steps: 1) Download a stable … Continue reading

Posted in Java, Scala | 1 Comment