Akka Actors on Mac beating the S#!t Out of my Dell

August 26, 2011 Vikas HazratiStudio-ScalaAkka, Akka actor, Benchmark, scala7 Comments

Table of contents

Reading Time: 2 minutes

Let me explain the scenario. We have n (tens, hundreds or thousands) of Akka actors listening to a queue on RabbitMQ server. So the scenario looks something like this


AMQP.newConsumer(connection, ConsumerParameters("*", actor, Some(queueName), Some(exchangeParameters)))

As you would notice, each of the actors gets a message from the Q and invokes a plugin to do the processing and returns the results back to the result Q.
Now the fun happens when I execute the same code on my machine and my competitors (Meetu’s) machine.

Meetu has a Macbook pro – 2 cores – 2.3 GHz Intel core i5 – RAM 4GB – 64 bit
Mine is Dell Vostro – 4 cores – 2.4 GHz Intel core i5 – RAM 3GB – 32 bit

And here are the results

We start with 10 actors getting 100 messages and go all the way to 1,000 actors getting 10,000 messages. Everything in the code is the same except the hardware on which the tests are executed. I start of badly and see that on my machine 10 actors are processing messages in 5.298 s as compared to 3.039s on Meetu’s mac. This prompts me to skip to the other portion of the spectrum where I feel I can show him the power of my 4 cores. I am mistaken that the initial load of fewer messages and actors is actually having overhead for 4 cores and they would perform well on the other end.

Sadly with 1000 actors and 10000 messages the tiny mac with 2 cores outperforms the hulk dell with 4 cores by a margin of more than 10s.

What could be going wrong? There is no other heavy processing happening, on either machine, when we are running the tests. I doubt 64 bit vs 32 bit would cause such an upset. What else?

Written by Vikas Hazrati

Vikas is the CEO and Co-Founder of Knoldus Inc. Knoldus does niche Reactive and Big Data product development on Scala, Spark, and Functional Java. Knoldus has a strong focus on software craftsmanship which ensures high-quality software development. It partners with the best in the industry like Lightbend (Scala Ecosystem), Databricks (Spark Ecosystem), Confluent (Kafka) and Datastax (Cassandra). Vikas has been working in the cutting edge tech industry for 20+ years. He was an ardent fan of Java with multiple high load enterprise systems to boast of till he met Scala. His current passions include utilizing the power of Scala, Akka and Play to make Reactive and Big Data systems for niche startups and enterprises who would like to change the way software is developed. To know more, send a mail to hello@knoldus.com or visit www.knoldus.com

7 thoughts on “Akka Actors on Mac beating the S#!t Out of my Dell1 min read”

Vikas Hazrati says:

August 26, 2011 at 11:24 AM

Another data point: My dell is running on Ubuntu 11.04
√iktor Klang says:

August 26, 2011 at 12:35 PM

What does your akka.conf look like? Which version are you using?
1. Vikas Hazrati says:
  
  August 26, 2011 at 12:58 PM
  
  Viktor, Akka version is 1.1.3 and we are using the default configuration for now (akka-reference), nothing overridden.
  1. √iktor Klang says:
    
    August 26, 2011 at 2:33 PM
    
    If your core-count is correct (including things like HT) then you’re using different settings on your machines (factor): https://github.com/jboner/akka/blob/release-1.1.3/config/akka-reference.conf#L43
    
    2 cores for your mac = 2 * 1.0 == 2 threads in default dispatcher on OSX
    4 cores for your Dell = 4 * 1.0 == 4 threads in the default dispatcher on Windows
    
    Also, are you using exactly the same JVM options for both runs? (-server xmx/xms etc)?
√iktor Klang says:

August 26, 2011 at 2:40 PM

√iktor Klang :
If your core-count is correct (including things like HT) then you’re using different settings on your machines (factor): https://github.com/jboner/akka/blob/release-1.1.3/config/akka-reference.conf#L43
2 cores for your mac = 2 * 1.0 == 2 threads in default dispatcher on OSX
4 cores for your Dell = 4 * 1.0 == 4 threads in the default dispatcher on Windows

Of course this is if you’re not using the “GlobalExecutorBasedEventDriven” or if you’re using your own dispatcher.
Vikas Hazrati says:

September 1, 2011 at 10:59 AM

Viktor, sorry for the late reply, was out of action for a few days.
The core count is correct and there is no HT. and what you suggest is correct that we indeed end up using different settings on our machines in terms of the # of threads. But would having 4 threads on my linux dell make it slower than 2 threads on osx?
Vikas Hazrati says:

September 5, 2011 at 10:05 AM

btw just figured out that all mac book pros come with HT so ideally Runtime.getRuntime.availableProcessors would return 4 on both the machines.