Getting back into parallel computing with Apache Spark

Getting back into parallel computing with Apache Spark has been great, and it has been interesting to see the BSP (Bulk Synchronous Parallel) model of Valiant and McColl finally becoming mainstream beyond GPUs.

Spark can take some effort to set up on actual clusters and it does carry overhead, but I expect these costs to be optimized over time and Spark to become more and more efficient.

I have started a GitHub repo of Spark snippets, in case any are of interest, as Apache Spark moves forward 'in parallel' with HDFS (the Hadoop Distributed File System).

