Skip to main content

Getting back into parallel computing with Apache Spark

Returning to parallel computing with Apache Spark has been insightful, especially observing the increasing mainstream adoption of the McColl and Valiant BSP (Bulk Synchronous Parallel) model beyond GPUs. This structured approach to parallel computation, with its emphasis on synchronized supersteps, offers a practical framework for diverse parallel architectures.While setting up Spark on clusters can involve effort and introduce overhead, ongoing optimizations are expected to enhance its efficiency over time. Improvements in data handling, memory management, and query execution aim to streamline parallel processing.A GitHub repository for Spark snippets has been created as a resource for practical examples. As Apache Spark continues to evolve in parallel with the HDFS (Hadoop Distributed File System), this repository intends to showcase solutions leveraging their combined strengths for scalable data processing.



Popular posts from this blog

Stream PRAM: Research: Darrell Ulm @ Microsoft Research

Stream Pram is a paper co-written by Darrell Ulm, cat be accessed at Darrell Ulm Stream Pram Research Paper This is a paper about a multiple instruction stream style model of Parallel Random Access Memory (PRAM) parallel computation. The paper deals mostly with theoretical parallel computation as compared to applied parallel computing. Other links about the Stream Pram. Profile . Wordpress , Tumblr

Drupal: Darrell Ulm User Profile

The Drupal Profile for Darrell Ulm and links to projects such as the Google Books module and other git commits to Drupal projects. The profile contains information about other projects like IP Path Access, a module to block access by IP for specific pages, except for set IP address or IP address ranges. Some other projects contributed are Site Map, Sunlight Congressional Districts, and File Field Role Limit. And it appears the profile has been active for just over 10 years, and recently obtained the Acquia Certified Drupal Developer specification, via a test. Here is the Drupal profile link for  Darrell Ulm . Also similar posts and info. is obtainable at: SuperPowerPlanet , WordPress , and Tumblr  for a different organization of the contents.