Win a copy of Testing JavaScript Applications this week in the HTML Pages with CSS and JavaScript forum!
  • Post Reply Bookmark Topic Watch Topic
  • New Topic
programming forums Java Mobile Certification Databases Caching Books Engineering Micro Controllers OS Languages Paradigms IDEs Build Tools Frameworks Application Servers Open Source This Site Careers Other all forums
this forum made possible by our volunteer staff, including ...
Marshals:
  • Campbell Ritchie
  • Bear Bibeault
  • Ron McLeod
  • Jeanne Boyarsky
  • Paul Clapham
Sheriffs:
  • Tim Cooke
  • Liutauras Vilda
  • Junilu Lacar
Saloon Keepers:
  • Tim Moores
  • Stephan van Hulst
  • Tim Holloway
  • fred rosenberger
  • salvin francis
Bartenders:
  • Piet Souris
  • Frits Walraven
  • Carey Brown

Free online course on Apache Spark for Big Data (starts 1st june)

 
Bartender
Posts: 2407
36
Scala Python Oracle Postgres Database Linux
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
There is a new online course from EdX on Introduction to Big Data with Apache Spark starting on 1st June. You can take the course online for free, or pay a small fee ($50) for the "verified certificate" option.

Apache Spark is a "fast and general engine for large-scale data processing", which is gaining a lot of interest for Big Data applications. It can run on an existing Hadoop YARN cluster, an Apache Mesos cluster, on a stand-alone cluster, or you can even run it on your local PC e.g. for ad hoc data exploration. It includes powerful tools for reading and processing data from different sources e.g. CSV files, HDFS, databases etc, as well as specific libraries for machine learning and stream processing. Spark provides APIs for Scala, Python and Java. There are interactive shells for Scala and Python, and the Python API can also be used interactively via the IPython Notebook.

Spark was a very popular topic for dicussion at the recent Strata Hadoop World conference in London, so this could be your opportunity to find out more about this great new tool for Big Data applications.
 
Java Cowboy
Posts: 16084
88
Android Scala IntelliJ IDE Spring Java
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Looks interesting.

The course is using Python (PySpark) so you'll have to know at least a little Python.
 
With a little knowledge, a cast iron skillet is non-stick and lasts a lifetime.
    Bookmark Topic Watch Topic
  • New Topic