This event has ended. View the official site or create your own event → Check it out
This event has ended. Create your own
View analytic
Friday, January 29 • 09:00 - 09:50
Tabular data analysis in Apache Spark using DataFrames

Sign up or log in to save this to your schedule and see who's attending!

The concept of DataFrames, a convenient way to work with tabular data, has been around for a while, but it has been mostly used in the R-world. Recently, the DataFrames has been introduced to Apache Spark, substantially simplifying analysis of data tables for the Big Data world. The talk will show the benefits of using DataFrames in Apache Spark along with some practical informations on the topic. Moreover, the talk would be a brief introduction to using Apache Spark.


Jakub Nowacki

Jakub is University of Bristol graduate where he obtained PhD in Engineering Mathematics. On the daily basis he utilizes his analytical and development skill working in software development. He is mostly interested in distributed processing and analysis of big data sets. Jakub originally has C/C++ background but currently works mostly in JVM and Python world.

Friday January 29, 2016 09:00 - 09:50
Kosmos - Room 02

Attendees (10)