16-18 July 2015, Bangalore
Status: Awaiting jury selection

Machine Learning, Distributed and Parallel Computing, and High-performance Computing are the themes for this year’s edition of Fifth Elephant.

The deadline for submitting a proposal is 15th June 2015

We are looking for talks and workshops from academics and practitioners who are in the business of making sense of data, big and small.

Track 1: Discovering Insights and Driving Decisions

This track is about general, novel, fundamental, and advanced techniques for making sense of data and driving decisions from data. This could encompass applications of the following ML paradigms:

Across various data modalities including multi-variate, text, speech, time series, images, video, transactions, etc.

Track 2: Speed at Scale

This track is about tools and processes for collecting, indexing, and processing vast amounts of data. The theme includes:

Commitment to Open Source

HasGeek believes in open source as the binding force of our community. If you are describing a codebase for developers to work with, we’d like it to be available under a permissive open source license. If your software is commercially licensed or available under a combination of commercial and restrictive open source licenses (such as the various forms of the GPL), please consider picking up a sponsorship. We recognize that there are valid reasons for commercial licensing, but ask that you support us in return for giving you an audience. Your session will be marked on the schedule as a sponsored session.


If you are interested in conducting a hands-on session on any of the topics falling under the themes of the two tracks described above, please submit a proposal under the workshops section. We also need you to tell us about your past experience in teaching and/or conducting workshops.

Confirmed sessions

# Speaker Section Level +1 Submitted
1 "Thinking Machines"
Shailesh Kumar (@shkumar) Keynote Advanced 1 0 Tue, 7 Jul
2 Future patterns in data ecosystem
Amod Malviya Sponsored Keynote Intermediate 2 0 Tue, 7 Jul
3 Igniting your data with Apache Spark
Yagnik (@yagnik) Workshop Beginner 0 5 Thu, 2 Jul
4 Deploying Batch and Streaming Architectures on AWS
Russell Nash (@russnash) Sponsored Intermediate 4 0 Thu, 18 Jun
5 Data Comes in Shapes
Tim Poston (@timposton) Keynote Beginner 5 0 Tue, 16 Jun
6 When Apache ZooKeeper is good fit
Rakesh R (@rakeshadr) Crisp Talk Intermediate 20 0 Tue, 16 Jun
7 Dead Simple Scalability Patterns
Vedang Manerikar (@vedang) Crisp Talk Beginner 34 0 Mon, 15 Jun
8 Call me maybe: Jepsen and flaky networks  
Shalin Mangar (@shalinmangar) Full Talk Advanced 15 0 Mon, 15 Jun
9 Graph Algorithms and Computer Vision  
Sumod Mohan (@sumod) Full Talk Intermediate 7 0 Mon, 15 Jun
10 Harnessing the power of the Erlang VM at Housing    
Abhijit Pratap Singh (@sabhi) Crisp Talk Intermediate 13 0 Mon, 15 Jun
11 Exploratory data analysis using Apache Lens and Apache Zeppelin
Pranav Agarwal (@praagarw) Crisp Talk Intermediate 8 1 Mon, 15 Jun
12 Keeping Moore's law alive: Neuromorphic computing
Anand Chandrasekaran (@madstreetden) Full Talk Beginner 20 0 Mon, 15 Jun
13 Deep Learning for Natural Language Processing    
Devashish Shankar (@devashishshankar) Full Talk Intermediate 18 1 Mon, 15 Jun
14 Joining data streams at scale for fun and profit
Aniruddha Gangopadhyay (@aniruddha9591) Crisp Talk Beginner 8 0 Mon, 15 Jun
15 Hardware Accelerated Big Data Processing
Reetinder Sidhu (@sidhu1f) Crisp Talk Intermediate 6 0 Mon, 15 Jun
16 Revolutionizing travel with ML & Analytics – An insight into business optimization using Machine Learning and Advanced Analytics
Raghu Kashyap (@ragskashyap1) Full Talk Intermediate 29 1 Mon, 15 Jun
17 Building a E-commerce search engine: Challenges, insights and approaches
Vinodh Kumar R (@vinodhkumarr) Sponsored Beginner 10 0 Mon, 15 Jun
18 Are these the same pair of shoes? - Matching retail products at scale
Nikhil Ketkar (@nikhilketkar) Full Talk Intermediate 76 1 Mon, 15 Jun
19 Using Modes for Time Series Classification  
Rohit Chatterjee (@rohitchatterjee) Crisp Talk Beginner 5 0 Mon, 15 Jun
20 Apache Tez - Present and Future
Rajesh Balamohan Full Talk Intermediate 3 0 Mon, 15 Jun
21 Approximate algorithms for summarizing streaming data  
Himadri Sarkar (@himadri) Full Talk Intermediate 45 5 Sun, 14 Jun
22 CAP Theorem: You don’t need CP, you don’t want AP, and you can’t have CA
Siddhartha Reddy (@sids) Full Talk Intermediate 20 4 Sun, 14 Jun
23 POC: How to slice, dice & search billions of users events in seconds (from scratch)  
Bhasker Kode (@bhaskerkode) Crisp Talk Beginner 11 0 Sun, 14 Jun
24 The many ways of parallel computing with Julia
Viral B. Shah (@viralbshah) Full Talk Beginner 5 0 Sun, 14 Jun
25 Escher - democratizing beautiful visualizations
Shashi Gowda (@g0wda) Crisp Talk Beginner 5 1 Fri, 12 Jun
26 Recommendation System beyond traditional Collaborative filtering  
Gagan Agrawal (@gagana24) Full Talk Intermediate 7 0 Fri, 12 Jun
27 Running natural language queries against NoSQL schema
Deepak Krishnan (@deepakgk) Crisp Talk Advanced 30 3 Thu, 11 Jun
28 Search at Petabyte scale
Anup Nair (@anair) Crisp Talk Intermediate 8 0 Thu, 11 Jun
29 Building tiered data stores using Aesop to bridge SQL and NoSQL systems  
Regunath Balasubramanian (@regunathb) Full Talk Intermediate 9 0 Wed, 10 Jun
30 HawkEye: A Real-Time Anomaly Detection System  
Satnam Singh, PhD (@satnam-datageek) Crisp Talk Beginner 9 2 Mon, 8 Jun
31 A review of important results in distributed systems
Vaidhy Gopalan (@vaidhy) Full Talk Intermediate 10 1 Thu, 28 May
32 Making a contextual recommendation engine using Python and Deep Learning at ParallelDots  
Muktabh Mayank (@muktabhm) Crisp Talk Beginner 10 0 Wed, 27 May
33 Critical pipe fittings: What every data pipeline requires
Yagnik (@yagnik) Full Talk Intermediate 5 0 Wed, 27 May
34 Understanding supervised machine learning hands on!  
Harshad Saykhedkar (@harshss) Workshop Beginner 14 1 Mon, 25 May
35 Processing large data with Apache Spark    
Venkata Naga Ravi (@venkatanagaravi) Full Talk Intermediate 3 0 Sat, 23 May
36 Building Recommender system
Swaroop Krothapalli (@swaroop) Crisp Talk Beginner 5 0 Thu, 21 May
37 Visualising Multi Dimensional Data  
Amit Kapoor (@amitkaps) Full Talk Intermediate 9 0 Tue, 19 May
38 Introduction to Deep Learning  
Bargava Subramanian (@barsubra) Workshop Intermediate 20 0 Mon, 18 May
39 Two Years Wiser: The Nilenso Experiment
Steven Deobald (@stevendeobald) Full Talk Beginner 6 0 Mon, 11 May
40 Instrumenting your kafka & storm pipeline  
Bhasker Kode (@bhaskerkode) Full Talk Intermediate 12 0 Mon, 11 May

Unconfirmed proposals

# Speaker Section Level +1 Submitted
1 Real Time Bid Modification @ Million Requests per second...
Jatinder Singh (@jatinder) Crisp Talk Intermediate 5 0 Wed, 17 Jun
2 Introduction to MaelStorm and Performance Engineering
Jatinder Singh (@jatinder) Workshop Advanced 2 0 Tue, 16 Jun
3 Reviews and Ratings Spam Detection
Mohit Kumar (@mohitkum) Crisp Talk Intermediate 9 1 Mon, 15 Jun
4 AB testing: What, Why & How
Renuka Khandelwal (@renuka) Full Talk Beginner 9 0 Mon, 15 Jun
5 Building a distributed cache system with redis, clojure and math
Kapil Reddy (@kapilr) Full Talk Intermediate 23 0 Mon, 15 Jun
6 From Search to Discovery at Housing
Mudit Gupta (@mudit-housing) Full Talk Beginner 10 0 Mon, 15 Jun
7 High Performance Tiled Map Service    
Shubham Bansal (@shubham-bansal) Full Talk Intermediate 2 0 Mon, 15 Jun
8 Think Incremental with hive.
ravi teja (@ravi-teja) Crisp Talk Intermediate 11 0 Mon, 15 Jun
9 Scalable real-time personalized recommendation system
Jasvinder Singh (@jasvinder) Full Talk Intermediate 6 0 Mon, 15 Jun
10 How to stop admiring and start using Deep Learning
Vivek Mehta (@vivekmehta) Full Talk Intermediate 20 0 Mon, 15 Jun
11 Holistic Security Process for Humanitarian Projects
chinmayi sk Full Talk Intermediate 7 1 Mon, 15 Jun
12 Stream Processing in production: Metrics that matter
Siddhartha Reddy (@sids) Crisp Talk Intermediate 11 0 Mon, 15 Jun
13 Map Tile Server
Niranjan Bala V (@niranjanbalav) Crisp Talk Intermediate 45 0 Mon, 15 Jun
14 Designing distributed components in a multi tenant architecture
Ronak (@ronak-kothari) Full Talk Intermediate 7 0 Mon, 15 Jun
15 Data Infrastructure for Real Time Analysis of User Click Stream Data  
Aditya Prasad Narisetty (@adityaprasadn) Full Talk Beginner 9 0 Mon, 15 Jun
16 What does your website look like to a web-crawler
Gagandeep singh (@gagan-goku) Full Talk Intermediate 7 0 Mon, 15 Jun
17 Developing a Hybrid Recommender System for Some of Life’s Most Important Choices    
Paul Meinshausen (@pmeins) Full Talk Intermediate 7 0 Mon, 15 Jun
18 Solr compute cloud - An elastic Solr infrastructure
Suchitra Amalapurapu (@asmsuchi) Full Talk Advanced 11 0 Mon, 15 Jun
19 postgres clusters and their nuances  
Srihari Sriraman (@ssrihari) Full Talk Intermediate 2 0 Mon, 15 Jun
20 Practical Approach to Python based Supervised Machine Learning: User Generated Text Classification Techniques
Kausik Ghatak (@kausikg) Full Talk Intermediate 15 0 Mon, 15 Jun
21 An Integrated Weblog Processing and Machine Learning Workflow for Building and Deploying Intent Prediction Models at Scale  
Dhanesh Padmanabhan (@dhanesh123us) Full Talk Intermediate 4 0 Mon, 15 Jun
22 Building Real time solution within 30 minutes
Sudhir Rawat (@rawatsudhir) Crisp Talk Beginner 1 0 Mon, 15 Jun
23 Anatomy of Decision Trees using an example from Kaggle  
Saurabh Banerjee (@saurabhbanerjee) Full Talk Intermediate 18 5 Mon, 15 Jun
24 Getting Started with IoT
Sudhir Rawat (@rawatsudhir) Full Talk Intermediate 2 0 Mon, 15 Jun
25 Automating news discovery in real-time
Anand S (@sanand0) Full Talk Beginner 14 0 Mon, 15 Jun
26 Static & Interactive Exploratory Data Analysis in R  
Amit Kapoor (@amitkaps) Workshop Intermediate 4 0 Sun, 14 Jun
27 Deconstructing Linear Regression
Vishal (@vishalgokhale) Crisp Talk Beginner 4 0 Sun, 14 Jun
28 The many ways of parallel computing with Julia
Viral B. Shah (@viralbshah) Full Talk Beginner 4 0 Sun, 14 Jun
29 Big Data Engineering made easy
Kaushik Paranjape (@kaushik-paranjape) Full Talk Intermediate 4 4 Sun, 14 Jun
30 Benchmarks from JVM to Big Data  
Srinivasa Rao Aravilli (@aravilli) Full Talk Intermediate 4 0 Sun, 14 Jun
31 Ensemble Learning
Swaroop Krothapalli (@swaroop) Full Talk Beginner 11 0 Sat, 13 Jun
32 Aerospike : High Performance NoSQL store with flash optimization
Gagan Agrawal (@gagana24) Full Talk Intermediate 5 0 Sat, 13 Jun
33 Building Complex Data Workflows with Cascading on Hadoop
Gagan Agrawal (@gagana24) Full Talk Intermediate 5 0 Sat, 13 Jun
34 IT Operations Analytics: Using Text Analytics and Statistical Modeling in IT Operations Data
Vishnuteja Nanduri (@vishnunanduri) Full Talk Intermediate 1 1 Mon, 8 Jun
35 High Performance Computing in R    
Ravishankar Rajagopalan (@vioravis) Workshop Intermediate 19 0 Wed, 3 Jun
36 Anomaly Detection Using Apache Spark
Kiran Veigas (@kiranveigas) (proposing) Crisp Talk Advanced 6 1 Mon, 1 Jun
37 Squirrel – Enabling Accessible Analytics for All    
sudipta mukherjee (@samthecoder) Crisp Talk Intermediate 7 0 Sun, 31 May
38 Leveraging Cloud for BigData Analytics - Patterns, Options and Practical Next Steps
Amit Jain (@jnamit) Full Talk Intermediate 3 0 Thu, 28 May
39 Securing your Enterprise Hadoop Cluster
Manoj Sundaram (@manojsundaram) Full Talk Intermediate 33 0 Wed, 27 May
40 Building Spark as Service in Cloud using YARN
Rajat (@rgupta) Full Talk Intermediate 27 0 Mon, 25 May
41 Big Data Benchmarking    
Venkata Naga Ravi (@venkatanagaravi) Full Talk Intermediate 1 0 Sat, 23 May
42 On building a cloud-based black-box predictive modeling system
Bargava Subramanian (@barsubra) Full Talk Beginner 5 0 Thu, 21 May
43 Building Data Products for Small / Mid-Sized Data
Ramesh Sampath (@sampathweb) Full Talk Intermediate 7 0 Tue, 12 May
44 Deprecating MapReduce Patterns with Apache Spark
Rahul Kavale (@rahulkavale) Full Talk Intermediate 3 2 Thu, 7 May
45 Scrap Your MapReduce - Introduction to Apache Spark
Rahul Kavale (@rahulkavale) Full Talk Beginner 7 0 Thu, 7 May
46 Anatomy of RDD : A Deep dive into Spark RDD Data structure.  
Madhukara Phatak (@phatak-dev) Full Talk Advanced 14 0 Wed, 6 May
47 Big data analysis with Apache Spark
Madhukara Phatak (@phatak-dev) Workshop Beginner 9 0 Wed, 6 May
48 Networks and Network Analysis
Dr. Jai Ganesh (@jaiganesh) Full Talk Advanced 7 0 Mon, 27 Apr
49 Tackling ML's black boxes with probabilistic programming
Rudraksh MK (@rudrakshmk) Full Talk Advanced 13 0 Sat, 18 Apr