Big Data/Analytics Zone is brought to you in partnership with:

Alec is a Content Curator at DZone. He lives in Raleigh and spends his free time writing and programming. Alec is a DZone Zone Leader and has posted 530 posts at DZone. You can read more from them at their website. View Full User Profile

Big Data and Python: Utilizing Python for Large-scale Datasets

10.04.2013
| 3092 views |
  • submit to reddit
This recent video from NewCircle Training discusses the use of Python for querying mass quantities of data. Despite the slow speed of Python, at least compared to languages like C++, AdRoll demonstrates how it can perform very efficiently even with large-scale datasets.

Ville Tuulos, Principle Engineer at AdRoll, a company producing tons of big data, demonstrates how AdRoll uses Python to squeeze every bit of performance out of a single high-end server. They manage it with Numba, a new NumPy aware dynamic Python compiler based on LLVM, and thanks to Python, the system can provide a very expressive and developer-friendly API. Find out more in this informative talk from the San Francisco Python Meetup Group.            
- YouTube Page  

You can also check out the slides by themselves if you don't have time for the whole video.