Maloy Manna, Engineering, AXA Data Innovation Lab
Data in the real-world is almost always dirty, incomplete, scattered or inconsistent. For data scientists, 'janitor work' is a key hurdle to data insights.
Whether you use big data for analytics or data science, with increasing variety and velocity of big data, the data pre-processing step can be the most time-consuming step in your data pipeline.
Featuring engineering concepts and practical examples in Python and R, this webinar will focus on technical considerations and data engineering techniques to optimise data preparation to get the most value from your big data pipeline.