Hi [[ session.user.profile.firstName ]]

[SEASON 2 EP 5] Finding Community in Data Science with Reshama Shaikh

Now that we’ve covered how open source works, we’re looking to pull back the curtain and see who’s actually contributing. In part 2/2 of our open source series, we sat down with Reshama Shaikh, a statistician and key organizer of scikit-Learn sprints, to learn about the ups & downs of open source contributing, as well how a Sprint in Nairobi benefits Fortune 500 companies in the US.

Reshama Shaikh is an independent data scientist/statistician and MBA with skills in Python, R and SAS. I worked for over 10 years as a biostatistician in the pharmaceutical industry.

Further Reading:

Stack Overflow Developer Survey; Open Source Contributors: https://insights.stackoverflow.com/survey/2019#developer-profile-_-contributing-to-open-source

How to Organize a Scikit Learn Spring by Reshama Shaikh: https://reshamas.github.io/how-to-organize-a-scikit-learn-sprint/

Reshama Shaikh’s Website: https://reshamas.github.io/

Contributing to scikit-Learn: https://scikit-learn.org/stable/developers/contributing.html

Gitter scikit-Learn: https://gitter.im/scikit-learn/scikit-learn

scikitLearn Mailing List: https://mail.python.org/mailman/listinfo/scikit-learn
Recorded Jan 3 2020 25 mins
Your place is confirmed,
we'll send you email reminders
Presented by
Reshama Shaikh, Will Nowak, Triveni Gandhi
Presentation preview: [SEASON 2 EP 5]  Finding Community in Data Science with Reshama Shaikh

Network with like-minded attendees

  • [[ session.user.profile.displayName ]]
    Add a photo
    • [[ session.user.profile.displayName ]]
    • [[ session.user.profile.jobTitle ]]
    • [[ session.user.profile.companyName ]]
    • [[ userProfileTemplateHelper.getLocation(session.user.profile) ]]
  • [[ card.displayName ]]
    • [[ card.displayName ]]
    • [[ card.jobTitle ]]
    • [[ card.companyName ]]
    • [[ userProfileTemplateHelper.getLocation(card) ]]
  • Channel
  • Channel profile
  • Migrating Away From Excel: Use Case Deep Dive & Q&A Nov 24 2020 3:00 am UTC 31 mins
    Will Nowak, Senior Solutions Engineer, Dataiku
    It's no secret that Excel poses massive limitations for organizations as they look to most effectively utilize data, with its lack of functions for size and volume, auditability, and automated workflows. In this session, we'll provide you with a solution to bypass the frustration. Enter Dataiku as an alternative, and learn how to work on big data as easily as small data. In this session, our financial services expert and solutions engineer, Will Nowak, will address this key challenge through use cases, notable examples, and industry observations from our customers in the space.
  • [Dataiku x Bots & AI] Data Science in Compliance and Fraud Detection Nov 18 2020 7:00 pm UTC 75 mins
    Harry Lu (Data Analytics Manager @ Spotify)
    Tentative Schedule: (EST)

    2:00pm: Intro
    2:05pm: Data Science in Compliance and Fraud Detection w/ Spotify
    2:45pm: Q&A

    Talk Abstract:

    Data Science is an emerging function in a variety of industries and a greater number of data scientists have begun working on personalization, recommendations, or sales optimizations.
    The cost of compliance has also been expanding in most industries and especially in the technology sector. This is a result of the Public’s attention shift from traditional frauds to antitrust, conflict of interest, and data privacy. While there are plenty of opportunities to leveraging DS/ML to solve such problems, the complex nature of such compliance and fraud detection issues stymies data practitioners from being able to grow the data science practice. The “language barrier” of communication between the company’s DS/business/compliance appears to hide the low hanging fruits.
    In this talk, Harry will share his experience in connecting data science with compliance, examples of the DS/ML use cases, and key takeaways for our two groups of the audience (Data Scientists and Business Stakeholders.)
    • Hidden Data Science Opportunity in Compliance and Finance
    • Career Journey: from a fraud investigator to a data scientist
    • Opportunities that often get overlooked in compliance/fraud functions
    • How to speak two languages: recommendations on connecting data science resources with compliance business

    Disclaimer: All views, thoughts, & opinions expressed in the webinar belong solely to the panelists, & not to the panelists’ employer, organization, committee, other group or individual.
  • Dataiku Demo Days Ep.4: Optimizing Preventative Maintenance Nov 17 2020 7:00 pm UTC 60 mins
    Aashish Majethia
    Leveraging AI is an efficient way to provide real-time visibility into the production process to reduce downtime for maintenance and costs for efficient operations. Join us as we walk through how sensor data can be transformed into timely insights via predictive maintenance with automated insight improvement.

    During this webinar, we will:
    - Determine when purchased equipment might fail to deploy resources to service customers
    - Look at root cause analysis and model drift.
  • The Human Role in AutoML Nov 12 2020 6:00 am UTC 44 mins
    Timm Grosser, Senior Analyst, Business Applications Research Center (BARC)
    Will AI replace the Data Scientist?

    AI promises a lot of potential for speeding up processes and making tools, functions, even entire workflows easier to use. This is valid throughout the entire analytics process, from model creation to model operation.

    During the webinar, we will learn:
    1) Potential and impacts of AI
    2) Illustrate this using the example of the Advanced Analytics process.
    3) Outlook on future application scenarios of AI
  • ML-based Fraud Detection and Prevention in Healthcare Nov 6 2020 3:00 am UTC 66 mins
    Grant Case, Solution Engineering Director
    Fraud in the healthcare industry is on the rise globally and APAC is no exception. There has been a considerable increase in fraud/deceptive activities in the APAC region owing to the growing penetration of the internet and an increase in the use of mobile internet.
     
    Preventing Fraud has the potential to make medicine better, more affordable, and more accessible.

    During the webinar, we will cover the following:

    Define what is Healthcare fraud & evaluate what can be done to detect and prevent fraud
    Deep dive 4 different options to combat fraud
    Finally, see how to combine the traditional and Machine-Learning Based methods within a Machine Learning Framework

    Participate in a data science project showcase followed by Q&A with one of our Sales Engineering director.
  • Hype und Realität von KI und Data Science in datengesteuerten Unternehmen Nov 5 2020 1:00 pm UTC 120 mins
    Manuel Nitzsche/ Dataiku, Clarissa Vogelbacher/ ITM Predictive GmbH, Frank Oechsle/ Esentri
    Wie sieht die Transformation zu einem datengesteuerten Unternehmen aus? Wie wird ein Data Science Projekt umgesetzt und für welche Anwendungsfälle? Nach der 1-stündigen Vorstellung mit Experten zu diesen Themen zeigen wir Ihnen exklusiv den Film "Data Science Pioneers" über die Realität und Herausforderungen eines Data Scientists und die Chancen von Data Science für Unternehmen und unseren Alltag.
  • Comment engager durablement vos clients grâce à l'usage de la data ? Nov 5 2020 10:00 am UTC 60 mins
    Alexia Dumas (LineUp 7), Alexis Kesseler (LineUp 7)
    Si on vous dit Capital Client, Patrimoine Client, Connaissance Client, Personnalisation, Engagement....ça vous parle ?

    Venez nous rejoindre au webinar du 5 novembre pour découvrir comment LineUP 7, agence de Data Marketing, accompagne ses clients dans la conception de dispositifs personnalisés et performants permettant de réengager durablement vos clients ! Et tout cela grâce à un usage optimal de la Data !

    Découvrez:
    - Comment utiliser & capitaliser sur la Data pour engager vos clients
    - Quels leviers Data activer pour optimiser votre ROI ?
    - Comment consolider votre base client et optimiser votre expérience client ?
    - Comment l'usage de la Data peut être un atout majeur pour répondre à vos besoins métier ?

    Intervenants:
    - Alexia Dumas, Consulting Manager chez LineUP 7
    - Alexis Kesseler, Data Scientist chez LineUP 7

    En vous inscrivant à ce webinaire, vous acceptez que vos informations soient partagées avec le partenaire de Dataiku, LineUP 7.
  • Roll out insights to the whole business using Data Science Platform Nov 5 2020 6:00 am UTC 53 mins
    Slava Razbash
    You might have an excellent data science team, but how do they productionalize their insights so that they are accessible by all stakeholders, all the time? If your team is emailing spreadsheets with confidential information around the company, it's time for a better solution.
     
    Slava will present how teams can use Dataiku DSS to securely share insights with stakeholders via a centralized platform.
  • Understand the US Elections with Data Science Feat. Jupiter Asset Management Recorded: Oct 29 2020 24 mins
    Leo Murison, Data Scientist at Jupiter Asset Management
    The leader of the free world will be elected in just under a month’s time. While sophisticated models
    from the Economist and FiveThirtyEight both put Joe Biden in front, it is by no means a certainty.

    In this talk, we demonstrate an end-to-end project in Data Science Studio (DSS) that can deliver timely insights into the presidential races in each state.
    We show how DSS can be used to:
    + Collect
    + Transform
    + Present data
    All the while giving stakeholders the ability to customise specific attributes. Lastly, we illustrate how DSS can be used in conjunction with other common technologies, simplifying how data science teams can interact with other areas of the business.

    About the speaker:
    Leo Murison is a data scientist at Jupiter Asset Management, a leading active fund management
    house with ~£56 billion of assets under management. Since joining in September 2019, he has
    helped produce a number of highly predictive machine learning models leverage alternative data,
    including the first productionised model using Dataiku. Previously, Leo worked in the data science
    team at DAZN, a global sports streaming service with millions of global subscribers, where he
    developed models to predict customer behaviour. Leo holds a degree in Neuroscience from Edinburgh University.
  • Dataiku Demo Days Ep.3: Sales Forecasting and Geocoding Recorded: Oct 28 2020 44 mins
    Emma Irwin, Sales Engineer, Dataiku & Claude Perdigou, Senior Product Manager, Dataiku
    Understanding the location data of sales and having the ability to accurately forecast revenue are critical components for a business’s success. Join us for Demo Days where we’ll show you how you can build predictive models to predict revenue for the coming days or weeks, and understand, optimize, and visualize your data by location in order to optimize business practices and streamline your day-to-day operations.

    Dataiku Demo Days is a series of expert-led demos on various high-value AI use cases, such as driving efficiencies in the data-to-insights process and maximizing campaign impact with AutoML. These digestible sessions are designed to help jumpstart your organizations’ data efforts and inject agility at every step of the process.
  • Fraud Detection- How to Operationalize your Models? Recorded: Oct 27 2020 49 mins
    Alexandre Hubert, Sales Engineering Director
    We are pleased to bring the second installment of a two-part webinar series on Fraud Detection. During the 1st webinar, we addressed why fraud hasn’t been solved yet. Addressed the need to move beyond a rule-based approach and how to navigate the creative thinking of fraudsters.

    During this webinar, we will look at fraud detection from a platform perspective and discover how to integrate ML techniques to the existing rule-based approach within an ML Framework and how to feed your fraud detection pipeline with right set of algorithms in order to ease the implementation of your use cases.
  • Fraud Detection: Why It Hasn't Been Solved Yet? Recorded: Oct 20 2020 26 mins
    Alexandre Hubert, Sales Engineering Director
    Global fraud in 2019 was nearly $60 billion, demonstrating how it is a global problem and not siloed to one industry. Fraudsters are always looking for new ways to subvert legitimate transaction systems — traditional rules-based approaches are no longer sufficient (or efficient enough) to combat fraud.

    During the first installment of this two-part webinar series, you will discover why there is a need to move beyond a rules-based approach for fraud detection, how to navigate the creative thinking of fraudsters, and the many factors that go into a machine learning fraud detection model
  • Foundational Data Science for Personalized Communications w/ Nike Recorded: Oct 19 2020 49 mins
    Ankit Gupta (Sr. Data Scientist @ Nike)
    Tentative Schedule: (EST)

    7:00pm: Intro
    7:05pm: Foundational Data Science for Personalized Communications w/ Nike
    7:45pm: Q&A

    Talk Abstract:

    Email communication is one of the most common activities on PCs and mobile devices and this holds true now more than ever. Nike invests a lot of effort into Email communication. Although the cost of sending one email may be small, the cost builds up as the number of emails aggregates. Also, the user engagement governs the reputation of Nike IP addresses and the KPIs. Therefore, it is important to identify the campaigns relevant to consumers with a high propensity of engagement.
    The goal of the Personalized Communications team is to serve the Nike consumers with the most relevant campaign emails at the right time with the right frequency. In this talk, Ankit will address important questions/topics relevant to the Personalization Communications team. He will also answer questions that explain how Nike is currently handling personalized communications, what’s working well and what’s not, and how to build the data foundation for Personalized Communications. Finally, Ankit will do a deep-dive into some of the data science models that the team is currently working on.

    Speaker bio:

    Ankit has 4+ years of experience informing business decisions through data science and statistical modeling. He is currently working as Sr. Data Scientist at Personalization and Data Science team at Nike. His previous experiences include working in Ad Tech and Finance industry.
  • Machine Learning for Optimizing Cloud Server Spend with Aptitive Recorded: Oct 19 2020 26 mins
    Ryan Lewis - Data Science Consultant at Aptitive
    Calculating your capacity needs for a cloud database server is not for the faint of heart. Underestimate and you’re stuck without enough capacity to run your operations. Overestimate and you could overspend by millions.

    In this session, Ryan Lewis from Aptitive will show you how he used machine learning to not only calculate the capacity needed but also to optimize spend based on the time of day and the day of the week. This solution helped a global financial institution to save an estimated $19 millions dollars annually by toggling server capacity as needed.
  • What can you do with unstructured text data? w/ PwC Recorded: Oct 15 2020 47 mins
    Abdallah Musmar (Data Science Lead @ PwC)
    Tentative Schedule: (EST)

    2:00pm: Intro
    2:05pm: What can you do with unstructured text data? w/ PwC
    2:45pm: Q&A

    Talk Abstract:

    In this talk we will explore the opportunities that arise from unstructured text data. Then we will take a deep dive into a few concepts that are used in applying Machine Learning to text data and discuss how can they be leveraged using deep learning and other methods

    Speaker Bio:

    Abdallah Musmar is a Manager at PricewaterhouseCoopers. He is a technical lead for machine learning projects that help the firm improve their internal business processes. He also is a PhD candidate at the University of South Florida and his research focuses on the use of Machine Learning and Agent Based Modeling in Business. He also is managing a Citizen Data Science program for undergraduate business students at the University of South Florida which he built from scratch. In 2020, he has received an award for one of his projects to be one of "The best of AI in PwC" in a worldwide AI summit made internally by the firm.

    Disclaimer: All views, thoughts, & opinions expressed in the webinar belong solely to the panelists, & not to the panelists’ employer, organization, committee, other group or individual.
  • Improve Supply Chain and Marketing Data Science Models with Weather Data Recorded: Oct 8 2020 57 mins
    Christian Schneider (wetter.com), France Hureaux (wetter.com)
    From production to customer behavior, weather is a powerful parameter to monitor and master. Meteonomiqs data science team will share with you some of the technical requirements to build the best-in-class models including weather data. We demonstrate together with Dataiku DSS the usage of the plugin and illustrate how easy it is for citizen data scientists to build data flows.

    Speakers:
    - Christian Schneider, Senior Data Scientist at wetter.com
    - France Hureaux, Senior Strategy Manager at wetter.com

    Please be aware that by registering for this webinar, you agree to have your personal information shared with Dataiku's partner wetter.com. They may contact you with information that could be of interest to you.
  • Completing the Demand Planning Cycle with Data Science Recorded: Oct 1 2020 60 mins
    Dan Roozemond (EyeOn), Martin Daudey (EyeOn)
    The steps for a typical demand planning cycle are: review & correct historical data, statistical base line forecast, enrichment by marketing/sales, consolidate and consensus meeting, validate results & improve.

    Learn in this session how data science can help you completing the cycle in shorter intervals and with higher quality by integrating a data science platform into the planning cycle.

    Speakers:
    - Dan Roozemond, Data Science Team Lead at EyeOn
    - Martin Daudey, Sr. Business and Solutions Consultant at EyeOn

    Please be aware that by registering for this webinar, you agree to have your personal information shared with Dataiku's partner EyeOn. They may contact you with information that could be of interest to you.
  • Large Scale Machine Learning via SQL on Google BigQuery with BQML w/ Google Recorded: Sep 29 2020 58 mins
    Sanjay Agravat (Machine Learning Specialist @ Google Cloud)
    Tentative Schedule: (EST)

    2:00pm: Intro
    2:05pm: Large Scale Machine Learning via SQL on Google BigQuery with BQML w/ Google
    2:45pm: Q&A

    Talk Abstract:

    In this talk, Sanjay will discuss how to perform machine learning using SQL for a variety of model types and the flexibility of using BQML to import and export models.

    Speaker bio:

    Sanjay Agravat is a Machine Learning Specialist for Google Cloud based out of Atlanta, GA. Sanjay started his career in Software Engineering and later pursued an academic calling as he completed his PhD in Biomedical Informatics at Emory University and went on to do a fellowship at Harvard Medical School. Now at Google Cloud, he helps educate customers on how to leverage Google Cloud Platform for ML/AI solutions and support them in their journey along the way.

    Disclaimer: All views, thoughts, & opinions expressed in the webinar belong solely to the panelists, & not to the panelists’ employer, organization, committee, other group or individual.
  • Dataiku Demo Days Ep.2: ML-Based Marketing Attribution Models Recorded: Sep 29 2020 50 mins
    Greg Cashman, Senior Sales Engineer, Dataiku
    Historically, marketing attribution has been a painstakingly manual process based on heuristic models that often turn out to be more difficult (and less effective) than necessary. And unfortunately, due to their perceived simplicity, many marketing teams turn to outdated techniques such as the last click heuristic, where all the conversion merits are attributed to the last media contact or channel the customer was exposed to. This does not adequately capture user engagement, and risks undervaluing campaigns and content that are critical to conversion. Fortunately, the significant advances in marketing AI and machine learning (ML) in recent years allow organizations to solve perennial problems in new, more efficient ways. Join us for Demo Days and in just 30 minutes we will show you how to build, deploy, monitor and optimize ML-based attribution models.

    Dataiku Demo Days is a series of expert-led demos on various high-value AI use cases, such as driving efficiencies in the data-to-insights process and maximizing campaign impact with AutoML. These digestible sessions are designed to help jumpstart your organizations’ data efforts and inject agility at every step of the process.
  • Making Deep Learning Solutions Accessible with Dataiku DSS Apps Recorded: Sep 29 2020 36 mins
    Dr. Robert Coop
    The Dataiku DSS 8.0 release introduces Apps, the ability to distribute your analytic project to a much broader audience such as subject matter experts and business analysts.


    In this session, Dr. Robert Coop, phData’s General Manager of Machine Learning, will demonstrate how apps can be used to allow end-users to classify emotions expressed by people in videos using deep learning. This talk will demonstrate how to take a complex project and to package it into an application that enables users to benefit from the results of deep-learning emotion classification without having to understand the analytic process.
Your Path to Enterprise AI
Dataiku is the centralized data platform that moves businesses along their data journey from analytics at scale to enterprise AI. By providing a common ground for data experts and explorers, a repository of best practices, shortcuts to machine learning and AI deployment/management, and a centralized, controlled environment, Dataiku is the catalyst for data-powered companies.

Customers like Unilever, GE, BNP Paribas, Santander use Dataiku to ensure they are moving quickly and growing exponentially along with the amount of data they’re collecting. By removing roadblocks, Dataiku ensures more opportunity for business-impacting models and creative solutions, allowing teams to work faster and smarter.

Embed in website or blog

Successfully added emails: 0
Remove all
  • Title: [SEASON 2 EP 5] Finding Community in Data Science with Reshama Shaikh
  • Live at: Jan 3 2020 7:15 pm
  • Presented by: Reshama Shaikh, Will Nowak, Triveni Gandhi
  • From:
Your email has been sent.
or close