Issue #8, Monday, April 6, 2020.
Have a great week!
Félix
🏆 Most popular link last week: Top 20 Data Science YouTube Channels you should subscribe to in 2020
Articles, news, and announcements
Uber Open-Sources ‘Fiber’, A Python Distributed Computing Library For Modern Computer Clusters
Uber introduces Fiber, a Python-based distributed computing library for modern computer clusters. It lets you program a whole computer cluster rather than a single desktop or laptop. It was originally developed to power large-scale parallel scientific computation projects such as POET, Go-Explore, and GTN.
A visual debugger for Jupyter
Jupyter users like to experiment in the notebook, and to use the notebook as an interactive communication tool. However, for more classical software development tasks such as the refactoring of a large codebase, they often switch to general-purpose IDEs. [...] Today, after several months of development, we are glad to announce the first public release of the Jupyter visual debugger!
Google launches new tool to provide insights into social distancing
Google has launched a new tool which will show places where crowds had been gathering, in response to calls from public health officials to get better insights into whether social distancing measures are working to slow down the spread of Covid-19.
Building an Incremental Recommender System
A recommender system should ideally adapt to changes as they happen.
[Podcast] Thriving in a remote developer environment
Being effective while working remotely, away from the office, is an increasingly valuable skill that most of us in the tech industry have to quickly embrace.
[Video] Building Differentially private Machine Learning Models Using TensorFlow Privacy
This talk will introduce differential privacy and its use cases, discuss the new component of the TensorFlow Privacy library, and offer real-world scenarios for how to apply the tools.
[Video] Data Science @ The New York Times
The Data Science group at The New York Times develops and deploys machine learning solutions to newsroom and business problems. Re-framing real-world questions as machine learning tasks requires not only adapting and extending models and algorithms to new or special cases but also sufficient breadth to know the right method for the right challenge.
Events
[ONLINE] Data science and analytics for emerging stronger from the crisis
Tuesday 07 April 2020 @ 12:00
Your company has accumulated a ton of data: financial, marketing, operational. You have a great opportunity to use it to see more clearly, to weather the crisis better, and above all to prepare for the recovery.
[ONLINE] AI Model Governance in a High-Compliance Industry
Wednesday 08 April 2020 @ 14:00
Meetup Montreal R User Group: Model governance defines a collection of best practices for data science: versioning, reproducibility, experiment tracking, automated CI/CD, and others. In a high-compliance setting where the data used for training or inference contains private health information (PHI) or similarly sensitive data, further requirements apply.
[ONLINE] All Day DevOps Conference
Friday 17 April 2020 @ 08:00
The world’s largest DevOps conference is back for a special Spring Break Edition on April 17, 2020. 10 hours. 28 speakers. Free. Online.
[ONLINE] Reducing Machine Learning Inference Cost for PyTorch Models
Tuesday 21 April 2020 @ 16:00
In deep learning applications, inference accounts for up to 90% of compute cost. To reduce this high inference cost, you can use Amazon Elastic Inference, which allows you to attach just the right amount of GPU-powered inference acceleration to any EC2 or SageMaker instance type or ECS task.
[ONLINE] AWS | Databricks Cloud Data Lake Dev Day Virtual Workshop
Wednesday 22 April 2020 @ 09:00
In this virtual workshop, we’ll cover best practices for using powerful open source technologies to build on and extend your AWS investments and make your data lake analytics-ready.