Week 52
5 Minutes of Data Science - week 52 of 2022
Highlights from December 26 to January 01
Foreword
Happy new year, folks!
It's always nice to reach the end of the year and see all the recap articles. Plenty of them in this issue. Enjoy the goodies from last week!
And see you next week!
Blogs
- Stories that inspired us in 2022, by Amazon Science
- The 10 top articles of 2022, by Amazon Science
- Top 10 blog posts of 2022, by Amazon Science
- The 10 most viewed publications of 2022, by Amazon Science
- Improving automatic discrimination of logos with similar texts, by Amazon Science
Podcasts
- Reinforcement Learning for Personalization at Spotify with Tony Jebara, by The TWIML AI
- Will ChatGPT take my job?, by The TWIML AI
- Data-Driven Thinking for the Everyday Life, by DataFramed
YouTube
- Prof. PEDRO DOMINGOS - There are no infinities, utility functions, neurosymbolic, by Machine Learning Street Talk
Reddit
- Pre screening tests be like, at r/Data Science (💬113)
- The job description of this unpaid internship is insane, at r/Data Science (💬135)
- ChatGPT Extension for Jupyter Notebooks: Personal Code Assistant, at r/Data Science (💬31)
- We finally got Text-to-PowerPoint working!! (Generative AI for Slides ✨), at r/Machine Learning (💬51)
- Cramming: Training a Language Model on a Single GPU in One Day, at r/Machine Learning (💬24)
- Compromised PyTorch-nightly dependency, at r/Machine Learning (💬21)
- How would you explain "degrees of freedom" to a non-stat expert?, at r/Ask Statistics (💬11)
- Is there a statistical test that can be done to see if there is a big difference between the numbers per columns?, at r/Ask Statistics (💬27)
- Statistics question thats literally bothering me, at r/Ask Statistics (💬5)
- 2022: A Year Full of Amazing AI papers - A Review, at r/Latest in ML (💬1)
GitHub Jupyter Notebook trends
- Open-Assistant: OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
- annotated_deep_learning_paper_implementations: 🧑🏫59 Implementations/tutorials of deep learning papers with side-by-side notes📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), gans(cyclegan, stylegan2, ...),🎮reinforcement learning (ppo, dqn), capsnet, distillation, ...🧠
- machine-learning-book: Code Repository for Machine Learning with PyTorch and Scikit-Learn
- Data-science: Collection of useful data science topics along with code and articles
- pytorch-deep-learning: Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course.
- Complete-Python-3-Bootcamp: Course Files for Complete Python 3 Bootcamp Course on Udemy
- tensorflow-deep-learning: All course materials for the Zero to Mastery Deep Learning with TensorFlow course.
- Grokking-Deep-Learning: this repository accompanies the book "Grokking Deep Learning"
- ML-For-Beginners: 12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
- Machine-Learning-Specialization-Coursera: Contains Solutions and Notes for the Machine Learning Specialization By Stanford University and Deeplearning.ai - Coursera (2022) by Prof. Andrew NG
- yolov7: Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
- be-theboss-in-python: This repo helps you to be the boss in Python.
- codespaces-jupyter: Explore machine learning and data science with Codespaces
- diff-svc: Singing Voice Conversion via diffusion model
- zero-to-mastery-ml: All course materials for the Zero to Mastery Machine Learning and Data Science course.
- mlops-zoomcamp: Free MLOps course from DataTalks.Club
- python-essential-training-2449125: Python Essential Training
- algorithmic-trading-python: The repository for freeCodeCamp's YouTube course, Algorithmic Trading in Python
- py: Repository to store sample python programs for python learning
- SQL-Data-Analysis-and-Visualization-Projects: SQL data analysis & visualization projects using MySQL, PostgreSQL, SQLite, Tableau, Apache Spark and pySpark.
- TTS: 🤖💬Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
- Stock-Prediction-Models: Gathers machine learning and deep learning models for Stock forecasting including trading bots and simulations
- data-engineering-zoomcamp: Free Data Engineering course!
- fastbook: The fastai book, published as Jupyter Notebooks
GitHub Python trends
- gpt_index: An index created by GPT to organize external information and answer queries!
- DiT: Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
- PaLM-rlhf-pytorch: Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
- awesome-python: A curated list of awesome Python frameworks, libraries, software and resources
- python-cheatsheet: Comprehensive Python Cheatsheet
- trlx: A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
- CodeFormer: [NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
- tortoise-tts: A multi-voice TTS system trained with an emphasis on quality
- gpt-discord-bot: Example Discord bot written in Python that uses the completions API to have conversations with the text-davinci-003 model, and the moderations API to filter the messages.
- Auto-Synced-Translated-Dubs: Automatically translates the text of a video based on a subtitle file, and also uses AI voice to dub the video, and synced using the subtitle's timings
- ultimatevocalremovergui: GUI for a Vocal Remover that uses Deep Neural Networks.
- linkedin-skill-assessments-quizzes: Full reference of LinkedIn answers 2022 for skill assessments (aws-lambda, rest-api, javascript, react, git, html, jquery, mongodb, java, Go, python, machine-learning, power-point) linkedin excel test lösungen, linkedin machine learning test LinkedIn test questions and answers
Happy 2023!