Week 10 of 2023
5 Minutes of Data Science - week 10
Highlights from March 06 to March 12
Blogs
- Using hypergraphs to improve product retrieval, by Amazon Science
- The science behind Astro’s graceful, responsive motion, by Amazon Science
- Amazon announces new CMU graduate research fellows, by Amazon Science
- From structured search to learning-to-rank-and-retrieve, by Amazon Science
- Diffusion Probabilistic Fields, by Apple Machine Learning
Newsletters
- LWiAI Podcast #114 - ChatGPT applications, Claude, PALM-E, OpenAI criticism, AI-generated spam, by Last Week in AI
- Don’t “Plan for AGI” Yet, by Last Week in AI
- Last Week in AI #209: ChatGPT API, ChatGPT’s new rival, generative AI continues to make waves, and more!, by Last Week in AI
- Import AI 319: Sovereign AI; Facebook’s weights leak on torrent networks; Google might have made a better optimizer than Adam!, by Import AI
- Advanced Data Manipulation with Pandas, by The AI Edge
- Why XGBoost is better than GBM?, by The AI Edge
- The AiEdge+: All the Transformers Applications, by The AI Edge
- Maximizing the Potential of Large Language Models, by Gradient Flow
Podcasts
- Edge AI applications for military and space [RB] (Ep. 219), by Data Science At Home
- Bot Detection and Dyadic Surveys, by Data Skeptic
- End-to-end cloud compute for AI/ML, by Practical AI
- Robotic Dexterity and Collaboration with Monroe Kennedy III - #619, by The TWIML AI
- Biohacking for Data Scientists and ML Engineers - Ruslan Shchuchkin, by Data Talks
- Edge AI applications for military and space [RB] (Ep. 219), by Data Science at Home
Youtube
- Prof. KARL FRISTON 3.0 - Collective Intelligence [Special Edition], by Machine Learning Street Talk
- Visual ChatGPT (Microsoft), by Machine Learning Street Talk
- Prof. Noam Chomsky, the father of modern linguistics, talking about ChatGPT #machinelearning, by Machine Learning Street Talk
- The AI Buzz, Episode #5: A new wave of AI-based products and the resurgence of personal applications, by StatQuest
- CatBoost Part 2: Building and Using Trees, by StatQuest
Reddit’s top posts
- Against all stigma, I love being a SQL monkey!, at r/Data Science (💬156)
- Overpaid and don’t see the point, at r/Data Science (💬273)
- Rich Jupyter Notebook Diffs on GitHub… Finally., at r/Data Science (💬28)
- Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models, at r/Machine Learning (💬29)
- GPT-4 is coming next week – and it will be multimodal, says Microsoft Germany - heise online, at r/Machine Learning (💬81)
- [Discussion] Compare OpenAI and SentenceTransformer Sentence Embeddings, at r/Machine Learning (💬48)
- why is n-1 used instead of N for sample of a population?, at r/Ask Statistics (💬16)
- The possibility of having both your children be gay?, at r/Ask Statistics (💬18)
- I’m studying about linear regression and I had a question about the normal equation of simple linear regressions, at r/Ask Statistics (💬6)
- AI generated video chapter titles (YouTube, Vimeo, etc), at r/Latest in ML (💬5)
- Turn mockups into videos automatically! Gen-1, the future of storytelling? Gen-1 is the new Stable diffusion for videos by runwayml., at r/Latest in ML (💬1)
Github jupyter notebook trends
- stable-diffusion-webui-colab: stable diffusion webui colab
- openai-cookbook: Examples and guides for using the OpenAI API
- whisper: Robust Speech Recognition via Large-Scale Weak Supervision
- models: Models and examples built with TensorFlow
- Machine-Learning-Goodness: The Machine Learning repository contains ML/DL projects, notebooks, cheat codes of ML/DL/AI, useful information on AI/AGI and codes or coding snippets/scripts/tasks.
- stable-diffusion: A latent text-to-image diffusion model
- google-research: Google Research
- data-engineering-zoomcamp: Free Data Engineering course!
- CLIP: CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
- nerf: Code release for NeRF (Neural Radiance Fields)
- DL-Algorithms: None
- openvino_notebooks: 📚Jupyter notebook tutorials for OpenVINO™
- mlops-zoomcamp: Free MLOps course from DataTalks.Club
- Prompt-Engineering-Guide: 🐙Guides, papers, lecture, and resources for prompt engineering
- stable_diffusion_chilloutmix_ipynb: AUTOMATIC1111 Stable Diffusion WebUI 1.5 + ChilloutMix + Kohya’s Scripts
- notebooks: Notebooks using the Hugging Face libraries🤗
- introtodeeplearning: Lab Materials for MIT 6.S191: Introduction to Deep Learning
- learnopencv: Learn OpenCV : C++ and Python Examples
- yolov7: Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
- tutorials: MONAI Tutorials
Github python trends
- bilingual_book_maker: Make bilingual epub books Using AI translate
- text-generation-webui: A gradio web UI for running Large Language Models like GPT-J 6B, OPT, GALACTICA, LLaMA, and Pygmalion.
- llama: Inference code for LLaMA models
- llama_index: LlamaIndex (GPT Index) is a project that provides a central interface to connect your LLM’s with external data.
- xiaogpt: play chatgpt with xiaomi ai speaker
- 30-Days-Of-Python: 30 days of Python programming challenge is a step-by-step guide to learn the Python programming language in 30 days. This challenge may take more than100 days, follow your own pace.
- researchgpt: An open-source LLM based research assistant that allows you to have a conversation with a research paper
- langchain: ⚡Building applications with LLMs through composability⚡
- tinygrad: You like pytorch? You like micrograd? You love tinygrad!❤️
- ChuanhuChatGPT: GUI for ChatGPT API
- fast-stable-diffusion: fast-stable-diffusion + DreamBooth
- nebullvm: Plug and play modules to optimize the performances of your AI systems🚀
- OpenBBTerminal: Investment Research for Everyone, Anywhere.
- stable-diffusion-webui: Stable Diffusion web UI
- diagrams: 🎨Diagram as Code for prototyping cloud system architectures
- trl: Train transformer language models with reinforcement learning.
- chat-langchain: None
- PayloadsAllTheThings: A list of useful payloads and bypass for Web Application Security and Pentest/CTF
- nicegui: Create web-based UI with Python. The nice way.
- pandas: Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
- DeepSpeed: DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Don't miss what's next. Subscribe to 5 minutes of Data Science: