5 minutes of Data Science

Subscribe
Archives
March 13, 2023

Week 10 of 2023

5 Minutes of Data Science - week 10

Highlights from March 06 to March 12


Blogs

  • Using hypergraphs to improve product retrieval, by Amazon Science
  • The science behind Astro’s graceful, responsive motion, by Amazon Science
  • Amazon announces new CMU graduate research fellows, by Amazon Science
  • From structured search to learning-to-rank-and-retrieve, by Amazon Science
  • Diffusion Probabilistic Fields, by Apple Machine Learning

Newsletters

  • LWiAI Podcast #114 - ChatGPT applications, Claude, PALM-E, OpenAI criticism, AI-generated spam, by Last Week in AI
  • Don’t “Plan for AGI” Yet, by Last Week in AI
  • Last Week in AI #209: ChatGPT API, ChatGPT’s new rival, generative AI continues to make waves, and more!, by Last Week in AI
  • Import AI 319: Sovereign AI; Facebook’s weights leak on torrent networks; Google might have made a better optimizer than Adam!, by Import AI
  • Advanced Data Manipulation with Pandas, by The AI Edge
  • Why XGBoost is better than GBM?, by The AI Edge
  • The AiEdge+: All the Transformers Applications, by The AI Edge
  • Maximizing the Potential of Large Language Models, by Gradient Flow

Podcasts

  • Edge AI applications for military and space [RB] (Ep. 219), by Data Science At Home
  • Bot Detection and Dyadic Surveys, by Data Skeptic
  • End-to-end cloud compute for AI/ML, by Practical AI
  • Robotic Dexterity and Collaboration with Monroe Kennedy III - #619, by The TWIML AI
  • Biohacking for Data Scientists and ML Engineers - Ruslan Shchuchkin, by Data Talks
  • Edge AI applications for military and space [RB] (Ep. 219), by Data Science at Home

Youtube

  • Prof. KARL FRISTON 3.0 - Collective Intelligence [Special Edition], by Machine Learning Street Talk
  • Visual ChatGPT (Microsoft), by Machine Learning Street Talk
  • Prof. Noam Chomsky, the father of modern linguistics, talking about ChatGPT #machinelearning, by Machine Learning Street Talk
  • The AI Buzz, Episode #5: A new wave of AI-based products and the resurgence of personal applications, by StatQuest
  • CatBoost Part 2: Building and Using Trees, by StatQuest

Reddit’s top posts

  • Against all stigma, I love being a SQL monkey!, at r/Data Science (💬156)
  • Overpaid and don’t see the point, at r/Data Science (💬273)
  • Rich Jupyter Notebook Diffs on GitHub… Finally., at r/Data Science (💬28)
  • Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models, at r/Machine Learning (💬29)
  • GPT-4 is coming next week – and it will be multimodal, says Microsoft Germany - heise online, at r/Machine Learning (💬81)
  • [Discussion] Compare OpenAI and SentenceTransformer Sentence Embeddings, at r/Machine Learning (💬48)
  • why is n-1 used instead of N for sample of a population?, at r/Ask Statistics (💬16)
  • The possibility of having both your children be gay?, at r/Ask Statistics (💬18)
  • I’m studying about linear regression and I had a question about the normal equation of simple linear regressions, at r/Ask Statistics (💬6)
  • AI generated video chapter titles (YouTube, Vimeo, etc), at r/Latest in ML (💬5)
  • Turn mockups into videos automatically! Gen-1, the future of storytelling? Gen-1 is the new Stable diffusion for videos by runwayml., at r/Latest in ML (💬1)

Github jupyter notebook trends

  • stable-diffusion-webui-colab: stable diffusion webui colab
  • openai-cookbook: Examples and guides for using the OpenAI API
  • whisper: Robust Speech Recognition via Large-Scale Weak Supervision
  • models: Models and examples built with TensorFlow
  • Machine-Learning-Goodness: The Machine Learning repository contains ML/DL projects, notebooks, cheat codes of ML/DL/AI, useful information on AI/AGI and codes or coding snippets/scripts/tasks.
  • stable-diffusion: A latent text-to-image diffusion model
  • google-research: Google Research
  • data-engineering-zoomcamp: Free Data Engineering course!
  • CLIP: CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
  • nerf: Code release for NeRF (Neural Radiance Fields)
  • DL-Algorithms: None
  • openvino_notebooks: 📚Jupyter notebook tutorials for OpenVINO™
  • mlops-zoomcamp: Free MLOps course from DataTalks.Club
  • Prompt-Engineering-Guide: 🐙Guides, papers, lecture, and resources for prompt engineering
  • stable_diffusion_chilloutmix_ipynb: AUTOMATIC1111 Stable Diffusion WebUI 1.5 + ChilloutMix + Kohya’s Scripts
  • notebooks: Notebooks using the Hugging Face libraries🤗
  • introtodeeplearning: Lab Materials for MIT 6.S191: Introduction to Deep Learning
  • learnopencv: Learn OpenCV : C++ and Python Examples
  • yolov7: Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
  • tutorials: MONAI Tutorials

Github python trends

  • bilingual_book_maker: Make bilingual epub books Using AI translate
  • text-generation-webui: A gradio web UI for running Large Language Models like GPT-J 6B, OPT, GALACTICA, LLaMA, and Pygmalion.
  • llama: Inference code for LLaMA models
  • llama_index: LlamaIndex (GPT Index) is a project that provides a central interface to connect your LLM’s with external data.
  • xiaogpt: play chatgpt with xiaomi ai speaker
  • 30-Days-Of-Python: 30 days of Python programming challenge is a step-by-step guide to learn the Python programming language in 30 days. This challenge may take more than100 days, follow your own pace.
  • researchgpt: An open-source LLM based research assistant that allows you to have a conversation with a research paper
  • langchain: ⚡Building applications with LLMs through composability⚡
  • tinygrad: You like pytorch? You like micrograd? You love tinygrad!❤️
  • ChuanhuChatGPT: GUI for ChatGPT API
  • fast-stable-diffusion: fast-stable-diffusion + DreamBooth
  • nebullvm: Plug and play modules to optimize the performances of your AI systems🚀
  • OpenBBTerminal: Investment Research for Everyone, Anywhere.
  • stable-diffusion-webui: Stable Diffusion web UI
  • diagrams: 🎨Diagram as Code for prototyping cloud system architectures
  • trl: Train transformer language models with reinforcement learning.
  • chat-langchain: None
  • PayloadsAllTheThings: A list of useful payloads and bypass for Web Application Security and Pentest/CTF
  • nicegui: Create web-based UI with Python. The nice way.
  • pandas: Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
  • DeepSpeed: DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Don't miss what's next. Subscribe to 5 minutes of Data Science:
GitHub X LinkedIn
Powered by Buttondown, the easiest way to start and grow your newsletter.