AI Diaries: Weekly Updates #10

Esra Kuzu
Sep 29, 2024
6 min read

Welcome to this month's edition of AI Diaries: Weekly Updates! This week’s AI Diaries highlights some of the latest advancements and innovations in the world of AI.

AMD has introduced AMD-135M, a small language model optimized for its MI250 GPUs, designed for text generation and language comprehension tasks, marking a milestone in AI hardware innovation. PyTorch has launched torchao, a native library that enhances model performance and efficiency through advanced techniques like quantization and sparsity, offering up to 97% speedup in inference for deep learning models. Researchers from Universitat Politècnica de Catalunya have compared PyTorch optimization techniques in a study focused on energy efficiency in machine learning, revealing that dynamic quantization significantly reduces energy use while maintaining model accuracy. ReliabilityBench has unveiled a new tool for measuring the performance variability of large language models across key human cognitive domains, addressing unpredictability in real-world AI applications. RxEnvironments.jl introduces a reactive programming framework for simulating complex agent-environment interactions, paving the way for more advanced AI models in simulations. Lastly, torchao has also integrated low-bit optimizers and quantization techniques, contributing to a significant reduction in resource consumption for models like Llama 3, demonstrating its versatility in modern AI applications.

These stories offer valuable insights and showcase the remarkable progress being made in technology and AI.

Enjoy the read, and we invite you to share your thoughts in the comments below!

Let’s dive in.

Optimizing Energy Efficiency in Machine Learning: Evaluating PyTorch Optimization Techniques for Sustainable AI Development

TL;DR: A study by researchers from Universitat Politècnica de Catalunya evaluated PyTorch optimization techniques—dynamic quantization, pruning, and torch.compile—to improve energy efficiency in image classification tasks. Dynamic quantization was the most effective, significantly reducing inference time and energy use. Torch.compile provided a good balance between accuracy and energy efficiency, while local pruning showed little benefit.

What's The Essence?: This study focuses on optimizing the energy efficiency of machine learning models in PyTorch, especially during inference, which accounts for 90% of ML costs. Techniques like dynamic quantization, pruning, and torch.compile were compared to reduce energy consumption while maintaining performance.

How Does It Tick?: Dynamic quantization emerged as the most energy-efficient method, especially for smaller models, reducing both inference time and energy use. Torch.compile balanced accuracy and energy savings, while global pruning helped at 25%. Local pruning, however, did not improve performance or efficiency. These methods were evaluated on 42 models using popular datasets like ImageNet and CIFAR-10.

Why It Matters?: With rising concerns about the environmental impact of machine learning, optimizing models for energy efficiency is critical. This study highlights how specific PyTorch techniques can cut energy consumption and costs, offering practical strategies for developers focused on building sustainable AI models without sacrificing accuracy.

The Paper: https://arxiv.org/abs/2409.12878

---

MD Launches AMD-135M: A New AI Model to Boost Performance and Efficiency in Language Processing

AlphaProteo — Figure: AMD-135M Model Performance Versus Open-sourced Small Language Models on Given Tasks

TL;DR: AMD has launched its first small language model, AMD-135M, with 135 million parameters, optimized for AMD MI250 GPUs. Built on the LLaMA2 architecture, the model is pretrained on extensive datasets like SlimPajama and Project Gutenberg, excelling in text generation and language comprehension tasks. It’s easily deployable using Hugging Face's Transformers library.

What's The Essence?: AMD-135M is AMD’s debut in language models, boasting powerful architecture optimized for their hardware, specifically designed for natural language processing and text generation tasks. It stands out by offering a robust, efficient solution in the AI model space.

How Does It Tick?: The model uses 12 attention layers, a 768 hidden size, and advanced features like RMSNorm and RoPE embeddings. Pretrained on datasets like SlimPajama and Project Gutenberg, it's fine-tuned for better performance on tasks such as code generation, with ease of deployment through Hugging Face.

Why It Matters?: This release signals AMD’s entry into the competitive AI landscape, offering a highly efficient model that leverages its hardware capabilities. AMD-135M is poised to be a strong alternative in the growing language model market, particularly for developers and researchers looking for accessible AI solutions optimized for specific applications like text and code generation.

Check Out The Model: https://www.amd.com/en/developer/resources/technical-articles/introducing-amd-first-slm-135m-model-fuels-ai-advancements.html?utm_source=organic&utm_medium=community&utm_campaign=blog&utm_id=amd35

---

AI Model TxGNN Repurposes Existing Drugs for Rare Diseases

A representative image generated by Kling AI

TL;DR: Harvard researchers have developed an AI model, TxGNN, to repurpose existing drugs for rare diseases. This tool identified drug candidates for over 17,000 diseases, many without treatments, offering a faster and cost-effective way to discover therapies. The model also predicts potential side effects and provides explanations for its decisions.

What's The Essence?: TxGNN is an AI-powered tool designed to discover new treatments for rare and neglected diseases by repurposing existing drugs. It analyzes massive datasets to predict drug efficacy and side effects, providing insights that can lead to quicker therapeutic development.

How Does It Tick?: Trained on vast biomedical data, the AI identifies shared disease mechanisms and predicts drug candidates for thousands of conditions. It goes beyond existing models by handling a larger scope of diseases and offering rationale for its predictions, enhancing trust and transparency for clinicians.

Why It Matters?: With over 93% of rare diseases lacking treatments, TxGNN accelerates drug discovery, potentially closing health disparities. Repurposing existing drugs is faster and more economical than developing new ones, offering new hope for millions of patients worldwide.

For More: https://news.harvard.edu/gazette/story/2024/09/using-ai-to-repurpose-existing-drugs-for-treatment-of-rare-diseases/

---

Brazilian Startup RadarFit Uses AI to Promote Healthier Lifestyles

TorchGeo 0.6.0 Released by Microsoft: Helping Machine Learning Experts to Work with Geospatial Data — Users earn points that can be redeemed for prizes by uploading photos of healthy meals which are reviewed by AI with oversight by RadarFit’s stable of human subject-matter experts. Photo by Avener Prado.

TL;DR: RadarFit, a Brazilian startup founded by women, uses AI-powered gamification to motivate individuals to adopt healthier habits. The app, supported by Microsoft Azure’s AI services, tracks users’ activities, offers personalized health advice, and awards points for healthy choices. It has over a million users and helps companies reduce healthcare costs while encouraging employees to take better care of themselves.

What's The Essence?: RadarFit leverages AI to promote healthier lifestyles through gamification, offering personalized wellness plans to individuals and corporate clients. The app incentivizes users by awarding points for healthy actions, which can be redeemed for prizes or donated to causes, making health management fun and engaging.

How Does It Tick?: Powered by Microsoft Azure and OpenAI, RadarFit uses AI to analyze user data, suggest personalized actions, and automate meal and activity reviews. The app gamifies health choices with a points system, ensuring that users stay motivated. AI’s role streamlines tasks like nutrition tracking and adds custom features like meditation.

Why It Matters?: RadarFit not only helps individuals improve their health but also provides companies with a cost-effective wellness solution that reduces healthcare expenses. Its AI-driven approach makes healthy living more accessible and personalized, while promoting a culture of well-being in workplaces and communities.

For More: https://news.harvard.edu/gazette/story/2024/09/using-ai-to-repurpose-existing-drugs-for-treatment-of-rare-diseases/

---

AI Tool Magic Notes Revolutionizes Social Work in England

TL;DR: Social workers in England are using an AI system called Magic Notes to record meetings, draft letters, and suggest follow-up actions. The tool helps reduce paperwork and free up time for face-to-face work, but raises concerns about AI's influence on decision-making. Human oversight remains crucial, and there are calls for AI regulation in social care.

What's The Essence?: Magic Notes is an AI tool designed to assist social workers by recording and analyzing conversations, generating summaries, and suggesting actions. It aims to reduce administrative burdens and improve workflow while ensuring that decisions remain in human hands.

How Does It Tick?: The AI uses speech-to-text and natural language processing to record meetings, draft summaries, and suggest follow-up actions like writing letters to doctors. Social workers review and approve the AI-generated content before it’s filed, ensuring that human judgment is the final authority.

Why It Matters?: With high vacancy rates in social care, AI tools like Magic Notes can ease workloads, save time, and potentially reduce costs. However, concerns about AI’s role in sensitive decision-making and data privacy underscore the need for ethical guidelines and oversight in the use of AI in public services.

For More: https://www.theguardian.com/society/2024/sep/28/social-workers-england-ai-system-magic-notes?CMP=share_btn_url

---

AI Uncovers Hidden Nazca Geoglyphs in Peru's Desert

TL;DR: Archaeologists using AI discovered 303 new geoglyphs in Peru’s Nazca Desert, nearly doubling the number of known symbols. These giant carvings, dating back 2,000 years, include figures of birds, humans, and animals. AI accelerated the discovery process, identifying promising locations for fieldwork.

A geoglyph discovered in Nazca, Peru, with an annotated version to highlight the image. Photograph: University of Yamagata

What's The Essence?: AI was instrumental in identifying new Nazca geoglyphs, revealing hundreds of ancient symbols previously hidden in the desert. The collaboration between archaeologists and AI highlights how technology is transforming the pace and scope of archaeological discoveries.

How Does It Tick?: An AI model trained with high-resolution images of existing geoglyphs was used to scan large areas of the desert. It identified thousands of potential sites, which researchers then confirmed through fieldwork, aided by drones. This AI-human teamwork enabled the discovery of hundreds of new symbols.

Why It Matters?: This breakthrough almost doubles the number of known Nazca geoglyphs and demonstrates the game-changing role AI can play in archaeology. It not only helps researchers discover ancient sites faster but also opens doors to better understanding the Nazca culture and the purpose of these mysterious symbols.

For More: https://www.theguardian.com/world/2024/sep/26/nazca-lines-peru-new-geoglyphs

If you've read this far, you're amazing! 🌟 Keep striving for knowledge and continue learning! 📚✨