Alluxio Blog

Maximize GPU Utilization for Model Training

April 3, 2024 By Hope Wang

GPU utilization or GPU usage, is the percentage of GPUs’ processing power being used at a particular time. As GPUs are expensive resources, optimizing their utilization and reducing idle time is essential for enterprise AI infrastructure. This blog explores bottlenecks hindering GPU utilization during model training and provides solutions to maximize GPU utilization. 1. Why … Continued

IWD 2024: Empower Women Developers in the Open-Source Community

March 29, 2024 By Hope Wang

This article was originally published on ITBrief. The author is Hope Wang, Developer Advocate, Alluxio. As we celebrate International Women’s Day, it is important to reflect on the progress we have made toward gender equality in the tech industry, particularly in open-source software (OSS). While there is still much work to be done, I am … Continued

Accelerating Data Loading in Large-Scale ML Training With Ray and Alluxio

January 23, 2024 By Lu Qiu, Chunxu Tang and Beinan Wang

In the rapidly-evolving field of artificial intelligence (AI) and machine learning (ML), the efficient handling of large datasets during training is becoming more and more pivotal. Ray has emerged as a key player, enabling large-scale dataset training through effective data streaming. By breaking down large datasets into manageable chunks and dividing training jobs into smaller … Continued

The Best Content of 2023 – Our Favorite Things

January 23, 2024 By Chenjia Guo

2023 is over, so we’ve compiled a collection of 2023’s most popular content according to our readers. In case you missed anything, here’s your chance to catch up on best practices ebooks, technical blogs, hands-on videos, webinars and more. Enjoy! ALL THINGS AI Building High-performance Data Access Layer for Model Training and Model Serving for … Continued

Setting the Stage for Alluxio Community to Soar in the Year of the Dragon: 2023 Recap and 2024 Outlook

January 9, 2024 By Hope Wang, Chanchan Mao, Bin Fan, Shouwei Chen, Tango Tian, Tianyu Wang, Shun Lv and Allan Sha

As we step into 2024, we look back and celebrate an incredible year of 2023 for the Alluxio community. First and foremost, thank you to all of our contributors and the broader community! Together, we have achieved remarkable milestones. 💖 📈 Highlights by Numbers Let’s take a look at the Alluxio in 2023 by numbers. … Continued

A Journey Towards Data Locality on Cloud for Machine Learning and AI

December 18, 2023 By Lu Qiu and Shawn Sun

In this blog, we discuss the importance of data locality for efficient machine learning on the cloud. We examine the pros and cons of existing solutions and the tradeoff between reducing costs and maximizing performance through data locality. We then highlight the new-generation Alluxio design and implementation, detailing how it brings value to model training … Continued

Beyond the Hype: 10 Core Principles for AI Success

December 13, 2023 By Omid Razavi

This article was initially posted on datanami. The paradigm shift ushered in by Artificial Intelligence (AI) in today’s business and technological landscapes is nothing short of revolutionary. AI’s potential to transform traditional business models, optimize operations, and catalyze innovation is vast. But navigating its complexities can be daunting. Organizations must understand and adhere to some foundational … Continued

Why Adding NAS/NFS on Object Storage May not Solve Your Data Access Problem of AI

November 28, 2023 By Tarik Bennett, Beinan Wang and Hope Wang

In this blog, we discuss the data access challenges in AI and why commonly used NAS/NFS may not be a good option for your organization. 1. Early Architecture of AI/ML According to Gartner, although LLMs are on the hype, most organizations are in the early stages, with some in production. In the early stages of … Continued

AI Infra Day Sessions Recap

November 16, 2023 By Chenjia Guo

Alluxio, the data platform company for all data-driven workloads, hosted the community event “AI Infra Day” on October 25, 2023. This virtual event brought together technology leaders working on AI infrastructure from Uber, Meta, and Intel, to delve into the intricate aspects of building scalable, performant, and cost-effective AI platforms. Bin Fan, Alluxio’s Chief Architect … Continued