AI Hardware
DABench-LLM: New Framework Benchmarks Post-Moore AI Accelerators
Researchers introduce DABench-LLM, a standardized framework for evaluating dataflow AI accelerators designed for large language model inference in the post-Moore era.
Multimodal AI
Researchers introduce MMR-Bench, a comprehensive benchmark evaluating how well routing systems direct queries to optimal multimodal LLMs across diverse visual reasoning tasks.
Microsoft
Microsoft announces its Maia 200 custom AI accelerator, entering direct competition with Amazon and Google in the race to build proprietary silicon for AI workloads.
Multi-Agent Systems
AgentScope provides a flexible framework for orchestrating multiple LLM agents with built-in communication protocols, fault tolerance, and scalability features for complex AI workflows.
Big Tech
Upcoming earnings from Microsoft, Google, Meta, and Amazon will reveal whether massive AI infrastructure investments are delivering returns—a pivotal moment for the entire AI ecosystem.
LLM
Understanding key-value (KV) caching in transformer architectures reveals how modern LLMs achieve fast token generation. This core optimization is essential for efficient AI inference.
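The idea behind KV caching can be sketched in a few lines: during autoregressive decoding, each step appends its key and value vectors to a cache and attends over it, rather than recomputing keys and values for every past token. The sketch below is illustrative only; shapes and projections are stand-ins, not any framework's actual API.

```python
# Minimal sketch of KV caching in single-head autoregressive decoding.
import numpy as np

def attention(q, K, V):
    # q: (d,), K/V: (t, d). No causal mask is needed because the
    # cache only ever contains past and current positions.
    scores = K @ q / np.sqrt(q.shape[0])
    weights = np.exp(scores - scores.max())
    weights /= weights.sum()
    return weights @ V

rng = np.random.default_rng(0)
d = 8
K_cache = np.empty((0, d))  # grows by one row per generated token
V_cache = np.empty((0, d))

for step in range(4):
    x = rng.standard_normal(d)       # current token's hidden state
    q, k, v = x, x * 0.5, x * 0.25   # stand-ins for learned projections
    # Append this step's key/value once; past entries are reused,
    # so per-step attention cost is O(t) instead of recomputing O(t) K/V pairs.
    K_cache = np.vstack([K_cache, k])
    V_cache = np.vstack([V_cache, v])
    out = attention(q, K_cache, V_cache)

print(K_cache.shape)  # one cached key row per generated token
```

This is why generation speed stays high as context grows: the quadratic recomputation of past keys and values is traded for linear memory in sequence length.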
Nvidia
Nvidia reportedly backs Baseten in a new funding round, signaling the chipmaker's strategic push into the AI inference infrastructure that powers real-time video generation and synthetic media applications.
AI Hardware
Micron declares AI-driven memory shortage 'unprecedented,' predicting supply constraints will persist beyond 2026 as demand for high-bandwidth memory outpaces production capacity.
Microsoft
Microsoft's spending on Anthropic AI is reportedly on track to reach $500 million, signaling a major strategic shift in AI partnerships beyond its OpenAI investment.
LLM Compression
New research introduces hierarchical sparse-plus-low-rank compression for LLMs, combining structured sparsity with matrix decomposition for efficient model deployment.
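The general sparse-plus-low-rank idea can be sketched as approximating a weight matrix W ≈ S + UV, where S keeps only the largest-magnitude entries and UV is a truncated-SVD fit of the residual. This is a generic illustration of the decomposition family, not the paper's specific hierarchical method; the threshold and rank choices here are arbitrary.

```python
# Hedged sketch: decompose W into a sparse part S plus a rank-r
# factorization U @ V of the residual (generic, not the paper's method).
import numpy as np

def sparse_plus_low_rank(W, keep_frac=0.05, rank=4):
    # S: keep the top keep_frac of entries by magnitude, zero the rest.
    thresh = np.quantile(np.abs(W), 1 - keep_frac)
    S = np.where(np.abs(W) >= thresh, W, 0.0)
    # Fit the low-rank part to the residual W - S via truncated SVD.
    U_, s, Vt = np.linalg.svd(W - S, full_matrices=False)
    U = U_[:, :rank] * s[:rank]  # fold singular values into U
    V = Vt[:rank, :]
    return S, U, V

rng = np.random.default_rng(1)
W = rng.standard_normal((64, 64))
S, U, V = sparse_plus_low_rank(W)
rel_err = np.linalg.norm(W - (S + U @ V)) / np.linalg.norm(W)
print(S.size, int((S != 0).sum()), round(rel_err, 3))
```

Storing S (sparse) plus U and V (two thin matrices) is far cheaper than storing W densely, which is the deployment win such compression schemes target.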
LLM
New research introduces a universal latent space approach for cost-efficient LLM routing, enabling zero-shot model selection without task-specific training data or expensive benchmarking.
LLM Quantization
New quantization method FLRQ achieves up to 2.5x faster compression of large language models while maintaining accuracy through flexible low-rank matrix approximation techniques.