Hugging Face stories
DataStax is poised to unveil its cutting-edge AI technology at RAG++ Sydney, claiming to enhance RAG application development by 100 times.
Intel has unveiled Xeon 6 processors and Gaudi 3 AI accelerators, aiming to boost AI performance and efficiency. Industry giants like Dell and IBM back the launch.
Amazon Web Services has launched its AWS Trainium2-powered EC2 instances, offering improved performance for training large AI models at lower costs.
Elastic has unveiled its AI Ecosystem, aiming to accelerate the development of Retrieval Augmented Generation applications for enterprise developers.
Red Hat launches OpenShift 4.17 with AI, edge, cloud and security upgrades, enhancing hybrid cloud, model management, and edge computing capabilities.
Endor Labs launches Endor Scores for AI Models, enabling developers to evaluate the security and quality of open source AI models on Hugging Face.
Dell Technologies has expanded its AI solutions portfolio with five new PowerEdge servers featuring AMD's latest EPYC processors, enhancing enterprise performance.
Dell Technologies is expanding its AI Factory with new PowerEdge servers powered by AMD's 5th Generation EPYC processors to boost enterprise AI adoption.
Black Forest Lab has enhanced its FLUX.1 AI model suite with NVIDIA TensorRT, boosting performance by up to 20% for high-quality image generation.
Teradata unveils enhanced capabilities for VantageCloud Lake, enabling organisations to deploy generative AI via open LLMs, boosting efficiency and ROI.
Elastic has teamed up with Hugging Face to integrate its models into Elasticsearch's Open Inference API, streamlining generative AI development for developers.
Oracle has rolled out new OCI Generative AI Agents, incorporating RAG capabilities and enhanced AI features, aiming to simplify AI applications in business operations.
Oracle unveils GenDev infrastructure to revolutionise AI app development, leveraging Oracle Database 23ai, Autonomous Database, and new affordable pricing plans.
NVIDIA launches the compact Mistral-NeMo-Minitron-8B-Base, combining pruning and distillation for high accuracy AI on RTX-powered workstations and edge devices.
DataStax to reveal major GenAI platform updates at RAG++ in San Francisco, promising 100x faster RAG-powered app development with new tools and integrations.
Cloudera has unveiled three AI-driven assistants to speed up data and AI business apps, as 84% of Asia-Pacific firms embrace AI for business impact.
At Computex 2024, AMD CEO Dr Lisa Su unveiled an aggressive expansion of their Instinct accelerator roadmap, featuring the MI325X, MI350, and MI400 series aimed at revolutionising AI in data centres.
At COMPUTEX 2024, NVIDIA CEO Jensen Huang has unveiled more details of the development building blocks they're calling NVIDIA NIMs.
At IBM's annual THINK conference, major updates to the watsonx platform were announced, including new open-source AI models, tools, and data capabilities, set to redefine enterprise AI.
Alibaba Cloud's Apsara Conference 2024 in Hangzhou spotlights substantial AI and cloud advancements, including Qwen 2.5 models and partnerships with NVIDIA and UNESCO.