Llama 3.1 405-Billion-Parameter Model Released

Llama 3.1 405B is the first openly available model that rivals the top AI models when it comes to state-of-the-art capabilities in general knowledge, steerability, math, tool use, and multilingual translation. With the release of the 405B model, Meta says it will supercharge innovation, with unprecedented opportunities for growth and exploration. Meta believes the latest generation of Llama will …

Jensen and Elon Both See Future with Billions of Humanoid Robots

Nvidia CEO Jensen Huang says humanoid robots will become as common as cars and that significant breakthroughs in robotics will come in the next 2-3 years. That would mean about 2 billion humanoid robots. Elon Musk said he thinks there will be ten times as many humanoid robots in the future. This would be 20 …

Tesla Nvidia AI, Dojo and FSD Spending

Of the roughly $10 billion in AI-related expenditures Elon said Tesla would make this year, about half is internal, primarily the Tesla-designed AI inference computer and sensors present in all Tesla cars, plus Dojo. For building the AI training superclusters, Nvidia hardware is about 2/3 of the cost. Elon Musk’s current best guess for …

Nvidia Helps You to Build Lifelike Digital Humans to Transform Industries

NVIDIA ACE—a suite of technologies bringing digital humans to life with generative AI—is now generally available for developers. Packaged as NVIDIA NIMs, these inference microservices enable developers to deliver high-quality natural language understanding, speech synthesis, and facial animation for gaming, customer service, healthcare, and more. NVIDIA is also introducing ACE PC NIM microservices for deployment …

Groq Is 30 Days From Starting With Large Customers

Groq said that it will start operating an AI inference cluster with large businesses in 30 days. Groq made a presentation at the GenAI Summit 2024 in San Francisco. They are processing 30,000 inference input tokens and will put together about 1500 chips into an inference data center that will process 25 million inference …

Leaders in Self Driving Cars Say Tesla is by Far the Leader

On May 22, Nvidia CEO Jensen Huang said Tesla is far ahead in self-driving cars, and on May 15, Xu Baoqiang, general manager of Baidu’s autonomous driving vehicle department, said in an interview that Baidu is considering potential collaboration opportunities with Tesla for the latter’s upcoming Robotaxi service. Nvidia is supplying self-driving car companies and robotaxi …

Looking at Hardware for Running Local Large Language Models

ChatRTX is a demo app that lets you personalize a GPT large language model (LLM) connected to your own content—docs, notes, images, or other data. Leveraging retrieval-augmented generation (RAG), TensorRT-LLM, and RTX acceleration, you can query a custom chatbot to quickly get contextually relevant answers. It all runs locally on your Windows RTX PC or …
