AI Inference Chip Showdown

Introduction to AI Inference Processing

Everyone is not just talking about AI inference; they are doing it. Analyst firm Gartner released a new report this week forecasting that global generative AI spending will hit $644 billion in 2025, growing 76.4% year-over-year. Meanwhile, MarketsandMarkets projects that the AI inference market will grow from $106.15 billion in 2025 to $254.98 billion by 2030. However, buyers still need to know which AI processor to buy, especially as inference has evolved from a simple one-shot pass through a model to agentic and reasoning workloads that can increase computational requirements by some 100-fold.
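
To see where a figure like 100-fold can come from, here is a minimal back-of-the-envelope sketch in Python. All token counts are illustrative assumptions, not measurements; it simply treats inference cost as roughly proportional to tokens generated and processed, and compares a one-shot answer against a multi-step reasoning agent.

```python
# Back-of-the-envelope: why agentic/reasoning inference costs so much more.
# All numbers below are illustrative assumptions, not benchmark results.

ONE_SHOT_OUTPUT_TOKENS = 300        # a single direct answer

REASONING_TOKENS_PER_STEP = 1_500   # chain-of-thought tokens per agent step
TOOL_RESULT_TOKENS = 1_500          # tool output re-processed each step
AGENT_STEPS = 10                    # plan / tool-call / reflect iterations

def one_shot_cost() -> int:
    """Token cost of a single pass through the model."""
    return ONE_SHOT_OUTPUT_TOKENS

def agentic_cost() -> int:
    """Token cost of a multi-step agent that reasons and calls tools."""
    return AGENT_STEPS * (REASONING_TOKENS_PER_STEP + TOOL_RESULT_TOKENS)

ratio = agentic_cost() / one_shot_cost()
print(f"one-shot: {one_shot_cost():,} tokens")
print(f"agentic:  {agentic_cost():,} tokens (~{ratio:.0f}x)")
```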

Performance Continues to Skyrocket

For seven years, the not-for-profit group MLCommons has helped AI buyers and vendors by publishing peer-reviewed AI benchmark results each quarter. It has just released its MLPerf Inference v5.0 suite of results, with new chips, servers, and models. Let’s take a look.

The New Benchmarks

Three new data center benchmarks were added: the larger Llama 3.1 405B, Llama 2 70B with latency constraints for interactive work, and a new “R-GAT” benchmark for graph models. Only Nvidia ran all the models. A new edge-inference benchmark was also added: the Automotive PointPainting test for 3D object detection. MLCommons now manages 11 AI benchmarks in total.
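
As a rough illustration of what a latency-bounded “interactive” benchmark measures, here is a minimal Python sketch. The constraint values and request data are assumptions for illustration, not the official MLPerf limits; the point is simply that throughput only counts when every request also meets its time-to-first-token (TTFT) and time-per-output-token (TPOT) targets.

```python
# Minimal sketch of a latency-bounded serving metric.
# Constraint values are illustrative assumptions, not official MLPerf limits.
from dataclasses import dataclass

TTFT_LIMIT_S = 0.450   # assumed time-to-first-token limit (seconds)
TPOT_LIMIT_S = 0.040   # assumed time-per-output-token limit (seconds)

@dataclass
class Request:
    ttft_s: float        # measured time to first token
    tpot_s: float        # measured average time per output token
    output_tokens: int
    duration_s: float    # wall-clock time for the whole request

def interactive_throughput(requests: list[Request]) -> float:
    """Tokens/sec across requests, valid only if ALL requests meet the limits."""
    for r in requests:
        if r.ttft_s > TTFT_LIMIT_S or r.tpot_s > TPOT_LIMIT_S:
            raise ValueError("latency constraint violated; run does not qualify")
    total_tokens = sum(r.output_tokens for r in requests)
    total_time = max(r.duration_s for r in requests)  # requests run concurrently
    return total_tokens / total_time

# Example with made-up measurements:
reqs = [Request(0.30, 0.025, 256, 7.0), Request(0.40, 0.030, 512, 16.0)]
print(f"{interactive_throughput(reqs):,.0f} tokens/sec")
```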

The New Chips

AI is built on silicon, and MLCommons received submissions for six new chips this round:

  • AMD Instinct MI325X (launched last fall)
  • Intel Xeon 6980P “Granite Rapids” CPU
  • Google TPU Trillium (TPU v6e), now generally available
  • Nvidia B200 (Blackwell)
  • Nvidia Jetson AGX Thor 128 for AI at the edge
  • And, perhaps most importantly, the Nvidia GB200, the beast that powers the NVL72 rack that has data centers scrambling for power and cooling

The New Results: Nvidia

As usual, Nvidia won all benchmarks; this time, it won by a lot. First, the B200 tripled the performance of the H200 platform, delivering over 59,000 tokens per second on the latency-bounded Llama 2 70B Interactive benchmark. The new Llama 3.1 405B model runs 3.4 times faster on Blackwell. Now for the real test: is the NVL72 as fast as Nvidia promised at launch? Yes: it is thirty times faster than the 8-GPU H200 system running the new Llama 405B, though it has 9 times more GPUs.
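
A quick sanity check on that claim, normalizing the system-level speedup by GPU count (a rough comparison that ignores interconnect and scale effects):

```python
# Normalize the NVL72-vs-H200 result to a per-GPU figure.
# System speedup and GPU counts are taken from the reported results above.
system_speedup = 30   # NVL72 vs 8-GPU H200 on Llama 3.1 405B
nvl72_gpus = 72
h200_gpus = 8

per_gpu_speedup = system_speedup / (nvl72_gpus / h200_gpus)
print(f"per-GPU speedup: ~{per_gpu_speedup:.1f}x")   # ~3.3x
```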

Nvidia Performance

The new Llama 3.1 405B benchmark supports input and output lengths up to 128,000 tokens (compared to only 4,096 tokens for Llama 2 70B). The benchmark tests three distinct tasks: general question-answering, math, and code generation. And when you add Nvidia’s new open-source Dynamo “AI Factory OS,” which optimizes AI at the data center level, AI factory throughput can double again running Llama, and run some thirty times faster on DeepSeek.
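
Taken at face value, those vendor-reported multipliers compound. A trivial sketch, under the optimistic assumption that the hardware and software gains are independent:

```python
# Compound the reported speedups (vendor figures; treating them as
# independent is an optimistic simplification).
blackwell_vs_hopper = 3.4   # Llama 3.1 405B, B200 vs H200 (reported above)
dynamo_gain = 2.0           # Dynamo throughput doubling on Llama (reported)

print(f"combined: ~{blackwell_vs_hopper * dynamo_gain:.1f}x over Hopper")  # ~6.8x
```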

And, Surprise, AMD Has Rejoined the MLPerf Party!

Welcome back, AMD! The new AMD Instinct MI325X did quite well on the select benchmarks AMD ran, competing admirably with Nvidia’s previous-generation Hopper GPUs. So, for AI practitioners who know what they are doing and don’t need the value of Nvidia’s software stack, the MI325X can save a lot of money. AMD also did quite well on the Llama 3.1 405B serving benchmark (distinct from the interactive 405B benchmark mentioned previously), and proudly noted that Meta now uses the (older) MI300X as the exclusive inference server for its 405B model.

Conclusion

Nvidia retains the crown of AI King across all AI applications. Although competition is on the horizon, AMD delivers competitive performance only against the previous Nvidia GPU generation. AMD expects the MI350, due later this year, to close the gap; however, by then the GB300 should keep Nvidia in the lead at the GPU level. But the real issue is that while everyone else is trying to compete at the GPU level, Nvidia keeps raising the bar at the data center level, with massive investments in software, solutions, and products that ease AI deployment and lower TCO.

FAQs

  • Q: What is the forecast for global generative AI spending in 2025?
    A: Global generative AI spending is expected to hit $644 billion in 2025, growing 76.4% year-over-year.
  • Q: What is the projected growth of the AI inference market from 2025 to 2030?
    A: The AI inference market is expected to grow from $106.15 billion in 2025 to $254.98 billion by 2030.
  • Q: What is the performance of Nvidia’s B200 chip compared to the H200 platform?
    A: The B200 chip tripled the performance of the H200 platform, delivering over 59,000 tokens per second on the latency-bounded Llama 2 70B Interactive model.
  • Q: How does AMD’s MI325X chip perform compared to Nvidia’s Hopper GPU?
    A: The AMD MI325X competes admirably with the previous-generation Hopper GPU, and can save AI practitioners a lot of money if they don’t need Nvidia’s software stack.
  • Q: What is the significance of Nvidia’s Dynamo “AI Factory OS”?
    A: Nvidia’s Dynamo “AI Factory OS” optimizes AI at the data center level, allowing for doubled throughput and lower TCO.