Tag: IT Infrastructure

Race for Artificial Intelligence (AI) Infrastructure: Navigating the Best Path to Supercharge Your AI Strategy | Blog

As we stand on the brink of a new technological era, the rise of AI is reshaping our interactions with the digital world.  

The rapid proliferation of AI has intensified the demand for scalable, high-performance computing resources, in the process exposing the limitations of traditional infrastructure.  

Enterprises are now seeking significant upgrades and expansions to their traditional information technology (IT) infrastructure, in order to keep up with the rising demands of AI workloads.  

This has since driven considerable investment into specialized AI infrastructure and tools and services, that can now create the necessary environment for core hardware and infrastructure components to operate at their best.

  • According to Everest Group research, 81% of enterprises plan to allocate 50% or more of their infrastructure budget this year to upgrading capabilities specifically for AI

Managing Investments in the Face of Rising AI Demands and Evolving landscape 

To accelerate AI development and maintain an edge in the evolving digital landscape, enterprises are increasingly investing in core hardware and infrastructure components. This had led to the surge in demand for high-performance critical computer hardware, networking, and storage infrastructure necessary for AI computations and data management including (Graphics Processing Units) GPUs, (Tensor Processing Units) TPUs, and Virtual Storage Platforms (VSP).  

  • As per Everest Group research, 46% of enterprises prioritize upgrading computing power such as, graphics processing units (GPUs), central processing units (CPUs), and tensor processing units (TPUs), as one of their top three priorities in AI infrastructure investments

Reach out to discuss this topic in depth. 

Providers are now significantly increasing investments to upgrade their supply and secure their positions in a rapidly evolving marketplace.  

They are adopting multifaceted strategies to differentiate themselves, secure market share, and address the evolving needs of enterprises for their AI needs. As the market transforms, leading players are making bold strides in the AI arena: 

As Nvidia rides the AI wave, AMD battles to disrupt its market dominance in the GPU market 

Nvidia, known for its high-end graphics cards for gaming personal computers (PCs), has now crossed US$3 trillion in market cap, owing to the rising demand for its AI chips, critical for advanced AI infrastructure. As Nvidia stands at the forefront of the AI infrastructure market, its GPUs are indispensable for training and deploying sophisticated AI models, including OpenAI’s ChatGPT, leading to its market dominance in the GPU sector.  

While Nvidia remains a dominant force in the AI field, other competitors are gradually emerging, aiming to gain market share and driving innovation to break Nvidia’s dominance.  

AMD presents a significant challenge to Nvidia in the GPU sector and is working on providing compelling alternatives, particularly for budget-conscious buyers. AMD’s MI300 chip has gained substantial traction amongst startups, as well as with technology giants like Microsoft. It is also constantly investing in this space to bolster its position, as evidenced by its recent multi-billion-dollar acquisition of ZT Systems.

Intel – the computing giant facing challenges, but could that change soon with Gaudi 3? 

Intel, traditionally focused on CPUs, has faced challenges in gaining a strong foothold in the GPU market and has been facing stiff competition from competitors, with Nvidia surpassing Intel in annual revenue 

Intel is now intensifying its efforts to close the gap in the AI market. At the recent Intel Vision event, Intel highlighted the forthcoming release of Gaudi 3, an AI accelerator, claiming to be able to outperform Nvidia’s powerful H100 GPU in training large language models (LLMs).  

Intel also stated that the Gaudi 3 could deliver similar or even superior performance compared to Nvidia’s H200 for large language model inferencing. Additionally, it claims that Gaudi 3 is focused on reducing energy consumption and has greater power efficiency than the H100, for specific use cases.  

Intel’s strategic push to challenge Nvidia’s dominance occurs against a backdrop of persistent shortages in AI accelerator chips, which has created substantial obstacles for tech companies. 

Hyperscalers – Nvidia’s largest customers today, potential rivals tomorrow? 

Major cloud providers such as Google, Microsoft, Amazon, and Oracle, who together contribute significantly to Nvidia’s revenue, are making a strategic shift toward developing their own processors and in-house chips, to reduce dependency on Nvidia’s GPUs, as well as to drive their own innovation.  

Amazon has been rolling out its AI-focused Inferentia and Tranium chips for AI inference and training, offering these through Amazon web services (AWS), as cost-effective alternatives to Nvidia’s products.  

Google, a long-time advocate of its Tensor Processing Units (TPUs), recently introduced Trillium, its sixth generation TPU to power its AI models, which it claims is 5 times faster than its predecessor.  

Microsoft is also making strides by developing its own AI processors and chips, including the Cobalt 100 CPU, an arm-based processor used for running general purpose computer workloads on the Microsoft Cloud and Maia 100 AI Accelerator. 

Emergence of new players and trailblazing startups Disrupting the AI landscape with innovative approaches? 

Several startups are making significant strides within the AI infrastructure landscape, with their innovative approaches.  

Cerebras Systems, known for its Wafer-Scale Engine (WSE) designed for high-performance AI workloads, has recently introduced an AI inference service that it claims to be the fastest in the world.

Groq’s Language Processing Unit (LPU) stands out for its high speed in AI inference tasks, offering substantial performance gains for large language models. Groq has also recently raised $640 million for its AI chips.

Groq’s rival SambaNova, has also launched its AI inference platform SambaNova cloud. Similarly other startups like Blaize, an AI chip maker, is developing competitive AI chip technology, with its own unique focus and specialization.  

Although Nvidia holds a dominant position, Groq, Cerebras Systems, SambaNova, and other startups are emerging as serious contenders in the marketplace, offering innovative and competitive solutions. It will now be interesting to see how the new players in this space can challenge the technological giants. 

AI chips and accelerators 1

Exhibit 1: AI chips and accelerators landscape 

How to take the next steps? 

As the AI landscape continues to evolve, challenges remain, as enterprise demand for GPUs exceeds supply, leading to a shortage.  

This imbalance, combined with high demand, has also driven up GPU prices, making it challenging to find affordable alternatives. As a result, organizations are increasingly exploring alternatives to the dominant players in the AI chip and accelerators market.  

But, with so many options, it’s crucial for organizations to carefully evaluate their requirements, budget, and strategic goals, to choose the most suitable options for leveraging AI power effectively. We suggest a two-pronged approach to align organizational AI strategy: 

Assess and analyze

Assess your requirements on parameters such as:  

  • Organizational capabilities and budgetary flexibility: Assess which strategy would suit your budget – purchasing or renting GPUs. Weigh in the initial investment needed, maintenance costs, and long-term operational savings
  • AI current workload requirements: Analyze your requirements based on the types of AI workloads and business use-cases (e.g., is your need centered around high-performance training or low latency inference or both)
  • Future adaptability: Consider whether your AI workloads may evolve, necessitating reconfigurable hardware or if the efficiency of specialized chips is more important
  • Power and space: Assess your organization’s energy efficiency, hardware footprint, and power consumption needs

Align and augment

After the initial assessment and once you have a clear understanding of your AI requirements, develop a roadmap that supports your AI strategy, taking the 5 S into consideration – Scalability, Sustainability, Security, Simplicity, and Stability 

  • Ensure your AI strategy is directly aligned with business objectives, such as innovation, operational efficiency, or scaling products, while also being adaptable to future AI workloads
  • Augment existing AI infrastructure by partnering with the right vendors that can help you meet your AI workload demands

Slide2 1 

Exhibit 2: By adopting this two-pronged approach, you can effectively chart the best path to supercharge your AI strategy.  

If you found this blog interesting, check out our report, Navigating AI Infrastructure: The Backbone of the AI-Driven Era.

If you have any questions, would like to gain expertise in artificial intelligence, or would like to reach out to discuss these topics in more depth, contact Praharsh Srivastava, Zachariah Chirayil, and Tanvi Rai.

Always on Call: How to Avoid an IT Meltdown | In the News

For enterprise leaders hoping to ensure their companies can respond to outages whenever they happen, there are some essential tactics to execute, according to Mukesh Ranjan, Vice President at Everest Group.

  • Enable self-service for commonly occurring issues: Leaders can create marketplace portals, one-click resolutions, FAQs, and do-it-yourself videos contextual to company needs
  • Incorporate chatbots with embedded RPA: To address key workflows and use cases such as internet issues
  • Make resources available: During weekends and graveyard shifts, have a go-to process to respond to critical outages
  • Follow the sun model: Create rotation schedules to ensure round-the-clock resolution

Read more on CIO Dive

Is Your GBS Organization Ready for IT Infrastructure Evolution to Enable Business Transformation? | Blog

A sustained focus on digital, agility, and advanced technologies is likely to prepare enterprises for the future, especially following COVID-19. Many enterprise leaders consider IT infrastructure to be the bedrock of business transformation at a time when the service delivery model has become more virtual and cloud based. This reality presents an opportunity for GBS organizations that deliver IT infrastructure services to rethink their long-term strategies to enhance their capabilities, thereby strengthening their value propositions for their enterprises.

GBS setups with strong IT infra capabilities can lead enterprise transformation

Over the past few years, several GBS organizations have built and strengthened capabilities across a wide range of IT infrastructure services. Best-in-class GBS setups have achieved significant scale and penetration for IT infrastructure delivery and now support a wide range of functions – such as cloud migration and transformation, desktop support and virtualization, and service desk – with high maturity. In fact, some centers have scaled as high as 250-300 Full Time Equivalents (FTEs) and 35-45% penetration.

At the same time, these organizations are fraught with legacy issues that need to be addressed to unlock full value. Our research reveals that most enterprises believe that their GBS’ current IT infrastructure services model is not ready to cater to the digital capabilities necessary for targeted transformation. Only GBS organizations that evolve and strengthen their IT infrastructure capabilities will be well positioned to extend their support to newer or more enhanced IT infrastructure services delivery.

The need for an IT infrastructure revolution and what it will take

The push to transform IT infrastructure in GBS setups should be driven by a business-centric approach to global business services. To enable this shift, GBS organizations should consider a new model for IT infrastructure that focuses on improving business metrics instead of pre-defined IT Service Line Agreements (SLA) and Total Cost of Operations (TCO) management. IT infrastructure must be able to support changes ushered in by rapid device proliferation, technology disruptions, business expansions, and escalating cost pressures post-COVID-19 to showcase sustained value.

To transition to this IT infrastructure state, GBS organizations must proactively start to identify skills that have a high likelihood of being replaced / becoming obsolete, as well as emerging skills. They must also prioritize emerging skills that have a higher reskilling/upskilling potential. These goals can be achieved through a comprehensive program that proactively builds capabilities in IT services delivery.

In the exhibit below, we highlight the shelf life of basic IT services skills by comparing the upskilling/reskilling potential of IT services skills with their expected extent of replacement.

Exhibit: Analysis of the shelf life of basic IT services skills

Analysis of the shelf life of basic IT services skills

In the near future, GBS organizations should leverage Artificial Intelligence (AI), analytics, and automation to further revolutionize their IT capabilities. The end goal is to transition to a self-healing, self-configuring system that can dynamically and autonomously adapt to changing business needs, thereby creating an invisible IT infrastructure model. This invisible IT infrastructure will be highly secure, require minimal oversight, function across stacks, and continuously evolve with changing business needs. By leveraging an automation-, analytics-, and AI-led delivery of infrastructure, operations, and services management, GBS organizations can truly enable enterprises to make decisions based on business imperatives.

If you’d like to know more about the key business transformation trends for enterprises in  IT infrastructure, do read our report Exploring the Enterprise Journey Towards “Invisible” IT Infrastructure or reach out to us at [email protected] or [email protected].

With Aware Automation, Enterprises Can Achieve 35% Cost Savings as Compared to Traditional Automation Approaches—Everest Group | Press Release

72% of enterprises cite IT infrastructure services as a key hurdle to becoming digital-first enterprises; new Everest Group report describes how ‘Aware’ automation—underpinned by AI and analytics—can solve this problem

According to Everest Group, aware automation can help achieve more than 35 percent cost savings as compared to traditional automation approaches and can help enterprises realize significant improvements in business operations and user experience.

With IT infrastructure complexity at an all-time high, Everest Group has found that 72 percent of enterprises cite infrastructure services (IS) as a key hurdle in becoming a digital-first enterprise. Most enterprises believe that their IT infrastructure services are not moving fast enough to support and drive the future of their business.

“Aware” automation holds promise for resolving the challenges and complexity of traditional IT infrastructure. Aware automation is a concept wherein automation systems are underpinned by artificial intelligence (AI) and analytics, making them conscious of the environment and capable of driving self-configuring, healing and evolving IT infrastructure services.

“The trinity of analytics, automation and AI can make the infrastructure run the way business needs it to, without requiring significant oversight or bandwidth,” said Ashwin Venkatesan, practice director at Everest Group. “So, in essence, this next-generation automation can make infrastructure services ‘invisible’ rather than a glaring nightmare that causes executives to lose sleep at night. Already in the last two to three years, we’ve witnessed intelligent automation making enterprise inroads, backed by a rapid proliferation and maturation of solutions in the market.”

Everest Group offers a featured analysis of aware automation in its newly released annual report on Cloud and Infrastructure Services: “AI Stands to Make IT Infrastructure Services ‘Invisible’.”  This research deep dives into the cloud and IS landscape. It provides data-driven facts and perspectives on the overall market. The research covers cloud and IS adoption trends, demand drivers, and buyer expectations. The research analyses buyer challenges, describes trends shaping the market, and provides an outlook for 2018-2019 for the broader IT as well as cloud and IS market.

Highlights of the Cloud and IS market analysis:

  • The global information technology services (ITS) market is expected to continue its modest growth rate of approximately 2 percent per annum. The collapsing of the traditional IT stacks across the previously siloed layers of applications and infrastructure is driving the demand for consulting services.
  • Emerging technologies are disrupting the infrastructure services market. There has been increased market momentum for the adoption of these technologies that are facilitating the enterprises’ journey toward digital transformation.
  • The United States takes the lion’s share (90 percent) of the deal volume emanating from North America, which itself continues to dominate the global market share (37 percent). The Nordic region witnessed an uptick in deal volume (30 percent of the deal volume in Europe), taking over the lead from the United Kingdom.
  • While the Banking, Financial Services and Insurance (BFSI) industry dominates the ITS market share (23 to 27 percent), the healthcare and life sciences vertical witnessed an above-average growth to take over a larger share of the market (8 to 10 percent), beating the retail, distribution, and consumer packaged goods (CPG) sectors.
  • Accenture and IBM continue to dominate the ITS market.

***Download a complimentary 12-page abstract of the report here.***

Remedy for frustrations in legacy IT infrastructure contracting model | Sherpas in Blue Shirts

A significant driver motivating companies to migrate workloads out of their legacy environment into the cloud is the increasing frustration of operating under onerous, complicated services contracts. Of course, these workloads migrate to the cloud and a software-defined environment primarily for greater efficiency and agility. But many workloads are too expensive and risky to migrate and thus are better suited for maintaining in a legacy environment. So, I’m calling for a better, more rational legacy infrastructure contracting vehicle. Here’s what it would look like and how companies would benefit.

What’s wrong with the typical contract?

Large, cumbersome, difficult master services agreements (MSAs) with functional areas or towers govern the legacy IT outsourcing market. No matter the function outsourced, these legacy contracts have in common several characteristics that make them too complex and make administering these contracts incredibly complicated and frustrating.

How can we engage?

Please let us know how we can help you on your journey.

Contact Us

"*" indicates required fields

Please review our Privacy Notice and check the box below to consent to the use of Personal Data that you provide.