AI Disruption

From cloud to edge: the next wave of AI opportunity

As the focus in artificial intelligence (AI) moves beyond rapid model training toward real-world deployment, attention is increasingly shifting to inference, and how AI can deliver fast, efficient, and scalable outcomes.

The training of large language models (LLMs) requires huge clusters of AI chips and is almost exclusively done on the cloud, where hyperscalers rent out compute capacity consisting of graphical processing units (GPUs) and application-specific integrated circuits (ASICs). Inference, on the other hand, is the process of applying knowledge to real world use cases. This uses a different mix of chips with the aim of low latency (responding to requests quickly) and high throughput (processing a large number of requests).

While the training of models has progressed quickly over recent years, prioritising speed over cost efficiency, we are reaching an inflection point for AI as the advancement of LLMs has enabled the next phase of inference, and AI’s commercial application, to take centre stage. What is stark is how quickly we’ve reached this inflection point. Andrej Karpathy (founding member of OpenAI and the individual who coined “vibe coding”) stated in October 2025 that the hype around coding agents was “exaggerated”. Then, just 3 months later, he stated that:

“LLM agent capabilities (Claude & Codex especially) have crossed some kind of threshold of coherence around December 2025 and caused a phase shift in software engineering… This is easily the biggest change to my basic coding workflow in 2 decades of programming and it happened over the course of a few weeks.”

This is not the first time a frontier technology has been underestimated, but the speed of change here has been remarkable. Gartner, the research and advisory firm, has predicted that, in 2026, 55% of AI-optimised infrastructure as a service (IaaS) spending will go to inference workloads, rising to over 65% in 2029. This raises the question of what the most suitable architecture will be to optimise this next phase of inference, with edge AI –running AI models locally, or close to use cases, rather than on the cloud – being a leading candidate.

From content delivery to distributed AI infrastructure

Compared to centralised cloud computing, edge AI makes use of end devices and/or a network of points of presence that are located geographically closer to the query. In this respect, it is not a new architecture but consists of many elements previously known as the “internet of things”. The customer base of these distributed networks has traditionally been entertainment services such as gaming, and media streaming, and yet with the ongoing rapid growth in AI, using all available channels to distribute compute is becoming more important than ever. A sub-division of this architecture that is seeing a great deal of focus is the network of small datacentres, known as content delivery networks (CDNs). Due to the distributed nature of these CDNs, they now have a role to play in AI inference.

This shift from CDNs to edge AI providers can be seen in the recent USD 1.8 billion deal between Akamai and Anthropic. The legacy CDN has over 4,300 points of presence across 130 countries, and while initially created from content-delivery for entertainment services, this network also maps cleanly onto distributed inference at the edge. This example highlights how edge AI will grow to complement cloud AI, becoming a vital tool in the efficient deployment of AI inference. The benefits of shifting inference to the edge include lower latency, increased throughput, availability, privacy, fewer bandwidth bottlenecks, and reduced cost per query due to the lower marginal cost of inference on local devices.

While AI models with a significant degree of complexity must be processed on the cloud due to compute limitations on the edge, inference for more focused applications, where speed of inference and high throughput are key determinants of efficacy, could be optimised on the edge. Examples include, but are not limited to:

  • Autonomous vehicles – benefit through speed of inference and high throughput
  • Wearables/AR/VR – benefit through constant availability
  • Industrial Robotics – benefit through availability and cost per query
  • Healthcare – benefit through privacy and fewer bandwidth bottlenecks

A bifurcated, tiered inference architecture will likely emerge where low-cost, latency-sensitive tasks are executed on the edge, while more complex workloads remain in the cloud, with routing determined by compute intensity. As efficiency gains expand capabilities across both layers, competitive advantage is likely to shift toward workload routing and cost optimisation rather than model scale alone.

New opportunities across the AI value chain

The shift to inference on the edge is creating new investment opportunities, alongside expanding existing investment cases. And many of these opportunities are aligned with the themes in global tech that we have already identified.

The clearest new opportunity is with the transition of CDNs into edge AI providers. Such providers will get a boost from increased revenue growth for their network services, going from commoditised bandwidth providers to critical infrastructure for real-time inference. This also pairs well with these providers’ existing services, as many also offer cybersecurity and networking functionality. The shift would also likely come with a re-rating, something which has occurred across many other subsectors heavily exposed to AI.

Another existing investment opportunity and Theme of 2025 for the Global Technology team that will likely get renewed impetus is the optical networking industry which facilitates the high-speed interconnection of distribution networks. This industry will benefit from expansion of the AI network from the central hyperscaler cloud architecture to the vastly expanded edge network, encompassing thousands of points of presence. Optical networking will enable the increased throughput and speed of traffic which accompanies inference, and in particular inference from distributed edge networks.

Inference on the edge will also further drive the revival of analog semiconductors as AI begins to interact with the real world. The growth of analog semiconductor content per device, along with a greater number of distributed endpoints from inference on the edge, will further propel this revival. This was one of the Global Technology team’s Themes for 2026 and we remain confident in this theme playing out.

Finally, where cloud architecture has caused a huge supercycle in memory with a focus on HBM and DRAM which optimises for throughput, inference on the edge could see a broadening in the demand for different types of memory. The adoption of KV Cache (Key-Value Cache) as AI’s short-term working memory during a conversation highlights this as without it, the AI would act like it has amnesia, forgetting the beginning of your prompt every time it types a new word. The KV Cache must be saved in the computer’s fastest memory (DRAM) so the AI can look at it instantly, reducing latency. Regardless of the optimal memory for inference at the edge, the likely outcome is additional fuel for the memory supercycle and a potential broadening of memory demand. This is another Theme for 2026 within the Global Technology team.

Together, these developments highlight how inference on the edge is not just a technological shift, but a structural expansion of the AI ecosystem, unlocking new efficiencies, applications, and investment opportunities across the value chain.

Information herein is based on sources we believe to be accurate and reliable as at the date it was made. We reserve the right to revise any information herein at any time without notice. No offer or solicitation to buy or sell securities and no investment advice or recommendation is made herein. In making investment decisions, investors should not rely solely on this material but should seek independent professional advice. However, if you choose not to seek professional advice, you should consider the suitability of the product for yourself. Investment involves risks including the possible loss of principal amount invested and risks associated with investment in emerging and less developed markets. Past performance of the fund manager(s), or any prediction, projection or forecast, is not indicative of future performance. This material has not been reviewed by any regulatory authorities.

Issuer:
Hong Kong – Allianz Global Investors Asia Pacific Ltd.

5661105

Allianz Global Investors

You are leaving this website and being re-directed to the below website. This does not imply any approval or endorsement of the information by Allianz Global Investors Asia Pacific Limited contained in the redirected website nor does Allianz Global Investors Asia Pacific Limited accept any responsibility or liability in connection with this hyperlink and the information contained herein. Please keep in mind that the redirected website may contain funds and strategies not authorized for offering to the public in your jurisdiction. Besides, please also take note on the redirected website’s terms and conditions, privacy and security policies, or other legal information. By clicking “Continue”, you confirm you acknowledge the details mentioned above and would like to continue accessing the redirected website. Please click “Stay here” if you have any concerns.

Welcome to Allianz Global Investors

Select your language
  • 中文(繁體)
  • English
Select Role
  • Individual Investor
  • Intermediaries
  • Other Investors
  • Pension Investors
  • Allianz Global Investors Fund (“AGIF”)

    • Allianz Global Investors Fund (“AGIF”) as an umbrella fund under the UCITS regulations has within it different Sub-Funds investing in fixed income securities, equities, and derivative instruments, each with a different investment objective and/or risk profile.

    • All Sub-Funds may invest in financial derivative instruments (“FDI”) which may expose to higher leverage, counterparty, liquidity, valuation, volatility, market and over the counter transaction risks. A Sub-Fund’s net derivative exposure may be up to 50% of its net asset value (“NAV”).
      • Some Sub-Funds as part of their investments may invest in any one or a combination of the instruments such as fixed income securities, emerging market securities, and/or mortgage-backed securities, asset-backed securities, property-backed securities (especially REITs) and/or structured products and/or FDI, exposing to various potential risks (including leverage, counterparty, liquidity, valuation, volatility, market, fluctuations in the value of and the rental income received in respect of the underlying property, and over the counter transaction risks).

    • Some Sub-Funds may invest in single countries or industry sectors (in particular small/mid cap companies) which may reduce risk diversification. Some Sub-Funds are exposed to significant risks which include investment/general market, country and region, emerging market (such as Mainland China), creditworthiness/credit rating/downgrading, default, asset allocation, interest rate, volatility and liquidity, counterparty, sovereign debt, valuation, credit rating agency, company-specific, currency (in particular RMB), RMB debt securities and Mainland China tax risks.

    • Some Sub-Funds may invest in convertible bonds, high-yield, non-investment grade investments and unrated securities that may be subject to higher risks (including volatility, loss of principal and interest, creditworthiness and downgrading, default, interest rate, general market and liquidity risks) and therefore may adversely impact the net asset value of the Sub-Funds. Convertibles will be exposed prepayment risk, equity movement and greater volatility than straight bond investments.

    • Some Sub-Funds may invest a significant portion of the assets in interest-bearing securities issued or guaranteed by a non-investment grade sovereign issuer (e.g. Philippines) and is subject to higher risks of liquidity, credit, concentration and default of the sovereign issuer as well as greater volatility and higher risk profile that may result in significant losses to the investors.

    • Some Sub-Funds may invest in European countries. The economic and financial difficulties in Europe may get worse and adversely affect the Sub-Funds (such as increased volatility, liquidity and currency risks associated with investments in Europe).

    • Some Sub-Funds may invest in the China A-Shares market, China B-Shares market and/or debt securities directly via the Stock Connect or the China Interbank Bond Market or Bond Connect or other foreign access regimes and/or other permitted means and/or indirectly through all eligible instruments and thus is subject to the associated risks (including quota limitations, change in rule and regulations, repatriation of the Sub-Fund’s monies, trade restrictions, clearing and settlement, China market volatility and uncertainty, China market volatility and uncertainty, potential clearing and/or settlement difficulties, change in economic, social and political policy in the PRC and Mainland China tax risks).

    • Some Sub-Funds may adopt the following strategies, Socially Responsible Investment (Proprietary Scoring) Strategy, SDG-Aligned Strategy, Sustainability Key Performance Indicator Strategy (Relative), Green Bond Strategy, Multi Asset Sustainable Strategy, Sustainability Key Performance Indicator Strategy (Absolute Threshold), Environment, Social and Governance (“ESG”) Score Strategy, and Sustainability Key Performance Indicator Strategy (Absolute). The Sub-Funds may be exposed to sustainable investment risks relating to the strategies (such as foregoing opportunities to buy certain securities when it might otherwise be advantageous to do so, selling securities when it might be disadvantageous to do so, and/or relying on information and data from third party ESG research data providers and internal analyses which may be subjective, incomplete, inaccurate or unavailable and/or reducing risk diversifications compared to broadly based funds). Also, some Sub-Funds may be particularly focusing on the greenhouse gas emissions (“GHG”) efficiency of the investee companies rather than their financial performance. These may have an adverse impact on the performance of the Sub-Funds.

    • Some Sub-Funds may invest in share class with fixed distribution percentage (Class AMf). Investors should note that fixed distribution percentage is not guaranteed. The share class is not an alternative to fixed interest paying investment. The percentage of distributions paid by these share classes is unrelated to expected or past income or returns of these share classes or the Sub-Funds. Distribution will continue even the Sub-Fund has negative returns and may adversely impact the net asset value of the Sub-Fund. Positive distribution yield does not imply positive return.

    • Investment involves risks that could result in loss of part or entire amount of investors’ investment.

    • In making investment decisions, investors should not rely solely on this [website/material].

       

    Note: Dividend payments may, at the sole discretion of the Investment Manager, be made out of the Sub-Fund’s capital or effectively out of the Sub-Fund’s capital which represents a return or withdrawal of part of the amount investors originally invested and/or capital gains attributable to the original investment. This may result in an immediate decrease in the NAV per share and the capital of the Sub-Fund available for investment in the future and capital growth may be reduced, in particular for hedged share classes for which the distribution amount and NAV of any hedged share classes (HSC) may be adversely affected by differences in the interests rates of the reference currency of the HSC and the base currency of the respective Sub-Fund, particularly if such HSC are applying the IRD Neutral Policy. Dividend payments are applicable for Class A/AM/AMg/AMi/AMgi/AQ Dis (Annually/Monthly/Quarterly distribution) and for reference only but not guaranteed. Positive distribution yield does not imply positive return. For details, please refer to the Sub-Fund’s distribution policy disclosed in the offering documents.

     


    Allianz Global Investors Asia Fund

    • Allianz Global Investors Asia Fund (the “Trust”) is an umbrella unit trust constituted under the laws of Hong Kong pursuant to the Trust Deed. Allianz Thematic Income and Allianz Selection Income and Growth and Allianz Yield Plus Fund are the sub-funds of the Trust (each a “Sub-Fund”) investing in fixed income securities, equities and derivative instruments, each with a different investment objective and/or risk profile.

    • Some Sub-Funds are exposed to significant risks which include investment/general market, company-specific, emerging market, creditworthiness/credit rating/downgrading, default, volatility and liquidity, valuation, sovereign debt, thematic concentration, thematic-based investment strategy, counterparty, interest rate changes, country and region, asset allocation risks and currency (such as exchange controls, in particular RMB), and the adverse impact on RMB share classes due to currency depreciation.

    • A Sub-Fund may invest in asset-backed securities (“ABS”) and mortgage-backed securities (“MBS”) which may be highly illiquid and prone to substantial price volatility. These instruments may be subject to greater general market risk, concentration risk, credit and counterparty default risk, liquidity risk and interest rate risk compared to other debt securities.

    • Some Sub-Funds may invest in high-yield (non-investment grade and unrated) investments and convertible bonds which may be subject to higher risks, such as volatility, creditworthiness, default, interest rate changes, general market and liquidity risks and therefore may adversely impact the net asset value of the Sub-Fund.

    • All Sub-Funds may invest in financial derivative instruments (“FDI”) which may expose to higher leverage, counterparty, liquidity, valuation, volatility, market and over the counter transaction risks. The use of derivatives may result in losses to the Sub-Funds which are greater than the amount originally invested. A Sub-Fund’s net derivative exposure may be up to 50% of its net asset value (“NAV”).

    • These investments may involve risks that could result in loss of part or entire amount of investors’ investment.

    • In making investment decisions, investors should not rely solely on this website.

    Note: Dividend payments may, at the sole discretion of the Investment Manager, be made out of the Sub-Fund’s income and/or capital which in the latter case represents a return or withdrawal of part of the amount investors originally invested and/or capital gains attributable to the original investment. This may result in an immediate decrease in the NAV per distribution unit and the capital of the Sub-Fund available for investment in the future and capital growth may be reduced, in particular for hedged share classes for which the distribution amount and NAV of any hedged share classes (HSC) may be adversely affected by differences in the interests rates of the reference currency of the HSC and the base currency of the Sub-Fund, particularly if such HSC are applying the IRD Neutral Policy. Dividend payments are applicable for Class A/AM/AMg/AMi/AMgi Dis (Annually/Monthly distribution) and for reference only but not guaranteed. Positive distribution yield does not imply positive return. For details, please refer to the Sub-Fund’s distribution policy disclosed in the offering documents.

     

Please indicate you have read and understood the Important Notice.