Sekėjai

Ieškoti šiame dienoraštyje

2025 m. liepos 2 d., trečiadienis

When intelligence is available, replicable, and scalable on an almost unimaginable scale


"Is Germany well prepared for the next AI breakthrough? Unfortunately not.

"The Government knows AGI is coming" - this was the title of a recent interview in the New York Times with Ben Buchanan, the White House's top AI advisor during Joe Biden's presidency. AGI stands for "Artificial General Intelligence," meaning artificial intelligence (AI) that is cognitively equal to or superior to humans in virtually every respect. Buchanan describes how his team in the White House worked to prepare the American government for AGI. And that he expects AGI to arrive during Donald Trump's presidency.

 

Artificial intelligence that achieves at least human-level capabilities within the next four years - this is a scenario that many in expert circles now consider possible. Nobel laureate and AI researcher Geoffrey Hinton shares this view, as does his colleague Yoshua Bengio and the CEOs of the world's leading AI companies. If they are right and this scenario occurs, would Germany be prepared? Is the federal government preparing for it?

 

It doesn't look like it. A stringent political AI strategy is not yet apparent. Instead of actively shaping the transition to the AI ​​era and preparing for what is possibly the most profound technological transformation since the industrial revolution, the new federal government has so far treated artificial intelligence more like an exciting future topic among many digital topics. Elsewhere, however, AI-based value creation has not only been taking place for a long time. Among decision-makers in Washington, London, or San Francisco, the topic of AGI is also omnipresent.

 

Many political strategies and business models there are based on the assumption that intelligence will soon be available, replicable, and scalable on an almost unimaginable scale. The impact of such a scaling of artificial intelligence on the economy, scientific progress, and global power relations will be enormous.

 

There is uncertainty about exactly when AGI will be achieved. Some experts are less optimistic than Hinton, Bengio, and entrepreneurs like Sam Altman of Open AI or Dario. Amodei from Anthropic. They expect that it will take at least one more major technological innovation before machine intelligence will catch up with human intelligence in every respect. But even if it takes another ten to twenty years, Germany is completely inadequately prepared.

 

While other countries are mobilizing enormous sums to expand AI infrastructure, far too little is happening in Germany, and it's happening far too slowly.

 

In the USA, for example, $500 billion is to be invested in American AI infrastructure over the next four years as part of the so-called Stargate project alone. Fear of competition from China, currently in second place in the global AI race, is a driving factor. At the same time, Donald Trump is concluding deals for massive data centers with the United Arab Emirates and Saudi Arabia, which are likely to soon join the ranks of the most important AI nations.

 

Europe is a bystander, and even within Europe, Germany is still one of the more sluggish AI players. The AI ​​gigafactory announced in the coalition agreement – ​​a data center worth three to five billion euros, with some of the financing still unclear – lags far behind the efforts of other countries. For example, Great Britain is currently introducing "AI growth zones" where data centers can be built particularly quickly, and in France, President Macron has made AI a top priority and announced investments in at least the tens of billions.

 

There is hope that important competencies are being pooled in the new Digital Ministry and that there is a spirit of optimism. Through smart and decisive action, Germany could find and maintain its place in the AI ​​world.

 

Three priorities would have to be at the heart of such an AI policy.

 

1. Strong and resilient infrastructure

 

AI relies on enormous amounts of computing power. Data centers with special AI chips such as GPUs (Graphics Processing Units) are thus becoming a core strategic resource. Germany currently has individual flagship projects such as the JUPITER supercomputer at the Jülich Research Center, which is equipped with approximately 24,000 GPUs. By international comparison, however, this is more of a third-league level: large tech companies in the USA often have several hundred thousand GPUs alone.

 

The coalition agreement of the new federal government now envisages expanding at least one supercomputer into an "AI gigafactory" with 100,000 GPUs. This would be a step towards more AI sovereignty. However, for this infrastructure expansion to be truly successful, several key points must be considered.

 

First, computing capacity is needed not only for the development of new AI models ("training"), but also for the operation of existing AI models ("inference"). The latter will be essential, especially in a world with AGI, in order to be able to widely use the available, extremely powerful AI for all possible processes in industry and administration. Countries whose infrastructure is equipped for such scaling and can bring intelligence into comprehensive applications could leave the rest of the world behind. Therefore, Germany needs as many data centers as possible that can be used for inference – and that make the country more independent of foreign cloud providers.

 

Second, the construction of data centers, whether for training or inference, must be carried out at high speed. Approval procedures must be accelerated, and the necessary energy infrastructure must be expanded with the highest priority. This is because AI data centers require enormous amounts of electricity and maximum grid stability. Currently, the German energy grid would be overwhelmed by the operation of several such data centers. While efficiency gains are expected in the coming years, reducing the energy consumption per AI deployment, increasing demand for AI is likely to lead to a further, possibly rapid, increase in energy demand overall. If the German government is serious about its goal of making Germany an "AI nation," it must – as announced by Economics Minister Katherina Reiche – prioritize expanding the power grid and ensuring affordable energy. The "AI Growth Zones" announced by the British government demonstrate how this can work in practice: Together with local authorities, the government identifies promising locations that are particularly suitable for the construction of data centers and the strengthening of the necessary energy infrastructure. Simplified planning processes then help increase the available power capacity locally.

 

Thirdly, new AI data centers should be built with a particularly high level of security. AI models will increasingly become the target of massive cyberattacks financed by hostile states in the coming years. Even the most powerful AI models are essentially just a collection of many numbers, so-called parameters. If hackers succeed in stealing these parameters, they have access to the same AI capabilities as the developers – and can use them for military purposes, for example. Secure data centers are therefore of outstanding economic and geostrategic importance. However, according to a report by the consulting firm Gladstone, no AI data center is currently adequately protected against sabotage by foreign states. Establishing effective protection could take years. The security of highly sensitive information must therefore be planned from the outset when building AI data centers.

 

Last but not least, a strong ecosystem of companies is needed that convert computing power into innovation and growth. The fact that leading German corporations, including Siemens and SAP, want to join forces to build an AI gigafactory for the development and use of AI is a positive sign. Overall, however, the German economy is often still hesitant. The German AI startup scene could also be more vibrant, despite positive examples such as DeepL, Parloa, and Helsing. Policymakers can play a proactive role here: Regulations and real-world laboratories implemented with a sense of proportion help simplify innovation and put AI into practice. Startups and small and medium-sized enterprises also need easier access to capital and public computing power. Above all, the federal government should facilitate access to private capital, for example, by providing financial incentives for pension funds and companies to invest in AI startups.

 

If these points are taken into account, Germany can lay the foundation for an AI future with new data centers in which hardly any economic process can function without AI. This is explicitly not about self-sufficiency, as that is simply not realistic: there is currently no way around non-European components from countries such as the USA or Taiwan. The federal government must therefore work with the EU to secure the supply of high-performance chips through trade policy. The White House reportedly intends to renegotiate with each country soon about whether and how AI chips will be exported from the US. A positive outcome for Germany should be a priority for the new federal government.

 

2. Leveraging economic strengths

 

Even if all of the measures mentioned so far were consistently implemented, Germany as a location for AI would lag far behind the rest of the world’s leading AI nations for the foreseeable future. It is all the more important that Germany manages to compete above its own weight class. This requires cleverly leveraging its existing strengths. German companies can occupy important niches and thus make themselves irreplaceable at key points in the global AI value chain. Three core areas offer themselves:

 

First, the broadly diversified German industry – from mechanical and plant engineering to robotics and medical technology – is likely to be very well suited to transforming the "intelligence on demand" available in an AI world into real value creation, even if the strongest AI models continue to be developed in the USA. Even if an American AI system is used to design novel vaccines or high-tech devices, prototypes still need to be built, improved, and products mass-produced. Germany, in particular, has a competitive advantage over countries like the USA thanks to its industrial capacity. The more digital the production processes of German companies are and the more closely they are integrated with AI, the more effectively this trump card can be played.

 

Second, German companies have a wealth of industrial data. With this data, they can develop highly specialized models – as Siemens recently did, for example – that make industrial processes more efficient and resource-efficient. There is enormous potential here, especially in energy-intensive sectors.

 

And third, German universities train excellent AI talent and have strong basic research capabilities. Nevertheless, Germany has been losing many AI talents to the US for years. This brain drain must be stopped, and Germany should actively recruit AI talent currently working abroad. For many of them, it's not primarily about the money, but rather about working on exciting projects and making a difference in the world through innovative products. The conditions for this must be created now: through easier spin-offs, more capital, practice-oriented implementation of regulations such as the European AI Regulation, and unbureaucratic access to computing power. With a rapid expansion of computing capacity, German research could also venture into ambitious moonshot projects. To this end, Germany could also join forces with other democratic "middle powers" such as the United Kingdom, Canada, Australia, and other European countries to pool resources, thus working toward a "third pole" on the AI ​​map beyond the USA and China.

 

3. Governmental Capacity

 

The fact that Germany has not yet tackled the challenges in the field of AI with sufficient rigor is likely also due to the lack of concentrated technological expertise within the government apparatus. For years, AI was treated as a secondary concern, distributed decentrally across the ministries. The new Federal Ministry for Digital and State Modernization (BMDS), with clear AI expertise, is a step in the right direction and should be able to assume a central AI leadership role within the federal government over the course of the legislative term. But that alone is not enough. AI must be viewed as a core strategic issue that goes far beyond the intersection with state modernization and affects every area of ​​life.

 

State capacity is needed not only to make Germany competitive again in the field of AI, but also to avoid societal upheaval in the AGI era and safeguard national security. A report recently published by internationally recognized experts on behalf of the United Nations and 30 leading AI nations addresses, in addition to already visible dangers such as deepfakes, a number of other risks that could soon arise: AI-assisted cyberattacks, widespread job losses due to automation, or even a complete loss of control over increasingly autonomous AI systems.

 

If AGI were truly achieved in the next few years, such risks could become highly relevant during this legislative period. Germany would then be left without any security. And even if no risks to national security materialize, Germany would still be at acute risk of being completely left behind by other countries.

 

Perhaps the current moment is comparable to January 2020, when the coronavirus began to spread, but many decision-makers were not sufficiently aware that something profoundly disruptive, something world-changing, was happening. How can Germany avoid looking back on the summer of 2025 in a few years and realizing that it should have built the necessary resilience for the looming AGI era much more quickly and decisively.

 

A look at the United Kingdom shows how this could work. The British government began in 2023 to establish a world-leading AI Security Institute (AISI) as soon as possible, which will research the impact of AI on national security and advise the government. Germany should now follow suit and establish a German AI Security Institute. The new Digital Minister, Karsten Wildberger, could attach such an advisory institute to his ministry, and he should ensure that it is granted sufficient autonomy and is managed in an entrepreneurial and agile manner. Furthermore, "AGI Preparedness" should become one of the main goals of the National Security Council announced by Chancellor Friedrich Merz. It is crucial that the Chancellor makes AI a top priority and personally makes it clear to all stakeholders that he expects this issue to be addressed at the highest speed and given the highest priority. The only reason Great Britain currently has the world's best "state capacity" in the field of AI is because Rishi Sunak made this project a personal top priority and pushed it through, even in the face of resistance.

 

Germany urgently needs a strategy for the AGI era. It requires infrastructure and a wise use of the few assets our country has in the field of AI. And it requires "state capacity." Above all, however, political leaders must understand that with AI, we may find ourselves in a situation similar to that of January 2020 with Covid. "The US government knows AGI is coming," and many others know it too and are preparing for it. This must now happen in Berlin as well.

 

Monika Schnitzer is a professor at Ludwig Maximilian University in Munich and chair of the German Council of Economic Experts.

 

Daniel Privitera is Executive Director of the KIRA Center and lead writer of the International AI Safety Report.” [1]

 

The mountain gives birth to a mouse: a German AI Security Institute.

 

1. Wenn Intelligenz in kaum vorstellbarem Ausmaß verfügbar, kopierbar und skalierbar ist. Frankfurter Allgemeine Zeitung; Frankfurt. 16 June 2025: 18.  Von Monika Schnitzer und Daniel Privitera

Dirbtinio intelekto vaidmuo didina JAV technologijų lyderių atlyginimus --- JAV įmonių informacinių technologijų vadovų bazinis atlyginimas didėja bent 20 %


„Aukščiausio lygio JAV įmonių technologijų vadovai gauna didesnius atlyginimus, nes dirbtinis intelektas plečia jų vadovavimo ir atsakomybės apimtį.

 

Nors daugelis geriausiai apmokamų technologijų vadovų yra technologijų vadovai, taip pat auga gerai apmokamų informacinių technologijų vadovų grupė viešosiose bendrovėse, rodo vadovų ir valdybų atlyginimų analizės įmonės „C-Suite Comp“ duomenys.

 

Remiantis įmonės atlikta 3 930 viešųjų bendrovių apžvalga, vidutinis technologijų vadovų atlyginimas 2024 m., palyginti su praėjusiais metais, išaugo 30,81 % ir siekė maždaug 2,4 mln. USD.

 

Tuo tarpu IT vadovų bazinis atlyginimas auga maždaug 20–30 %, teigia IT vadovų įdarbinimo įmonės „Heller Search Associates“ generalinė direktorė Marthos Heller. Tai panašu į vadovų įdarbinimo ir konsultacijų įmonės „Korn Ferry“ išvadas, kurios teigė, kad bendras IT vadovų atlyginimas didėja... 15–25 %.

 

Remiantis „C-Suite Comp“ duomenimis, 10 geriausiai apmokamų, šiuo metu dirbančių IT vadovų yra iš įvairių sektorių, įskaitant finansines paslaugas, mažmeninę prekybą, sveikatos priežiūrą ir logistiką.

 

Kai kurios iš bendrovių, gaunančių didžiausias išmokas, yra finansinių paslaugų gigantai „Wells Fargo“ ir „Visa“, sandėlių klubų tinklas „Costco Wholesale“ ir sveikatos priežiūros produktų pardavėja „Solventum“. Keturios IT vadovai yra moterys. Visos bendrovės, išskyrus dvi – sveikatos priežiūros personalo atrankos įmonė „AMN Healthcare Services“ ir „Lineage“, didžiausia pasaulyje šaldytų sandėlių tiekėja – yra įtrauktos į „S&P 500“.

 

Vienas iš veiksnių, lemiančių didesnius IT vadovų ir kitų technologijų vadovų atlyginimus, yra dirbtinio intelekto svarba įmonėms – nesvarbu, ar jos naudoja technologijas savo organizacijose, ar diegia jas savo produktams ir paslaugoms klientams transformuoti.

 

Technologijų lyderiams daromas vis didesnis spaudimas parodyti, kad dirbtinis intelektas gali duoti verslo rezultatų.

 

„Bendras atlyginimas didėja todėl, kad baimė, netikrumas, abejonės ir neviltis pateko į valdybą ir vadovų sudėtį“, – sakė Helleris. „Jie pagaliau, stumdami „DI, supraskite, kad šis IT vadovo vaidmuo yra svarbus.“

 

Kita priežastis, kodėl IT vadovų atlyginimai kyla: šie technologijų vadovai tiesiog prisiima daugiau darbo ir kartu su juo gauna daugiau pareigų.

 

IT vadovai, tradiciškai priskirti IT sistemų, tokių kaip duomenų centrai, debesų kompiuterija ir verslo programinė įranga, valdymui, ir toliau prisiima vis didesnę atsakomybės už verslo vadovavimą dalį.

 

„Technologijų lyderiai tapo protingesni“, – sakė Alyse Egol, vyresnioji „Korn Ferry“ skaitmeninių, technologijų ir saugumo pareigūnų praktikos klientų partnerė. „Jie sako: „Mano vertė yra daugiau nei vien tik mašinos veikimo palaikymas.“

 

Tiems IT vadovams, kurie gauna didžiausius atlyginimus, tai reiškia, kad padidėja jų turimų pareigų skaičius.

 

Septyni iš dešimties geriausiai apmokamų IT vadovų turi daugiau nei vieną pareigybę, šeši iš jų turi dvigubas pareigas – IT vadovo ir vyriausiojo skaitmeninio vadovo. Du iš geriausiai apmokamų IT vadovų taip pat turi vyriausiojo transformacijos vadovo pareigas.

 

„FedEx“ atstovas Sriram Krishnasamy turi vyriausiojo skaitmeninio ir informacijos vadovo bei vyriausiojo transformacijos vadovo. „AMN Healthcare“ atstovas Markas Haganas eina vyriausiojo informacinių technologijų vadovo ir vyriausiojo skaitmeninių technologijų vadovo pareigas, o 2024 m. lapkritį, prieš atvykstant finansų direktoriui ir operacijų vadovui, ėjo papildomas pareigas už operacijas, teigė bendrovė.

 

Šios pareigos paprastai yra platesnio masto nei tradicinės IT vadovo pareigos, o dėl technologijų strategijos nustatymo atsiranda papildoma atsakomybė už verslo rezultatus, teigia vadovų paieškos specialistai.

 

„Biudžeto požiūriu, tai dvi funkcinės lyderystės už vieno kainą“, – sakė Helleris. „Kai kas nors atsiskaito generaliniam direktoriui ir turi daug skirtingų pareigų, tai reiškia, kad generalinis direktorius jį laiko partneriu.“

 

Tuo pačiu metu ne visas technologijų pareigas didelėse įmonėse visada atlieka vienas asmuo.

 

Pusė iš daugiau nei 600 vyresniųjų JAV technologijų lyderių, kuriuos neseniai apklausė „Deloitte“, teigė, kad jų organizacijoje yra keturi ar daugiau technologijų C lygio vadovų. „Deloitte“ yra „CIO Journal“ rėmėja.

 

O įmonės, kurios samdo kelis technologijų vadovus, linkusios paskirstyti savo atlyginimus tarp technologijų sričių. vadovai, taip sumažindami šių vadovų atlyginimus, teigė konsultacijų įmonės „Janco Associates“ generalinis direktorius Viktoras Janulaitis.

 

Nors dirbtinis intelektas padidino kai kurių IT vadovų atlyginimus, „Korn Ferry“ atstovas Egolas teigė, kad ne kiekvienai įmonei aišku, kaip susieti dirbtinio intelekto projektus su technologijų vadovų atlyginimais.

 

„Įmonės vis dar bando išsiaiškinti, ką su tuo daryti ir kas turi tikros patirties“, – sakė ji. „Vis dar neaišku, kaip tai ilgainiui bus susieta su atlyginimais.“" [1]

 

1. AI Roles Boost Salary For U.S. Tech Leaders --- Chief information officers' base pay at U.S. companies rises by at least 20%. Lin, Belle.  Wall Street Journal, Eastern edition; New York, N.Y.. 02 July 2025: B4. 

AI Roles Boost Salary For U.S. Tech Leaders --- Chief information officers' base pay at U.S. companies rises by at least 20%


“Top technology leaders at U.S. companies are getting bigger paychecks as artificial intelligence is expanding the scope of their leadership and responsibilities.

 

While many of the highest paid technology leaders are chief technology officers, there is also a growing cohort of well-compensated chief information officers at public companies, according to data from C-Suite Comp, an executive and board pay analytics firm.

 

The median pay for CTOs rose 30.81% in 2024 from the previous year, to roughly $2.4 million, according to the firm's review of 3,930 public companies.

 

The base compensation for CIOs, meanwhile, is growing by roughly 20% to 30% according to Martha Heller, chief executive of IT executive recruiting firm Heller Search Associates. That is similar to findings from executive recruiter and consulting firm Korn Ferry, which said total compensation for CIOs is increasing by 15% to 25%.

 

The 10 top-paid, currently employed IT executives, according to C-Suite Comp's data, are from a variety of sectors, including financial services, retail, healthcare and logistics.

 

Some of the companies making the biggest payouts include financial services giants Wells Fargo and Visa, the warehouse-club chain Costco Wholesale, and healthcare products seller Solventum. Four of the CIOs are women. All but two of the companies -- healthcare staffing firm AMN Healthcare Services and Lineage, the world's largest refrigerated-warehousing provider -- are in the S&P 500.

 

One factor driving higher CIO and other tech execs' pay is the importance of AI for businesses -- whether using the technology inside their organizations or deploying it to transform their products and services for customers.

 

And there has been mounting pressure on technology leaders to show AI can produce business results.

 

"The reason total compensation is rising is because fear, uncertainty, doubt and desperation has entered the board and executive suite," Heller said. "They finally, with a push in AI, understand that this CIO role is important."

 

Another reason CIO compensation is going up: These technology executives are simply taking on more work, and with it, adding more titles.

 

CIOs, traditionally relegated to managing back-office information-technology systems like data centers, cloud computing and business software, continue to take an increasing share of responsibility for business leadership.

 

"Technology leaders have wised up," said Alyse Egol, a senior client partner of Korn Ferry's digital, technology and security officers practice. "They're saying, 'My value is above and beyond just keeping the machine running.'"

 

For the CIOs taking home the biggest paychecks, that comes with an increase in the number of titles they hold.

 

Seven of the 10 top-paid CIOs have more than one title, with six holding the dual titles of CIO and chief digital officer. Two of the top-paid CIOs also hold the title chief transformation officer.

 

FedEx's Sriram Krishnasamy holds the titles of chief digital and information officer, and chief transformation officer. AMN Healthcare's Mark Hagan holds the titles of chief information officer and chief digital officer, and in 2024, held additional responsibilities for operations before the arrival of its CFO and COO in November 2024, the company said.

 

Those roles tend to be more wide-ranging than traditional CIO roles, with added responsibility for business outcomes as a result of setting tech strategy, executive headhunters say.

 

"From a budget perspective, it's two functional leaderships for the price of one," said Heller. "When somebody reports to a CEO and they have a bunch of different titles, it means the CEO is considering them to be a partner."

 

At the same time, not all tech roles at large enterprises are always held by one person.

 

Half of the over 600 senior-level U.S. technology leaders recently surveyed by Deloitte said they have four or more tech C-suite leaders at their organization. Deloitte is a sponsor of CIO Journal.

 

And the companies that do employ several tech execs tend to spread out their paychecks between technology leaders, putting a damper on those executives' compensation, said Victor Janulaitis, CEO of consulting firm Janco Associates.

 

While AI has boosted the paychecks of some CIOs, Korn Ferry's Egol said it isn't yet clear for every company how to tie AI projects with tech execs' compensation.

 

"Companies are still trying to figure out what to do with it, and who has the real experience," she said. "The jury's still out about how it's really going to tie into compensation in the long run."” [1]

 

1. AI Roles Boost Salary For U.S. Tech Leaders --- Chief information officers' base pay at U.S. companies rises by at least 20%. Lin, Belle.  Wall Street Journal, Eastern edition; New York, N.Y.. 02 July 2025: B4. 

Kinijos dirbtinio intelekto lustų gamintojai planuoja pradinius viešus akcijų siūlymus


„Dvi Kinijos dirbtinio intelekto lustų bendrovės siekia pritraukti 1,66 mlrd. JAV dolerių per pradinius viešus akcijų siūlymus (IPO), Kinijai dedant daugiau pastangų, siekiant lustų nepriklausomybės, didėjant JAV ir Kinijos technologijų konkurencijai.

 

Pekine įsikūrusi „Moore Threads“ planuoja pritraukti 8 mlrd. juanių, tai atitinka 1,12 mlrd. JAV dolerių, o Šanchajuje įsikūrusi „MetaX“ siekia 3,9 mlrd. juanių, teigiama pirmadienį Šanchajaus vertybinių popierių biržai pateiktuose prospektuose.

 

2020 m. buvusio „Nvidia“ vadovo Zhang Jianzhongo įkurta „Moore Threads“ specializuojasi grafikos apdorojimo įrenginių, skirtų dirbtinio intelekto mokymams, projektavime. Bendrovė planuoja panaudoti IPO lėšas naujų dirbtinio intelekto lustų tyrimams ir plėtrai finansuoti bei apyvartiniam kapitalui didinti.

 

„MetaX“, kurią taip pat 2020 m. įkūrė buvę AMD darbuotojai, įskaitant pirmininką Chen Weiliang, daugiausia dėmesio skiria pilno paketo [1] GPU lustams ir susijusiems sprendimams. Ji ketina panaudoti lėšas didelio našumo GPU tyrimams ir plėtrai remti.

 

Abi bendrovės siekia įtraukti savo akcijas į Šanchajaus STAR rinką – į technologijas orientuotą Šanchajaus biržos valdybą.

 

„Moore Threads“ buvo įtraukta į JAV subjektų sąrašą 2023 m. spalio mėn., o tai riboja jos prieigą prie amerikietiškų technologijų ir įrangos.

 

Nepaisant spartaus pajamų augimo, abi bendrovės ir toliau patiria didelių nuostolių, nes plečiasi ir daug investuoja į mokslinius tyrimus ir plėtrą.

 

„Moore Threads“ pajamos 2024 m. išaugo daugiau, nei tris kartus, iki 438,85 mln. juanių, o grynasis nuostolis sumažėjo, bet išliko 1,49 mlrd. juanių.

 

„MetaX“ pajamos 2024 m. išaugo daugiau, nei dešimt kartų, iki 743,1 mln., palyginti su 53 mln. juanių prieš metus. Tačiau ji patyrė 232,5 mln. juanių grynąjį nuostolį, kurį lėmė maža vietinės gamybos lustų skverbtis į rinką, ribotas savarankiškai sukurtų GPU pardavimo mastas ir didelės mokslinių tyrimų ir plėtros išlaidos.“ [3]

 

1. Pilno paketo GPU lustai – tai bendrovių, kurios siūlo ne tik grafikos procesoriaus (GPU) silicį, produktai. Jie teikia visapusišką sprendimą, įskaitant lustą, susijusias sistemas ir jį palaikančią programinės įrangos ekosistemą.

 

„NVIDIA“ yra puikus įmonės, teikiančios pilno paketo sprendimus, ypač dirbtinio intelekto srityje, pavyzdys.

 

Pilno paketo GPU sprendimas apima:

 

Lustus: patį GPU (pvz., „Blackwell GPU“) kartu su pagalbiniais lustais, tokiais, kaip procesoriai („Grace CPU“), duomenų apdorojimo įrenginiai („BlueField“), tinklo sąsajos plokštės („ConnectX“) ir komutatoriai („NVLink Switch“, „Spectrum Ethernet“ komutatorius, „Quantum InfiniBand“ komutatorius).

 

Sistemas: Aparatinės įrangos platformas, kuriose talpinami ir integruojami įvairūs lustai, pvz., „NVIDIA GB200 NVL72“ stelažinis sprendimas.

 

Programinę įrangą: Programinės įrangos ekosistema leidžia kūrėjams išnaudoti aparatinės įrangos galią. Tai apima:

 

Optimizavimą konkretiems darbo krūviams, pvz., dirbtinio intelekto išvados.

 

Programinės įrangos bibliotekas ir įrankius tokioms užduotims, kaip kvantavimas [2].

 

Įrankių rinkinius, bibliotekas ir kompiliatorius, skirtus kurti didelio našumo programas, pvz., „CUDA“ („NVIDIA“), „ROCm“ (AMD) ir „oneAPI“ („Intel“).

 

Derinimo ir našumo analizės įrankius.

 

Įmonės, siūlančios pilno paketo GPU sprendimus:

 

NVIDIA: Pirmaujanti pilno paketo GPU sprendimų, ypač skirtų dirbtiniam intelektui ir didelio našumo skaičiavimams, tiekėja. NVIDIA integruoja aparatinę ir programinę įrangą, kad optimizuotų našumą sudėtingoms programoms.

 

Kiti pretendentai: Kitos įmonės taip pat kuria integruotus aparatinės ir programinės įrangos sprendimus, kad galėtų konkuruoti GPU rinkoje.

 

Pilno paketo GPU lustas (arba sprendimas) apima ne tik aparatinę įrangą, bet ir programinę įrangą bei sistemas, reikalingas visapusiškai, optimizuotai skaičiavimo platformai sukurti. Šis metodas ypač aktualus tokiose srityse, kaip dirbtinis intelektas.

 

2. Kvantavimas – tai nepertraukiamų duomenų konvertavimo į atskirus, skaitmeninius vaizdus procesas. Tai apima duomenų tikslumo mažinimą, dažnai iš didesnio tikslumo formato (pvz., 32 bitų slankiojo kablelio) į mažesnio tikslumo formatą (pvz., 8 bitų sveikuosius skaičius). Ši technika plačiai naudojama atminties naudojimui sumažinti, skaičiavimo efektyvumui pagerinti ir diegimui įrenginiuose, kuriuose trūksta išteklių.

 

Išsamiau:

 

Pagrindinė koncepcija:

Kvantavimas susieja nepertraukiamą reikšmių diapazoną su mažesniu, atskiru reikšmių rinkiniu. Įsivaizduokite tai, kaip skaičių apvalinimą iki artimiausio sveikojo skaičiaus arba spalvų atvaizdavimą, naudojant ribotą paletę.

 

Taikymas:

 

Mašininis mokymasis: sumažina modelio dydį, pagreitina išvadų darymą (apmokyto modelio naudojimo prognozėms daryti procesą) ir leidžia diegti periferiniuose įrenginiuose.

 

Signalų apdorojimas: konvertuoja nuolatinius signalus (pvz., garsą ar vaizdą) į skaitmeninį formatą, kurį gali apdoroti kompiuteriai.

 

Muzikos kūrimas: suderina muzikos natas su laiko tinkleliu, kad ištaisytų laiko netikslumus.

 

Vaizdų apdorojimas: sumažina vaizde naudojamų spalvų skaičių, dažnai glaudinimui arba vaizdų rodymui įrenginiuose, kuriuose ribotas spalvų palaikymas.

 

Fizika: kvantinėje fizikoje energija, impulsas ir kiti dydžiai yra kvantuojami, o tai reiškia, kad jie gali įgauti tik tam tikras diskrečias vertes.

 

Privalumai:

 

Mažesni modelio dydžiai: mažesnis atminties kiekis saugojimui ir greitesnis įkėlimo laikas.

 

Greitesnis išvadų darymas: sveikųjų skaičių skaičiavimas aritmetinėse operacijose paprastai yra greitesnis, nei slankiojo kablelio operacijos, todėl prognozės pateikiamos greičiau.

 

Mažesnės energijos sąnaudos: duomenims apdoroti mažesniu tikslumu reikia mažiau energijos.

 

Kompromisai:

 

Tikslumo praradimas: tikslumo sumažinimas sukelia kvantavimo paklaidą, kuri gali turėti įtakos modelio tikslumui.

 

Tinkamos pusiausvyros paieška: tikslas – rasti tinkamą pusiausvyrą tarp modelio dydžio mažinimo ir priimtino tikslumo išlaikymo.

 

Metodai:

 

Kvantavimas po mokymo (PTQ): modelis kvantuojamas po jo apmokymo, nereikalaujant papildomų mokymo duomenų.

 

Kvantavimo sąmoningas mokymas (QAT): kvantavimas įtraukiamas į mokymo procesą, siekiant sumažinti tikslumo praradimą.

 

Įvairūs duomenų tipai: kvantavime dažniausiai naudojami duomenų tipai yra 8 bitų sveikieji skaičiai (int8), 16 bitų slankiojo kablelio skaičiai (fp16) ir smegenų slankiojo kablelio skaičiai 16 (bf16).

 

3. Chinese AI Chipmakers Plan IPOs. Qin, Sherry.  Wall Street Journal, Eastern edition; New York, N.Y.. 02 July 2025: B3.