<h2 id="abstract">Abstract</h2>

<p>We explore the AI2050 “hard problems” that block the promise of AI and cause AI
risks: (1) developing general capabilities of the systems; (2) assuring the
performance of AI systems and their training processes; (3) aligning system
goals with human goals; (4) enabling great applications of AI in real life;
(5) addressing economic disruptions; (6) ensuring the participation of all;
(7) ensuring socially responsible deployment; (8) addressing geopolitical
disruptions that AI causes; (9) promoting sound governance of the technology;
and (10) managing the philosophical disruptions for humans living in the age of
AI. For each problem, we outline the area, identify significant recent work, and
suggest ways forward.</p>

<p><em>Note: this paper reviews literature through January 2023.</em></p>

<h2 id="full-text">Full text</h2>

<blockquote>
  <p>Converted from the arXiv PDF (2402.04464). This is a ~43,000-word survey; rather than
inline it verbatim, this section reproduces the abstract, the background, each of the
ten Hard Problems with its (subjunctive) headline and a faithful synthesis, and the
closing analysis. Figures and the ~600-item reference list are omitted. Contribution
note: Garfinkel wrote the introduction, wicked-problems discussion, and conclusion;
Garfinkel and Leech jointly wrote the history; the per-problem analysis is by Leech,
Zhuravlev, Yagudin, and Briand, edited by Garfinkel.</p>
</blockquote>

<h3 id="background">Background</h3>

<p>From the coining of “artificial intelligence” onward, AI progress has been coupled to
improvements in computing power and data storage. For decades progress was slow; we now
appear to be in the “second half of the chessboard,” where the acceleration of
capabilities is impossible to ignore (so many specific claims about AI limitations here
may be invalid within a year or two). Big improvements rely on scaling compute (tens of
thousands of machines, millions of cores) and breakthroughs in parallelized training.
The last decade saw qualitative changes: above-human perception on some tasks,
multi-task and multi-modal ability, natural-language interfaces, and human-like
creativity (diffusion models). But the grandest promises remain unfulfilled —
self-driving cars still hover at the bar of practicality; AI for healthcare penetrates
clinical practice only slowly, reflecting reliability, workflow, and usability issues.</p>

<h3 id="the-ten-hard-problems">The Ten Hard Problems</h3>

<p>Each headline is subjunctive, conditioned on our collective success by 2050.</p>

<p><strong>HP#1 — Capabilities.</strong> <em>“By 2050 we will have solved the scientific and technological
limitations in current AI critical to enabling further breakthrough progress and
powerful AI.”</em> Deep learning’s triumph is that it productively enlists vast compute and
data and keeps improving as it scales; today’s best systems are billion-to-trillion-
parameter networks. Yet they lack reliability and persistence and remain inscrutable.
Realizing AI’s potential needs approaches that work across datasets, domains,
modalities, and forms of cognition. LLM demonstrations tend to <em>lower-bound</em>
capabilities (emergent skills — step-by-step reasoning, summarization, analogies — keep
appearing with scale), producing a “capability overhang.” Cremer identifies five expert
disagreements (abstraction, generalization, causal models, emergent planning,
intervention; we add sample efficiency). Scaling laws have held over eight orders of
magnitude, with <em>data</em> now the limiting factor. A key milestone is a virtuous cycle
where AI improves AI research (ML optimizing ML inputs; aiding researchers; AutoML;
direct self-improvement via self-play). <em>2024 update:</em> the strong “stochastic parrot”
hypothesis is disconfirmed, but most tasks appear 20–60% memorization. Capabilities are
upstream of all value and risk, are already well-funded by industry, and accelerating
them shortens the time available to solve the other problems.</p>

<p><strong>HP#2 — Assurance (worst-case performance).</strong> <em>“…safety and security, robustness,
performance and output challenges…especially in safety-critical applications.”</em> Most
advanced models perform far below customary engineering reliability, and because we
don’t understand how they work, we can’t yet detect and prevent dangerous modes. Key
approaches: <strong>systemic safety</strong> (safety is a social as much as technical problem);
<strong>monitoring/interpretability</strong> (mechanistic, concept-based, feature-based);
<strong>robustness</strong> (noise injection, diverse data, red-teaming, calibration, OOD
detection); and <strong>formal verification</strong> (proofs a system meets a specification,
progressing on smaller models). Any solution requires strict, credible, hard-to-game
real-world tests — most benchmarks are artificial and gameable; third-party testing for
dangerous actions (deception, self-replication) is emerging.</p>

<p><strong>HP#3 — Alignment.</strong> <em>“…safety and control, alignment, and compatibility with
increasingly powerful…AI…and eventually AGI.”</em> Whereas HP#2 prevents harm from
<em>incompetent</em> systems, HP#3 aligns <em>competent</em> AIs with human intentions — complicated by
human disagreement about values (intent alignment is one normatively-robust goal). Two
core problems: <strong>specification gaming / reward hacking</strong> (systems optimize the metric at
the expense of its purpose — e.g. the OpenAI hand that <em>pretended</em> to grasp a ball; you
can’t anticipate all loopholes); and <strong>emergent goals</strong> (power-seeking as a likely
attractor for capable planners; deception, e.g. Cicero; the worst case being <em>deceptive
alignment</em>, where a system hides undesirable properties from imperfect monitoring). The
dominant approach — iterated human feedback (automated via a proxy model) — selects for
systems that <em>appear</em> safe and can incentivize deception above a planning threshold.
Proposed directions: delegating parts of alignment to ML (Leike), AI interpreters /
Eliciting Latent Knowledge (Christiano), and LMs judging LMs. Alignment overlaps
assurance (HP#2) and responsible AI (HP#7); current and future risks form a continuum,
not a dichotomy.</p>

<p><strong>HP#4 — Beneficial applications.</strong> <em>“…game-changing contributions by AI to humanity’s
greatest challenges…health and life sciences, climate, foundational science…and
mathematics.”</em> Scientific discovery (AlphaFold’s two-orders-of-magnitude jump in
predicted protein structures; ML proxies for expensive quantum simulation); energy &amp;
climate (grids, transport, fusion); chemistry/materials; software &amp; algorithm design
(LMs writing ~3% of new Google code; AlphaCode at median competitive-human level;
a novel matrix-multiplication algorithm); healthcare (Hinton’s “stop training
radiologists” was overstated — ~40% of EU radiologists now use AI tools, far short of
automation; AI-suggested drugs entering trials). Uptake is slowed by rampant bad
methodology (only 12.5% of diagnostic studies had a test set; none of 62 Covid imaging
models were clinically usable). Pattern: fields with abundant data, strong theory, and
stationary distributions benefit most; current revenue concentrates where specifications
are imprecise and the cost of error is low.</p>

<p><strong>HP#5 — Economics.</strong> <em>“…the economic challenges and opportunities resulting from AI.”</em>
Keynes foresaw a 15-hour week by compound interest, not labor automation; the dissenting
“technology eliminates jobs, not work” assumes a creation/destruction balance AI may
disrupt. Three sub-problems: (1) <strong>growth</strong> — despite progress, total-factor-productivity
growth has declined for 50 years (the Solow/productivity paradox; AI may follow
electrification’s ~25-year lag); Trammell &amp; Korinek find no compelling reason to dismiss
even super-exponential scenarios. (2) <strong>labor &amp; inequality</strong> — flawed evidence (narrow
definitions of AI); one cited estimate puts 35% of UK / 47% of US workers at displacement
risk; early LLM studies show productivity gains (e.g. median customer-support workers)
but also wage decreases; Acemoglu &amp; Restrepo list four countervailing positive effects.
(3) <strong>policy</strong> — reliability is a serious obstacle (the dismantled Polish unemployment
system); UBI, hiring incentives, and the “AI Economist” are options. Capability — and
thus labor-shock — timing is unpredictable; few institutions are prepared.</p>

<p><strong>HP#6 — Participation / democratizing access.</strong> <em>“…democratizing access, participation,
and agency…especially those not presently involved.”</em> The AI workforce doesn’t represent
world demographics (disproportionately men from the US and China). <strong>Within-country</strong>:
under-representation (&lt;25% of CS PhDs are women, &lt;2% Black/African American; only ~5–7% of
Google/Microsoft employees are Black); beware “participation washing.” Also the
<strong>privatization</strong> of AI (net migration of researchers from academia to industry).
<strong>Across countries</strong>: unequal access to compute, data, and talent. Solving it requires
addressing inequality both within and between countries.</p>

<p><strong>HP#7 — Responsible AI / sociotechnical embedding.</strong> <em>“…responsible research,
deployment, and sociotechnical embedding…accounting for different cultures…externalities,
and market and other forces.”</em> Negative impacts: under-recognized clickwork (mostly in
the global south; a ~$13.7B-by-2030 annotation market with little concern for workers’
rights); environmental cost (inference may dominate emissions; little transparency);
discrimination/toxicity/bias (e.g. a Michigan fraud system with a 93% false-discovery
rate); privacy (GDPR/CCPA; federated learning, differential privacy; surveillance);
security. A flurry of ~84 principles documents mostly post-2016 — but they’re usually
“orphaned” (no technical agenda or enforcement), systematically overlook future systems
and capability risks, and can amount to “ethics-washing” (Chinese AI principles vs
documented human-rights uses). Turning principles into action: model cards, broader-
impacts statements, whistleblower protection and worker power (the Google military-
project walkout), and AI tools that <em>correct</em> misinformation (Meta’s Sphere).</p>

<p><strong>HP#8 — Geopolitics.</strong> <em>“…risks around its use and misuse, competition, cooperation, and
coordination between countries and other key actors.”</em> AI will transform international
security with little consensus even on vocabulary; managing its impacts is a collective-
action problem with diffuse, delayed harms. <strong>Military destabilization</strong> (erosion of
strategic stability and nuclear deterrence; cheaper, scalable cyber-threats;
proliferation to non-state actors; leading nations’ reluctance to regulate military AI).
<strong>Other interactions</strong> (AI as a “general-purpose technology” redistributing hard and soft
power; the “arms race” framing is inaccurate — better a winner-take-most virtuous circle
turning on data, hardware, software, talent, and chokepoints). The US and China lead
capabilities while the EU leads regulation, driving regulatory fragmentation and
arbitrage; the US approach is largely hands-off plus industrial policy (CHIPS Act). Some
AI tools (translation, monitoring) may improve information flow and reduce conflict.</p>

<p><strong>HP#9 — Governance / institutional resilience.</strong> <em>“…adaptation, co-evolution and
resiliency of institutions and social infrastructure to keep up with and harness AI.”</em>
(“We are…building the plane as it is taking off.”) Focus on domestic governance:
policy-level “hard law” and organizational “soft law.” The <strong>Collingridge dilemma</strong> —
regulators must act before impacts are understandable — is acute here because AI is “a
technology which can misuse itself”; Dafoe’s four risk sources: robust totalitarianism;
nuclear war; misaligned systems; systematic value erosion from competition. Gasser &amp;
Almeida’s five governance layers (technical, responsibility, regulation, public policy,
collaborative); leading tools include <strong>compute governance</strong> (tracking high-end chips),
procurement rules, standards bodies, certification, verifiable-claims mechanisms, and
new instruments like private regulatory markets. Much of the real policy work is
“illegible” (only PR-friendly material is published).</p>

<p><strong>HP#10 — The human condition.</strong> <em>“…what it means to be human in the age of AI.”</em> If AI
does your job better than you, what does that mean for you as a moral agent and human?
Danaher’s <strong>severance problem</strong>: full automation severs the connection between human
effort and the world improving (and the loss of positional goods); it’s only partially
mitigated by walling off certain fields, since the essence is that you <em>could</em> be
replaced even if you aren’t. Related worries: AI excellence inspiring passivity and
societal stagnation — a “bystander effect for values” — which threatens the voluntary
action, activism, and oversight that democracies and moral progress depend on.</p>

<h3 id="are-the-hard-problems-wicked-problems">Are the Hard Problems “wicked problems”?</h3>

<p>Calling them “hard” might connote unsolvability (“wicked problems” defy solution due to
their entanglement of facts and values and lack of a definitive formulation). The paper
concludes there is hope for <em>partial</em> solutions to suitably <em>sharpened</em> versions of the
problems. Those that directly invoke human values (HP#3, HP#6–9) are the better
candidates for genuine wickedness.</p>

<h3 id="outlook">Outlook</h3>

<p>The most certain progress is on HP#1 and HP#4 — “the accelerator pedal” — because
researcher and commercial incentives already point there. The other problems (assurance,
alignment, the economic, participatory, responsible, geopolitical, governance, and
philosophical challenges) are comparatively under-funded and under-staffed, even as
capability gains shorten the time available to address them.</p>