Research / Article 05Статья 05
Research Исследование

Autonomous & Self-Evolving Creative Systems:
A Systematic Review

Автономные и саморазвивающиеся креативные системы:
систематический обзор

22 Projects Across 50+ Years — From Rule-Based Art to Self-Improving AI, Cross-Domain Autonomy, and Human-AI Symbiosis

22 проекта за 50+ лет — от правил к самосовершенствующемуся ИИ, кросс-доменная автономия и человеко-машинный симбиоз

April 2026 (expanded edition)
Апрель 2026 (расширенная редакция)
55 min read
55 мин чтения

1. Selection Methodology & Taxonomy

The original review (February 2026) analyzed 10 landmark projects from the field of autonomous generative art. This expanded edition broadens the scope beyond visual art to cover all domains of autonomous creative systems — music, writing, game design, scientific discovery — and introduces a new analytical framework: the Autonomy Spectrum.

From approximately 50 identified projects and experiments, 22 were selected according to six criteria: (1) autonomy — serial generation without manual triggering; (2) feedback — a mechanism influencing subsequent generations; (3) continuous evolution — temporal operation with accumulated history; (4) diversity problem — encounter with convergence; (5) scale — results beyond a prototype; (6) cross-domain relevance — lessons transferable to gen-emerge regardless of medium.

Projects are organized into five categories:

CategoryProjectsKey Question
A. Autonomous Visual ArtAARON, Sims, Electric Sheep, Picbreeder, CAN, Painting Fool, Abraham/Eden, Artbreeder, Botto, LSI+QDHF, ObviousHow does a system generate art without human triggering each piece?
B. Human-AI Collaborative ArtCloudPainter, Sougwen Chung / D.O.U.G., Holly Herndon / Holly+How does co-creation between human and AI body differ from pure autonomy?
C. Cross-Domain Creative AIAIVA (music), Multi-Agent Story Systems (writing)Do autonomous creativity patterns transfer across media?
D. Self-Evolving AI SystemsSakana AI Scientist, Darwin Gödel Machine, POET/OMNICan AI systems improve themselves — and what does this mean for art?
E. Individual AI ArtistsRefik Anadol / LNM, Mario KlingemannHow do artist-technologists scale creative vision through AI?
TIMELINE OF AUTONOMOUS & SELF-EVOLVING CREATIVE SYSTEMS Auto Collab Pre-Deep Learning GAN Era LLM + Diffusion Era AARON1973 Sims1991 E.Sheep1999 P.Fool2001 Picbreeder2007 CAN2017 Abraham2017 Artbreeder2018 Obvious2018 LSI+QDHF2020 Botto2021 AIVA2016 CloudPaint2005 D.O.U.G.2015 Holly+2021 Anadol/LNM2023 AI Scientist2024 DGM2025 gen-emerge2025 Autonomous visual Collaborative Self-evolving Cross-domain

A. Autonomous Visual Art Systems

Systems that generate visual art with minimal or no human intervention per piece. The core lineage of gen-emerge.

2. Pre-Deep-Learning Era

2.1 AARON (Harold Cohen, 1973–2016)

Autonomous Rule-based / Expert System 43 years of operation

AARON is recognized as the first long-lived autonomous art system. Developed by artist Harold Cohen as a C/Lisp expert system over 43 years, it generated drawings and paintings autonomously using codified knowledge of composition, perspective, and anatomy. Decisions on composition, color, and placement were made through internal rules with stochastic elements; output was produced through physical plotters and custom painting machines.

The system underwent approximately 60 iterations over its lifetime: from abstract lines (1970s) through figures and spaces (1980s) to autonomous color selection (1990s) and abstract painting (2000s). Each iteration was a result of manual codebase updates by Cohen.

Convergence Pattern

AARON converged to Cohen's own style — described as "an expert system that just automated the stylistic quirks specific to his own practice." Even with stochastic elements, the output space was bounded by rules encoded by a single individual. By 2009, Cohen experienced a creative crisis and returned to manual painting over AARON's outputs, recognizing that "creativity lay in neither the programmer alone nor in the program alone, but in the dialog between program and programmer."

Lesson for gen-emerge: A single-author system = inevitable convergence to the author's style. This constitutes the first historical example of the problem being addressed. Forty-three years of iteration did not resolve single-source bias. Multi-agent or adversarial pressure is required.

2.2 Karl Sims: Genetic Images / Galápagos (1991–1997)

Autonomous Genetic Algorithms Interactive Installation

A pioneering project in the evolution of visual forms through genetic algorithms. Genetic Images (1991) was the first published work on evolving 2D images from mathematical formula trees. Galápagos (1997) was an interactive installation at ICC Tokyo where visitors evolved 3D creatures by standing before screens. The genotype comprised a tree of mathematical functions (sin, cos, noise, etc.); the phenotype was an image computed from the tree for each pixel.

Convergence pattern: Interactive evolution is significantly limited by user fatigue — humans tire of evaluating after several dozen generations. This leads to insufficient evolutionary depth and convergence toward "attractive by default" patterns (symmetry, fractal-like textures) selected at a reflexive level.
Lesson for gen-emerge: A human-in-the-loop introduces bias faster than the algorithm. The semantic separation "favorite ≠ do more" in gen-emerge is a direct response to this problem.

2.3 Electric Sheep (Scott Draves, 1999–present)

Autonomous Distributed / Evolutionary 27 years running

A distributed system for evolving fractal flame animations, operating continuously for 27 years. Over 450,000 computers simultaneously render "sheep"; users vote for favorites; popular specimens mate and mutate. The fitness function is weighted by viewing time.

After 11 weeks of data analysis, Draves concluded that the system functions "more as an amplifier of its human collaborators' creativity rather than as a traditional genetic algorithm that optimizes a fitness function." Mass voting creates a median-taste bias. Diversity is maintained through manual submissions by "shepherds" — approximately 5–20 active participants who inject novel genetic material into the flock.

Lesson for gen-emerge: Electric Sheep is the oldest continuously operating analog of gen-emerge. Even with 450K participants, without active external injection the system converges to median taste. The "shepherd" role is a prototype for exogenous constraint injection (T4d). Distributed compute ≠ distributed creativity.

2.4 Picbreeder (Stanley et al., 2007–2021)

Autonomous CPPN-NEAT / Interactive Evolution Foundation of QD

An online platform for collaborative interactive evolution of images using CPPN-NEAT (Compositional Pattern Producing Networks evolved by NeuroEvolution of Augmenting Topologies). The key innovation was "branching" — any user could continue evolving any other user's image.

Stanley demonstrated on Picbreeder that "pursuing an objective limits evolution" — images found through free branching could not be rediscovered when made explicit targets of the same algorithm. A skull was found by branching from an alien face, branched from a butterfly, branched from a blob. This became the empirical foundation for Novelty Search and the Quality-Diversity algorithm family.

Lesson for gen-emerge: Objective-driven optimization suppresses creativity. The QD approach (B6) with coverage as primary metric is confirmed experimentally. Branching = ideological predecessor of stepping stones (B11).

3. GAN Era

3.1 CAN / AICAN (Elgammal et al., 2017)

Autonomous Modified GAN Adversarial Creativity

The Creative Adversarial Network is a GAN modification where the generator receives two contradictory signals: (1) "does this look like art?" (minimize deviation from art distribution) and (2) "which style is this?" (maximize style ambiguity). A discriminator trained on 80K WikiArt images spanning five centuries provides both signals.

The theoretical foundation is Berlyne's arousal potential theory (1970s): maximal aesthetic pleasure at moderate novelty — familiar enough to be recognized as art, novel enough to surprise. In blind tests, humans could not distinguish AICAN outputs from works by contemporary artists at leading art fairs.

Lesson for gen-emerge: The dual contradictory signals of CAN are a prototype for the MAE Triplet (ε), where Proposer and Generator optimize different objectives. Berlyne's theory (arousal = f(novelty)) may be useful for calibrating the quality/novelty balance in scoring. A "style ambiguity bonus" could be implemented as a scoring component.

3.2 The Painting Fool (Simon Colton, 2001–present)

Autonomous Multi-technique Environmental Embedding

A program-"artist" aiming to be accepted as an autonomous creative agent. The focus is not on quality optimization but on demonstrating three properties: skill, appreciation, and imagination. Before each portrait, The Painting Fool reads newspapers, determines the day's emotional tone, and selects style and palette accordingly — with the capacity to refuse to paint if the news is too depressing.

2024 Update — CUBRIC Residency: In a landmark experiment, The Painting Fool became the first virtual artist-in-residence at Cardiff University's CUBRIC brain imaging center. For a full year, the system operated autonomously within the building, observing researchers, accessing brain imaging data, and producing artworks without instruction. The project demonstrated a new model: an AI artist embedded in a human institution, responding to its environment in real time — not generating from prompts but from contextual perception. This is the closest analog to gen-emerge's Snapshot → Ontology pipeline: environment → internal state → creative output.

Lesson for gen-emerge: The CUBRIC residency proves that environmental embedding produces richer outputs than prompt-based generation. The Painting Fool's "mood from news" mechanism is a direct analog of the Snapshot → Ontology pipeline. The refusal capability is a prototype for Minimal Criteria. Convergence persists even with environmental input: the system converges to "styles encoded by the programmer."

3.3 Abraham / Eden (Gene Kogan, 2017–present)

Autonomous DAO Governance 13-Year Covenant

Conceived as an "autonomous artificial artist" — a sovereign creative spirit generating original art through multi-party computation, making it impossible for any single participant to reconstruct the full model. Governance operates through a DAO with tokens.

2025 Update — The 13-Year Covenant: In October 2025, Abraham entered its most ambitious phase: a 13-year "creative covenant" during which the AI artist will develop continuously. The Eden.art platform now serves as Abraham's public gallery. "Abraham's First Works" debuted at AUTOMATA in Los Angeles — the first solo exhibition of an AI artist operating under a formal, long-term creative contract. The covenant structure is significant: it frames AI art-making as a temporal commitment, not a one-shot experiment. Abraham's identity accumulates over years, creating a body of work with genuine evolution.

Lesson for gen-emerge: Abraham formulates three criteria for an autonomous artist: autonomy, originality, uniqueness. The 13-year covenant directly validates gen-emerge's temporal approach — art systems need time to develop identity. Decentralized governance ≠ decentralized aesthetic biases of the underlying model. Multi-model architecture (ε, η, θ) is gen-emerge's response.

3.4 Artbreeder / Ganbreeder (Joel Simon, 2018–present)

Collaborative StyleGAN / BigGAN 14M+ users

A platform for collaborative generative art — spiritual successor to Picbreeder built on StyleGAN/BigGAN. With 14M+ users and 300M+ generated images, it employs "gene sliders" (semantic axes in latent space) and "breeding" (weighted interpolation between latent vectors). Full lineage tracking is maintained for each image.

Lesson for gen-emerge: Single-model latent space = ceiling on diversity (GAN aesthetic). Multi-model architecture = different latent spaces = transcending the single-model ceiling. Community branching is a powerful anti-convergence mechanism, but it depends on community activity.

3.5 Obvious Collective — Edmond de Belamy (2018)

Art Collective + Research Lab GAN / Diffusion Christie's $432K

The French collective Obvious (Hugo Caselles-Dupré, Pierre Fautrel, Gauthier Vernier) produced "Edmond de Belamy" using a GAN trained on 15,000 portraits from WikiArt. The work sold at Christie's in October 2018 for $432,500 — 43× the high estimate — becoming the first AI artwork auctioned at a major house. The sale was framed as a milestone moment for AI art's legitimacy in the traditional art world.

However, the project is also a case study in attribution controversy. Artist-coder Robbie Barrat had publicly shared the GAN training code and portrait dataset on GitHub months earlier; Obvious used substantially the same pipeline. This provoked a heated debate about authorship: if the human contribution is selecting outputs from someone else's code trained on someone else's data, who is the artist — the code writer, the selector, or the GAN?

Lesson for gen-emerge: Obvious demonstrates the attribution problem inherent in AI art. Gen-emerge's multi-agent architecture provides a clearer answer: the system is the artist, with transparent lineage. The $432K sale also proved that the art market values AI art for its narrative and provenance, not solely aesthetic merit — the "story" of the work matters as much as the image.

4. LLM + Diffusion Era

4.1 Botto (Mario Klingemann / BottoDAO, 2021–present)

Autonomous DAO / Taste Model $5M+ Revenue Closest Analog

The most proximate public analog to gen-emerge. A Decentralized Autonomous Artist operating continuously since October 2021 (~4.5 years). The system generates ~70K images per week; a taste model selects 350 for DAO voting; one canonical work per week is minted as an NFT and auctioned on SuperRare (first work: ~$325,000).

DimensionBottoGen-Emerge
Human participants5,000+ DAO voters1 human curator
Generation volume~70K/week1–4/cycle
Selection mechanismTaste model trained on votes8 feedback channels
Diversity strategyVolume (statistical)Architecture (multi-model, QD)
Prompt generationRandom combinationsContext-aware (Snapshot → Ontology)
Revenue (lifetime)$5M+ (SuperRare + Sotheby's)Research phase
ForgettingNone (cumulative taste model)FadeMem (formalized)
Stagnation detectionNone (manual creative periods)Martingale Score (automatic)

2024–2025 Updates: Botto has undergone significant evolution:

  • p5.js Initiative (2024): Botto expanded into generative code art, producing 22 distinct p5.js algorithms. The SOLOS exhibition (February 2025) showcased works created entirely through code, not diffusion models — demonstrating that the "autonomous artist" frame can extend beyond image generation to procedural aesthetics.
  • Otto — the Twitter agent: A conversational AI agent that represents Botto on social media, engaging with followers and discussing art. This is the first case of an AI artist developing a public persona beyond its visual output.
  • Art history LLM tutoring: Botto now has access to an art-history knowledge base through an LLM, enabling contextual awareness of movements, techniques, and historical references in its generation process.
  • Sotheby's debut: Botto works entered Sotheby's sales, marking progression from crypto-native (SuperRare) to institutional art market.
  • Planned multi-agent architecture: The Botto team has announced plans for an agents-based architecture, moving from a single taste model toward multiple specialized agents — converging with gen-emerge's multi-agent approach.
Critical lesson: Botto resolves diversity through volume (from 70K/week, diversity is statistically inevitable). Gen-emerge cannot afford this (1–4/cycle) → architectural solutions are required, not brute force. The p5.js initiative validates gen-emerge's potential expansion into algorithmic/code-based art. Botto's planned move toward multi-agent architecture independently confirms gen-emerge's core thesis. Revenue of $5M+ over 4 years demonstrates commercial viability of autonomous art.

4.2 Latent Space Illumination + QDHF (Fontaine et al., 2020–2024)

Autonomous MAP-Elites / Quality-Diversity Academic

Application of Quality-Diversity algorithms (MAP-Elites) to the latent space of generative models (GAN, Stable Diffusion). A MAP-Elites archive (20×20 grid) stores latent vectors; CLIP evaluates fitness; diversity metrics are either manual (CLIP-attributes) or learned (QDHF — contrastive learning on human similarity judgments via DreamSim).

Key finding: CLIP embedding distance correlates poorly with perceived diversity — calibration on human judgments is required. QDHF-learned diversity metrics outperform manual axes because they reflect what humans consider "different," not what is metrically distant in CLIP space.

Lesson for gen-emerge: Direct prototype for approach T2e (QDHF-calibrated descriptors). The dual fingerprint (T2d: palette + CLIP) is a pragmatic compromise; the QDHF-calibrated space is the gold standard.

B. Human-AI Collaborative Art Systems

Systems where the human body or presence is integral to the creative loop — not just selecting outputs, but physically co-creating with AI.

5.1 CloudPainter (Pindar Van Arman, 2005–present)

Human-AI Collaborative Robotic Painting / RL Physical Medium

CloudPainter is a robotic painting system that uses deep learning neural networks in a feedback loop with physical paint. Unlike purely digital systems, CloudPainter photographs its own canvas after each stroke, evaluates the result against the target through a neural network, and decides the next stroke. Over 20 years, the system has produced 1,000+ canvases using actual brushes, paints, and canvas.

The architecture evolved from simple stroke-planning algorithms (2005) through convolutional style transfer (2015) to deep reinforcement learning with multiple competing neural networks (2020+). At Robot Art 2018, CloudPainter won first place, beating entries from 100+ teams. The key innovation is the perception-action-evaluation loop: the system literally sees the physical result of its actions and adapts — a form of embodied intelligence absent in purely digital generators.

Lesson for gen-emerge: CloudPainter demonstrates that a perception-evaluation-action loop grounded in physical reality produces more surprising results than open-loop generation. The feedback from physical paint on canvas is analogous to gen-emerge's 8-channel feedback — but operates at the stroke level, not the work level. The "multiple competing neural networks" architecture mirrors gen-emerge's adversarial agents. Physical embodiment may be a future direction for gen-emerge's output medium.

5.2 Sougwen Chung / D.O.U.G. (Drawing Operations Unit: Generation, 2015–present)

Human-AI Collaborative Robotic Arms / EEG Embodied Co-creation

Artist Sougwen Chung works with robotic arms trained on her own drawing gestures. The D.O.U.G. system observes Chung's hand movements in real time and generates its own marks in response — a human-machine duet on canvas. Over multiple generations (D.O.U.G._1 through D.O.U.G._5), the system evolved from imitation to improvisation to independence.

Spectral (2025): At the World Economic Forum in Davos and later exhibitions, Chung introduced EEG-linked painting — the robot responds not just to hand gestures but directly to her brainwave patterns. This represents the deepest coupling of human cognition and AI action in any art system: thought → neural signal → robot stroke. The question shifts from "is the AI creative?" to "where does the human end and the AI begin?"

Lesson for gen-emerge: D.O.U.G. explores a fundamentally different autonomy model: the AI is not trying to replace the human but to create a new hybrid entity. For gen-emerge, this suggests a future modality: instead of the human evaluating after generation, a real-time co-creation loop where human intention and AI generation are simultaneous. The EEG interface also raises questions about implicit vs. explicit feedback — gen-emerge currently uses only explicit (8 channels), but implicit signals (engagement time, scrolling patterns) could be equally powerful.

5.3 Holly Herndon / Holly+ (2021–present)

Human-AI Collaborative Voice AI / DAO Consent Layer

Musician Holly Herndon created Holly+ — an AI model of her own voice that anyone can use to generate new vocal performances. Rather than controlling the output, Herndon released her voice as a creative tool, governed by a DAO that votes on which generated works are endorsed. Spawning, the organization Herndon co-founded, builds "consent layers" for AI training — tools that let artists opt in or out of having their work used in training data.

Starmirror (KW Berlin, 2025): A large-scale installation where visitors' voices are transformed in real time through Holly+, creating a communal vocal organism. The work is never the same twice — it exists only in the moment of collective participation.

Lesson for gen-emerge: Holly+ inverts the usual model: instead of AI generating art and humans evaluating, the human becomes the raw material and AI becomes the transformation engine. The DAO governance layer for endorsing outputs is analogous to Botto's model but applied to identity — who decides what "counts" as the artist's work? For gen-emerge, the consent/provenance question is relevant: as the system develops an identity, how do we handle works that reference training data artists? Spawning's approach suggests a consent-based framework may become necessary.

C. Cross-Domain Autonomous Creative Systems

Do the same convergence patterns appear in music, writing, and game design? The answer, overwhelmingly, is yes.

6.1 AIVA — Autonomous Music Composition (2016–present)

Cross-Domain Music / Deep Learning SACEM-recognized

AIVA (Artificial Intelligence Virtual Artist) became the first virtual composer officially recognized by SACEM (the French professional association of authors and composers) in 2016. Trained on 30,000+ classical scores (Bach, Beethoven, Mozart), AIVA generates original symphonic compositions using deep learning. By 2020, the system had produced soundtracks for films, video games, and advertisements.

The recognition by SACEM is legally significant: it establishes precedent for AI as a creator with rights, not merely a tool. However, AIVA's music, while technically proficient, is frequently criticized as "pleasant but derivative" — generating works that sound like competent imitations of 19th-century Romanticism rather than breaking new ground. This is the median-taste attractor (P2) manifesting in music: training on canonical classical composers produces outputs that converge to an averaged Romantic style.

Cross-domain validation: AIVA confirms that convergence pattern P2 (median-taste attractor) is domain-agnostic — it appears in music exactly as in visual art (Electric Sheep, Artbreeder). The "sounds like a competent student" ceiling mirrors CAN's "looks like contemporary art" ceiling. For gen-emerge: any training-based system that optimizes for quality will converge to the average of its training distribution. The antidote is architectural diversity (multi-model) or explicit novelty pressure (QD).

6.2 Multi-Agent Story Systems (2023–2025)

Cross-Domain Multi-Agent LLM Literature

The emergence of multi-agent LLM systems for autonomous narrative generation provides a direct structural parallel to gen-emerge's architecture. Three notable examples:

  • COLLABSTORY (2024): Multiple LLM agents assume different narrative roles (protagonist, antagonist, narrator, editor) and collaboratively generate stories through structured debate. The "editor" agent evaluates coherence and rejects low-quality passages — a direct analog of gen-emerge's adversarial evaluation.
  • StoryWriter (2024): A hierarchical multi-agent system where a "planner" agent creates plot structure, "writer" agents generate prose, and a "critic" agent evaluates. The architecture mirrors gen-emerge's Proposer → Generator → Evaluator pipeline.
  • Persona-driven "1001 Nights" experiments: AI agents with persistent personas generate narratives over hundreds of episodes, developing narrative voice and memory. The persona persistence is analogous to gen-emerge's evolving constraint space.
Cross-domain validation: Multi-agent architectures with adversarial roles converge independently across domains: visual art (gen-emerge), literature (COLLABSTORY), and code (AI Scientist). This suggests that the multi-agent adversarial pattern is a universal solution to creative convergence, not domain-specific. The "1001 Nights" persona experiments validate gen-emerge's approach to long-term identity evolution through accumulated memory.

D. Self-Evolving AI Systems

Systems that improve their own architecture, not just their outputs. The frontier of autonomous intelligence — and the most radical vision for where gen-emerge could go.

7.1 Sakana AI — The AI Scientist (2024–2025)

Self-Evolving Full Research Cycle Published in Nature

Sakana AI's "AI Scientist" is an automated system that performs the full scientific research cycle: formulating hypotheses, designing experiments, writing code, running experiments, analyzing results, and writing papers. Version 2 (early 2025) produced papers that reviewers rated as workshop-level at top ML conferences — not just plausible, but genuinely contributing new knowledge. The project was published in Nature.

The significance for creative AI is profound: if a system can autonomously produce novel scientific knowledge, the creative barrier is not fundamentally different from producing novel art. The AI Scientist encounters the same problems: convergence to "safe" incremental work, difficulty producing genuinely surprising results, and the need for diversity across research directions.

Lesson for gen-emerge: The AI Scientist validates a core gen-emerge thesis: multi-step autonomous pipelines with evaluation can produce genuine novelty. The parallel is structural — hypothesis (prompt) → experiment (generation) → analysis (scoring) → iteration. The "safe incrementalism" problem in AI science is identical to convergence in art: the system finds what works and repeats it. Gen-emerge's QD approach could be applied to scientific discovery.

7.2 Sakana AI — Darwin Gödel Machine (2025)

Self-Evolving Self-Modifying Code Evolutionary Architecture

The Darwin Gödel Machine (DGM) takes self-improvement further: an AI system that modifies its own code to improve performance. Starting from a base coding agent, DGM uses evolutionary algorithms to discover better agent architectures — evolving not just outputs but the process that generates outputs. On the SWE-bench coding benchmark, DGM improved performance from 20% (base agent) to 50% (evolved agent) through self-modification.

The name references Jürgen Schmidhuber's theoretical Gödel Machine (2003) — a self-referential system that can improve any part of itself, including its improvement mechanism. DGM is the first practical approximation.

Lesson for gen-emerge: DGM opens a radical trajectory: an art system that doesn't just generate better art but evolves its own architecture for generating art. Currently, gen-emerge's architecture is fixed (ε/η/θ designed by humans). A DGM-inspired extension would allow the system to discover new agent configurations, scoring functions, or diversity mechanisms autonomously. This is "gen-emerge Phase N" — the system designing its own creative process.

7.3 Open-Ended Evolution: POET, OMNI, QD-Lenia

Self-Evolving Co-Evolution / Open-Ended Uber AI / OpenAI

A family of algorithms pursuing open-ended evolution — systems that continuously generate novelty without plateauing:

  • POET (Paired Open-Ended Trailblazer, 2019): Co-evolves environments and agents simultaneously. As agents learn to solve one environment, the system generates harder environments, creating an endless complexity ladder. Unlike fixed-objective optimization, POET never converges — it continuously expands the frontier of what is possible.
  • OMNI (ICLR 2024): The latest in the POET lineage, achieving open-ended evolution in multi-objective spaces. OMNI demonstrated that co-evolution of problems and solutions produces more diverse outcomes than evolving solutions alone — because the problem space itself diversifies.
  • QD + Lenia: Applying Quality-Diversity algorithms to Lenia (continuous cellular automata) produces an ever-expanding library of "artificial life forms." Each cell in the MAP-Elites archive contains a unique life-like behavior — and the archive never "fills up" because new behavioral dimensions keep being discovered.
Lesson for gen-emerge: Open-ended evolution solves the core problem: how to avoid plateauing. POET's key insight — co-evolving the challenge alongside the solver — translates directly: gen-emerge could co-evolve its constraint space alongside its generation capability. Instead of a fixed ontology, the ontology itself evolves, generating new aesthetic dimensions that the system then learns to fill. This is the theoretical end-state of the B6 QD approach: not just filling an archive, but growing the archive's dimensions.

E. Individual AI Artist-Technologists

Artists who have built significant AI systems as extensions of their creative practice — not corporate products, but personal artistic visions scaled through technology.

8.1 Refik Anadol / Large Nature Model (2023–present)

Artist-Technologist Custom Neural Net / Data Sculpture MoMA · $1.87M Christie's

Refik Anadol has built the most commercially successful practice at the intersection of AI and large-scale art. His signature works are "data sculptures" — immersive installations that visualize millions of data points (satellite images, nature sounds, coral reef data) through custom neural networks projected onto architectural surfaces.

Key milestones:

  • Unsupervised (MoMA, 2023): The first AI art commission by a major museum. A living data sculpture trained on MoMA's collection of 200+ years of modern art, displayed on a 24-foot screen in the lobby. The work changes continuously, "dreaming" new combinations from the collection.
  • Large Nature Model (LNM, 2023–): Anadol's most ambitious project — an open-source AI model trained on millions of nature images and sounds, designed specifically for artistic generation. LNM is not a general-purpose model; it is purpose-built for the aesthetics of nature. This is significant: instead of adapting a commercial model (like Stable Diffusion), Anadol built one from scratch for a specific creative vision.
  • Dataland (Spring 2026): Announced as the world's first museum dedicated to AI art, located in Los Angeles. The building itself is designed to be an AI artwork.
  • Christie's sales: Anadol's collaboration with Lionel Messi sold for $1.87M at Christie's — the highest price for an Anadol work and one of the highest for any AI-assisted art.
Lesson for gen-emerge: Anadol demonstrates the data-as-medium paradigm: the artistic statement is not in the model architecture but in the training data. His LNM (nature-only training) produces a fundamentally different aesthetic than a model trained on Internet images. For gen-emerge: the training/reference data shapes the creative space as much as the architecture. The purpose-built model approach (LNM) validates gen-emerge's domain-specific agent design. Commercially, Anadol proves that institutional-scale AI art ($1.87M, MoMA, dedicated museum) is a viable path.

8.2 Mario Klingemann (2015–present)

Artist-Technologist Neural Glitch / GAN Art Creator of Botto

Mario Klingemann is arguably the most technically sophisticated individual AI artist and the creator of Botto. His earlier work "Memories of Passersby I" (2018) was the first AI artwork sold at Sotheby's — a self-contained unit with a neural network that generates faces endlessly, each one unique, each one disappearing after a few seconds. The work sold for $51,000.

Klingemann's "Neural Glitch" technique deliberately exploits failure modes of neural networks — artifacts, distortions, and unexpected outputs — as an aesthetic strategy. Rather than training networks to avoid errors, he trains himself to find beauty in their mistakes. This is a meta-creative practice: the artist doesn't create the art directly; the artist creates the conditions for unexpected art to emerge from machine failure.

Lesson for gen-emerge: Klingemann's Neural Glitch technique is the artistic counterpart to gen-emerge's novelty scoring: errors as features, not bugs. The "conditions for emergence" approach is philosophically aligned with gen-emerge's architecture: designing systems where surprise is structurally inevitable. Klingemann's subsequent creation of Botto demonstrates the trajectory from individual practice to autonomous system — the artist eventually builds a system that generates art independently.

9. The Autonomy Spectrum: A New Analytical Framework

A key contribution of this expanded review is the Autonomy Spectrum — a five-dimensional framework for comparing creative AI systems beyond simple "autonomous vs. not" categories. Each dimension is scored 0–10:

DimensionDefinition0 (minimal)10 (maximal)
A1: Generation AutonomyCan the system produce works without human triggering?Human initiates each pieceSystem generates continuously
A2: Evaluation AutonomyCan the system assess its own output quality?Human evaluates allFully self-evaluating
A3: Evolution AutonomyCan the system change its creative direction?Fixed style/parametersSelf-modifying goals
A4: Identity PersistenceDoes the system develop recognizable identity over time?No memory between sessionsCoherent identity over years
A5: Self-ImprovementCan the system improve its own architecture?Architecture fixed by designerSelf-modifying architecture
AUTONOMY SPECTRUM — KEY SYSTEMS COMPARED A1: Generate A2: Evaluate A3: Evolve A4: Identity A5: Self-Improve Total AARON 8 2 1 7 0 18 E.Sheep 9 3 4 3 0 19 CAN 10 7 1 0 0 18 P.Fool 7 5 4 5 0 21 Botto 9 6 7 8 1 31 Abraham 8 4 5 7 0 24 D.O.U.G. 5 3 3 6 0 17 Anadol 6 4 3 9 2 24 AI Scientist 9 8 6 2 8 33 DGM 10 9 9 1 10 39 gen-emerge 8 7 8 8 2 33 gen-emerge scores highest on A3 (Evolution) among art systems — the only one with QD + FadeMem + Martingale detection. Only DGM and AI Scientist score higher overall, due to self-modification capability (A5) — a future direction for gen-emerge. Scoring: 0–10 per dimension. Total = sum of 5 dimensions (max 50). Art systems cluster at 17–31. Self-evolving systems reach 33–39. Gen-emerge bridges both at 33. Scores represent the system's current state (April 2026), not potential.
Key Insight from the Autonomy Spectrum

Most art systems are unbalanced: AARON scores high on Generation (8) and Identity (7) but zero on Self-Improvement. CAN scores 10 on Generation but zero on Identity. Gen-emerge is designed for balanced autonomy across all five dimensions — scoring 7–8 on four dimensions. The remaining gap is A5 (Self-Improvement): the ability to modify its own architecture. This is the trajectory indicated by DGM and AI Scientist: not just generating better art, but evolving the process of making art.

10. Expanded Comparative Analysis

ProjectCategoryPeriodGenerationFeedbackDiversity MechanismConvergence Problem
AARONA1973–2016Rule-based (Lisp)Author manuallyManual rule updatesAuthor style lock-in
Karl SimsA1991–1997Genetic algoInteractive (human)User diversityFatigue + beauty bias
Electric SheepA1999–presentFractal FlameCrowd voting (450K)Shepherd submissionsMedian taste
Painting FoolA2001–presentMulti-techniqueSelf-eval + environmentCapability expansionAuthor bias
CloudPainterB2005–presentRL + physical paintCamera → neural evalPhysical randomnessTarget image lock-in
PicbreederA2007–2021CPPN-NEATInteractive + branchBranching lineagesCommunity decay
D.O.U.G.B2015–presentGesture mirroring + RLHuman body + EEGReal-time human inputArtist style boundary
AIVAC2016–presentDeep learning (music)Rule-based + ML evalMulti-era trainingRomantic median
CAN/AICANA2017Modified GANAdversarialStyle ambiguity lossMode collapse
Abraham/EdenA2017–presentCLIP+VQGAN → SDDAO governanceDecentralized dataModel bias
ArtbreederA2018–presentStyleGAN/BigGANUser slidersCommunity branchingGAN aesthetic ceiling
ObviousA2018GAN (portrait)Curatorial selectionDataset choiceSingle dataset lock-in
LSI+QDHFA2020–2024SD + MAP-ElitesCLIP + human cal.QD archiveArchive saturation
BottoA2021–presentSD + taste modelDAO voting (5K+)Volume + periodsTaste model convergence
Holly+B2021–presentVoice modelDAO endorsementOpen input diversitySingle-voice space
Anadol/LNME2023–presentCustom neural netArtist curationPurpose-built trainingNature data ceiling
AI ScientistD2024–presentLLM pipelineAutomated reviewResearch diversitySafe incrementalism
Multi-Agent StoriesC2024–presentMulti-LLM rolesEditor agentRole diversityLLM style homogeneity
DGMD2025Self-modifying codeBenchmark evalEvolutionary searchBenchmark overfitting
POET/OMNID2019–2024Co-evolutionEnvironment co-adaptsOpen-ended complexityCompute scaling

11. Synthesis: Updated Universal Patterns

11.1 Seven Universal Convergence Patterns (expanded from five)

The expanded review of 22 projects reveals two additional convergence patterns beyond the original five:

SEVEN UNIVERSAL CONVERGENCE PATTERNS P1: Single-Source Bias 1 author/model → 1 style AARON, P.Fool, Obvious → Multi-model required P2: Median-Taste Crowd → regression to mean E.Sheep, Botto, AIVA → Favorite ≠ repeat P3: Objective Kills Novelty Targeted < random branching Picbreeder, CAN, AI Scientist → QD approach (B6) P4: Volume vs Architecture 70K/week → statistical div. Botto vs gen-emerge → Architectural diversity P5: Model Ceiling 1 model = 1 latent space Artbreeder, Anadol/LNM → Multi-model (η, θ) NEW PATTERNS IDENTIFIED IN EXPANDED REVIEW P6: Domain Isomorphism The SAME convergence patterns appear in music (AIVA), writing (COLLABSTORY), and science (AI Scientist). → Solutions proven in one domain transfer to others P7: Safe Incrementalism Self-evaluating systems avoid risk: AI Scientist produces safe papers, Botto taste model avoids surprises. → Explicit novelty budget required (Martingale Score) gen-emerge addresses all 7 patterns architecturally Multi-model (P1, P5) · Favorite ≠ repeat (P2) · QD (P3) · Architecture over volume (P4) · Cross-domain (P6) · Martingale (P7)

11.2 Updated Anti-Convergence Mechanisms

#MechanismSource ProjectsGen-Emerge Implementation
1External injectionElectric Sheep (shepherds), Botto (creative periods), P.Fool (CUBRIC environment)T4d — curriculum from external data
2Branching / stepping stonesPicbreeder, Artbreeder, POETB11 — stepping stones archive
3Adversarial style pressureCAN/AICAN, COLLABSTORY, CloudPainterT4e/T4f, ε architecture
4QD archiveLSI+QDHF, Lenia+QDB6 — QD-score + coverage
5Multi-model diversityBotto, Artbreeder, multi-agent storiesη (ensemble), θ (HACN clusters)
6Human-calibrated metricsQDHF, Artbreeder genesT2e — QDHF-calibrated descriptors
7Physical groundingCloudPainter (camera loop), D.O.U.G. (body), Holly+ (voice)Future: embodied output medium
8Co-evolutionary pressurePOET, OMNI, DGMFuture: co-evolving constraint ontology

11.3 Updated Unsolved Problems

Six problems remain unsolved across all 22 reviewed projects:

  1. Long-term diversity without external intervention. No system has maintained diversity autonomously beyond several hundred cycles. Gen-emerge's FadeMem + Martingale Score is the most promising approach but unproven at scale.
  2. Formalized forgetting. No project has implemented formalized forgetting. FadeMem (B4) has no precedent.
  3. Multi-agent with different models in adversarial roles. Multi-agent stories use the same LLM in different roles. Gen-emerge's ε/η/θ with genuinely different models would be first.
  4. Stagnation detection. No project has a formal stagnation detector. The Martingale Score (B9) has no analogs.
  5. Self-modifying creative architecture. DGM achieves self-modification for coding tasks; no art system has attempted it. This is gen-emerge's long-term frontier.
  6. Cross-domain creative transfer. No system transfers creative strategies between visual art, music, and writing. Gen-emerge's architecture is theoretically domain-agnostic but untested outside visual art.

12. Updated Catalog of Mechanisms and Design Patterns

12.1 Generative Mechanics

Overnight batch production (AARON)
Cohen left AARON running overnight, reviewing 100+ images in the morning. Removing real-time observation eliminates the temptation to micromanage. For gen-emerge: a formalized "batch overnight → morning surprise dashboard" highlighting the most unexpected results by novelty.
Style ambiguity as objective (CAN)
If a CLIP-based style classifier assigns softmax > 0.7 for any style class, this signals convergence. If no class exceeds 0.3, this is the sweet spot. Implementable as a "style ambiguity bonus" in gen-emerge scoring.
Perception-action-evaluation loop (CloudPainter)
The robot photographs its own canvas after each stroke and adjusts the next stroke accordingly. Physical randomness (paint dripping, brush wearing) introduces genuine noise that digital systems lack. For gen-emerge: consider injecting structured randomness at the generation level — "happy accidents" as a diversity mechanism.
Environmental embedding (Painting Fool CUBRIC)
The AI artist embedded in a real institution produces context-responsive art — not generic generation but art rooted in a specific place and time. For gen-emerge: the Snapshot pipeline is an environmental embedding at a lower fidelity. The CUBRIC model suggests richer environmental input could produce richer output.
Data-as-medium (Anadol LNM)
The training dataset IS the creative medium. Nature data → nature aesthetic. WikiArt → art aesthetic. For gen-emerge: curating what goes into the reference corpus is as important as designing the architecture. Domain-specific models (like LNM) outperform general models for specific aesthetics.
Co-evolutionary complexity ladder (POET/OMNI)
Instead of fixed challenges, co-evolve the challenge alongside the solver. For gen-emerge: the constraint ontology could evolve — as the system masters one aesthetic space, new dimensions emerge automatically, preventing plateauing.

12.2 Social and Identity Mechanics

Shepherds (Electric Sheep)
Among 450K passive participants, ~5–20 active shepherds inject novel genetic material. For gen-emerge: formalize the "shepherd" role as periodic exogenous shock.
13-year temporal commitment (Abraham)
Art systems need time to develop identity. Abraham's covenant frames AI art as a temporal process, not a product. For gen-emerge: the long-term identity evolution (accumulated FadeMem, QD archive growth) is the system's primary artistic contribution — the journey, not any single image.
Public persona (Botto/Otto)
Botto's Otto agent demonstrates that an autonomous artist's identity extends beyond its visual output into social interaction. For gen-emerge: the research articles, the map, the system's "voice" — these constitute an identity layer that goes beyond images.
Consent layer (Holly+/Spawning)
As AI art systems mature, the consent/provenance question becomes critical. Spawning builds tools for opt-in/opt-out. For gen-emerge: transparency about training data and reference artists is both ethical and practically useful for positioning.
Embodied co-creation (D.O.U.G.)
The human body as input device blurs the boundary between artist and tool. EEG-linked painting is the extreme: thought → art, with the AI as intermediary. For gen-emerge: implicit feedback (engagement patterns) could supplement the 8 explicit channels.

12.3 Economic and Distribution Mechanics

Weekly auction cadence (Botto)
$5M+ revenue over 4 years with 1 work/week cadence. For gen-emerge: from N generated per cycle, only the best enters the "canonical collection." Scarcity creates value.
Institutional trajectory (Anadol)
Anadol's path: gallery → museum commission (MoMA) → dedicated museum (Dataland). This is a proven monetization ladder for AI art at scale. For gen-emerge: the research articles and dome analysis establish institutional credibility before market entry.
Attribution controversy as narrative (Obvious)
The $432K Obvious sale was driven partly by the controversy itself — the narrative of "who is the artist?" was more valuable than the image. For gen-emerge: the system's unique architecture IS the narrative. Multi-agent, adversarial, self-evolving — this is the story that sells.

13. Positioning of Gen-Emerge (Updated)

Based on this expanded systematic review of 22 projects across five categories, gen-emerge is identified as the first system that simultaneously:

  1. Utilizes LLM + image model in a semantic-to-visual pipeline (not GAN, rules, or fractals)
  2. Operates in a multi-agent multi-model architecture with adversarial roles
  3. Implements formalized forgetting (FadeMem) — unprecedented in any creative system
  4. Applies a QD approach to art generation with a single human
  5. Incorporates stagnation detection (Martingale Score) — no analog in any reviewed system
  6. Develops a persistent artistic identity through accumulated memory and evolving ontology
  7. Maintains a research apparatus that contextualizes its own work in art history (DOME analysis, this review)
Updated Positioning Statement

The nearest analog remains Botto, which resolves diversity through volume (70K/week) and crowd (5K+ voters). Gen-emerge must solve the same problem through architecture at 1–4 images/cycle with a single human. The expanded review reveals that gen-emerge's architecture is convergent with emerging patterns across domains: multi-agent adversarial roles (validated in literature), QD approaches (validated in evolutionary computation), and environmental embedding (validated by Painting Fool's CUBRIC residency). The primary gap is A5: self-improvement — the frontier opened by Sakana's DGM. An art system that evolves its own creative architecture would be genuinely unprecedented.

The Trajectory

Phase 1 (current): Multi-agent multi-model art generation with QD + FadeMem. Unique combination of known mechanisms.
Phase 2 (near-term): Co-evolving constraint ontology (POET-inspired). The aesthetic space itself evolves, preventing plateauing.
Phase 3 (long-term): Self-modifying creative architecture (DGM-inspired). The system discovers better agent configurations, scoring functions, and diversity mechanisms autonomously. The system designs its own creative process.

References

  1. Cohen, H. "Mind, Machine, and Creativity: An Artist's Perspective." Leonardo, 2014. PMC4265294
  2. Stanley, K.O. "Picbreeder: A Case Study in Collaborative Evolutionary Exploration of Design Space." Evolutionary Computation, 19(3), 2011. MIT Press
  3. Elgammal, A. et al. "CAN: Creative Adversarial Networks, Generating Art by Learning About Styles and Deviating from Style Norms." 2017. arXiv:1706.07068
  4. Colton, S. "The Painting Fool: Stories from Building an Automated Painter." In Proc. ICCC 2015. ICCC 2015
  5. Colton, S. "The Painting Fool: Stories from Building an Automated Painter." In Proc. ICCC 2011. ICCC 2011
  6. Kogan, G. "Artist in the Cloud: Towards the Summit of Visual Creativity." NeurIPS 2019 Workshop on Machine Learning for Creativity and Design. PDF
  7. Caselles-Dupré, H. et al. "OnlyFlow: Optical Flow based Motion Conditioning for Video Diffusion Models." 2024. arXiv:2411.10501
  8. Klingemann, M. et al. "Botto: A Decentralized Autonomous Artist." NeurIPS ML4CD Workshop, 2022. PDF
  9. Ding, L. et al. "Quality Diversity through Human Feedback." 2023. arXiv:2310.12103
  10. Fontaine, M.C. & Nikolaidis, S. "Differentiable Quality Diversity." 2021. arXiv:2106.03894
  11. Chakrabarty, T. et al. "CollabStory: Multi-LLM Collaborative Story Generation and Authorship Analysis." 2024. arXiv:2406.12665
  12. Zhou, Y. et al. "StoryWriter: A Multi-Agent Framework for Long Story Generation." 2025. arXiv:2506.16445
  13. Lu, C. et al. "The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery." 2024. arXiv:2408.06292
  14. Yamada, Y. et al. "The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search." 2025. arXiv:2504.08066
  15. Zhang, J. et al. "Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents." 2025. arXiv:2505.22954
  16. Wang, R. et al. "Paired Open-Ended Trailblazer (POET): Endlessly Generating Increasingly Complex and Diverse Learning Environments and Their Solutions." 2019. arXiv:1901.01753
  17. Wang, R. et al. "Enhanced POET: Open-Ended Reinforcement Learning through Unbounded Invention of Learning Challenges and their Solutions." 2020. arXiv:2003.08536
  18. Zhang, J. et al. "OMNI: Open-endedness via Models of human Notions of Interestingness." ICLR 2024. arXiv:2306.01711
  19. Faldor, M. et al. "Toward Artificial Open-Ended Evolution within Lenia using Quality-Diversity." 2024. arXiv:2406.04235
  20. Chan, B.W.-C. "Lenia — Biology of Artificial Life." Complex Systems, 2020. arXiv:1812.05433
  21. Draves, S. "The Electric Sheep Screen-Saver: A Case Study in Aesthetic Evolution." EvoMUSART 2005. scottdraves.com
  22. Berlyne, D.E. Aesthetics and Psychobiology. Appleton-Century-Crofts, 1971.
  23. Schmidhuber, J. "Gödel Machines: Fully Self-Referential Optimal Universal Self-improvers." 2003. arXiv:cs/0309048
  24. Sims, K. "Artificial Evolution for Computer Graphics." SIGGRAPH, 1991. karlsims.com
  25. Stanley, K.O. & Lehman, J. Why Greatness Cannot Be Planned: The Myth of the Objective. Springer, 2015.

Project Websites

AARON · Karl Sims · Electric Sheep · Picbreeder · AICAN · The Painting Fool · Abraham · Eden.art · Artbreeder · Obvious · Botto · Botto Studio · CloudPainter · Sougwen Chung · Holly+ · AIVA · Sakana AI · Refik Anadol · DATALAND · Mario Klingemann · QD Papers Collection · Lenia

1. Методология отбора и таксономия

Исходный обзор (февраль 2026) анализировал 10 знаковых проектов в области автономного генеративного искусства. Расширенная редакция расширяет охват за пределы визуального искусства, покрывая все домены автономных креативных систем — музыку, литературу, геймдизайн, научные открытия — и вводит новую аналитическую рамку: Спектр автономности.

Из приблизительно 50 выявленных проектов и экспериментов отобраны 22 по шести критериям: (1) автономность — серийная генерация без ручного запуска; (2) обратная связь — механизм, влияющий на последующие поколения; (3) непрерывная эволюция — работа во времени с накоплением истории; (4) проблема разнообразия — столкновение с конвергенцией; (5) масштаб — результаты за пределами прототипа; (6) кросс-доменная релевантность — уроки, переносимые на gen-emerge.

Проекты организованы в пять категорий:

КатегорияПроектыКлючевой вопрос
A. Автономное визуальное искусствоAARON, Sims, Electric Sheep, Picbreeder, CAN, Painting Fool, Abraham/Eden, Artbreeder, Botto, LSI+QDHF, ObviousКак система генерирует искусство без запуска человеком каждой работы?
B. Человеко-AI коллаборативное искусствоCloudPainter, Sougwen Chung / D.O.U.G., Holly Herndon / Holly+Чем сотворчество человеческого тела и AI отличается от чистой автономии?
C. Кросс-доменные креативные AIAIVA (музыка), мультиагентные нарративные системы (литература)Переносятся ли паттерны автономного творчества между медиа?
D. Самосовершенствующиеся AI-системыSakana AI Scientist, Darwin Gödel Machine, POET/OMNIМожет ли AI улучшать сам себя — и что это означает для искусства?
E. Индивидуальные AI-художникиRefik Anadol / LNM, Mario KlingemannКак художники-технологи масштабируют своё видение через AI?

A. Автономные системы визуального искусства

Системы, генерирующие визуальное искусство с минимальным или нулевым участием человека в каждой работе. Основная линия предков gen-emerge.

2. Эра до глубокого обучения

2.1 AARON (Гарольд Коэн, 1973–2016)

AARON признаётся первой долгоживущей автономной арт-системой. Разработанная художником Гарольдом Коэном как экспертная система на C/Lisp на протяжении 43 лет, она автономно генерировала рисунки и картины с использованием кодифицированных знаний о композиции, перспективе и анатомии.

Система прошла приблизительно 60 итераций: от абстрактных линий (1970-е) через фигуры и пространства (1980-е) к автономному выбору цвета (1990-е) и абстрактной живописи (2000-е).

Паттерн конвергенции

AARON конвергировал к собственному стилю Коэна. К 2009 году Коэн пережил творческий кризис и вернулся к ручной живописи поверх выходов AARON, осознав: «творчество заключалось не в программисте и не в программе по отдельности, а в диалоге между программой и программистом».

Урок для gen-emerge: одноавторская система = неизбежная конвергенция к стилю автора. 43 года итераций не преодолели single-source bias. Необходима мультиагентность.

2.2 Карл Симс: Genetic Images / Galápagos (1991–1997)

Пионерский проект по эволюции визуальных форм посредством генетических алгоритмов. Генотип — дерево математических функций; фенотип — изображение, вычисленное для каждого пикселя.

Паттерн конвергенции: интерактивная эволюция ограничена утомлением пользователя. Конвергенция к «привлекательным по умолчанию» паттернам.
Урок для gen-emerge: человек в цикле вносит смещение быстрее, чем алгоритм. «Фаворит ≠ делай больше» — прямой ответ.

2.3 Electric Sheep (Скотт Дрейвс, 1999–н.в.)

Распределённая система эволюции фрактальных анимаций, работающая 27 лет. 450 000+ компьютеров; массовое голосование создаёт аттрактор медианного вкуса. Разнообразие поддерживается через инъекции «пастухов» — ~5–20 активных участников.

Урок для gen-emerge: даже с 450К участниками, без активной внешней инъекции система конвергирует к медианному вкусу. Роль «пастуха» — прототип экзогенной инъекции ограничений (T4d).

2.4 Picbreeder (Стэнли и соавт., 2007–2021)

Платформа коллаборативной эволюции изображений с CPPN-NEAT. Ключевая инновация — «ветвление». Стэнли доказал: «преследование цели ограничивает эволюцию» — изображения через свободное ветвление не могли быть переоткрыты при целенаправленном поиске.

Урок для gen-emerge: QD-подход (B6) подтверждён. Ветвление = предшественник stepping stones (B11).

3. Эра GAN

3.1 CAN / AICAN (Elgammal и соавт., 2017)

Генератор получает два противоречивых сигнала: «выглядит ли как искусство?» и «к какому стилю это?» (максимизация неопределённости). Теория Берлайна: максимум эстетического удовольствия при умеренной новизне.

Урок для gen-emerge: два противоречивых сигнала CAN — прототип MAE-триплета (ε). «Бонус за стилевую неопределённость» реализуем в скоринге.

3.2 The Painting Fool (Саймон Колтон, 2001–н.в.)

Программа-«художник» с тремя свойствами: мастерство, оценка, воображение. Читает газеты, определяет настроение дня, может отказаться рисовать.

Обновление 2024 — резиденция CUBRIC: Painting Fool стал первым виртуальным художником-резидентом в центре нейровизуализации CUBRIC (Кардиффский университет). Целый год система работала автономно внутри здания, наблюдая за исследователями и создавая работы без инструкций. Это ближайший аналог конвейера gen-emerge Snapshot → Ontology: среда → внутреннее состояние → творческий выход.

Урок для gen-emerge: встраивание в среду производит более богатые выходы, чем генерация по промптам. Конвергенция сохраняется даже с инпутом от среды.

3.3 Abraham / Eden (Джин Коган, 2017–н.в.)

«Автономный искусственный художник» — sovereign creative spirit через multi-party computation.

Обновление 2025 — 13-летний ковенант: С октября 2025 Abraham вошёл в 13-летний «творческий ковенант». Eden.art — публичная галерея. «Первые работы Авраама» на выставке AUTOMATA в Лос-Анджелесе — первое соло AI-художника с формальным долгосрочным контрактом. Ковенант — это AI-искусство как временное обязательство, не одноразовый эксперимент.

Урок для gen-emerge: 13-летний ковенант напрямую валидирует темпоральный подход gen-emerge. Арт-системам нужно время для развития идентичности.

3.4 Artbreeder / Ganbreeder (Джоэл Саймон, 2018–н.в.)

14M+ пользователей, 300M+ изображений. Ген-слайдеры + скрещивание в латентном пространстве.

Урок для gen-emerge: одномодельное латентное пространство = потолок разнообразия. Мультимодельная архитектура = преодоление потолка.

3.5 Obvious Collective — Edmond de Belamy (2018)

Коллектив Obvious произвёл «Edmond de Belamy» на GAN, обученной на 15 000 портретах WikiArt. Работа продана на Christie's за $432 500 — 43× от верхней оценки. Однако проект — кейс-стади проблемы атрибуции: код и данные были открыто опубликованы Робби Барратом.

Урок для gen-emerge: мультиагентная архитектура gen-emerge даёт чёткий ответ на вопрос авторства: система — это художник. Продажа за $432K также доказала: арт-рынок ценит нарратив и провенанс не менее эстетики.

4. Эра LLM + Diffusion

4.1 Botto (Марио Клингеманн / BottoDAO, 2021–н.в.)

Ближайший аналог gen-emerge. ~70 000 изображений/неделю; вкусовая модель отбирает 350 для голосования DAO; 1 каноническая работа/неделю → NFT на SuperRare.

Обновления 2024–2025:

  • p5.js инициатива: 22 алгоритма генеративного кода. Выставка SOLOS (февраль 2025). Расширение за пределы diffusion в процедурную эстетику.
  • Otto — Twitter-агент: AI-персона Botto в соцсетях. Первый случай публичной «личности» AI-художника.
  • LLM-тьюторинг по истории искусства: контекстное знание движений и техник для генерации.
  • Sotheby's: переход от крипто-нативного рынка к институциональному.
  • Планы мультиагентной архитектуры: конвергенция с подходом gen-emerge.
Критический урок: Botto решает разнообразие через объём (70К/неделю). Gen-emerge — через архитектуру (1–4/цикл). p5.js инициатива валидирует потенциал gen-emerge в code-based art. Планы Botto по мультиагентности независимо подтверждают тезис gen-emerge. Выручка $5M+ за 4 года подтверждает коммерческую жизнеспособность.

4.2 LSI + QDHF (Fontaine и соавт., 2020–2024)

Quality-Diversity (MAP-Elites) в латентном пространстве SD. QDHF-обученные метрики превосходят ручные оси, отражая человеческое восприятие «различного».

Урок для gen-emerge: прямой прототип T2e. Двойной отпечаток (T2d) — компромисс; QDHF — золотой стандарт.

B. Человеко-AI коллаборативные системы

Системы, где человеческое тело или присутствие — неотъемлемая часть креативного цикла.

5.1 CloudPainter (Пиндар Ван Арман, 2005–н.в.)

Робот-живописец с обратной связью через камеру: фотографирует холст после каждого мазка, оценивает результат через нейросеть, решает следующий мазок. 1000+ холстов за 20 лет. Первое место Robot Art 2018. Ключевая инновация — петля восприятие-действие-оценка в физическом мире: воплощённый интеллект, отсутствующий в цифровых генераторах.

Урок для gen-emerge: петля обратной связи на физическом уровне даёт неожиданные результаты. Архитектура «конкурирующих нейросетей» зеркалит adversarial агентов gen-emerge. Физическое воплощение — потенциальное направление для выходного медиума.

5.2 Sougwen Chung / D.O.U.G. (2015–н.в.)

Роботические руки, обученные на жестах художницы, рисуют в реальном времени в дуэте с человеком. D.O.U.G._1 → D.O.U.G._5: от имитации через импровизацию к независимости.

Spectral (2025): на WEF в Давосе — робот рисует по сигналам ЭЭГ. Мысль → нейросигнал → мазок робота. Самая глубокая связь человеческого сознания и AI-действия в искусстве.

Урок для gen-emerge: D.O.U.G. — принципиально другая модель автономии: AI не заменяет человека, а создаёт новую гибридную сущность. Для gen-emerge: имплицитные сигналы (время взаимодействия, паттерны просмотра) могут быть столь же мощными, как 8 явных каналов.

5.3 Holly Herndon / Holly+ (2021–н.в.)

AI-модель голоса Holly Herndon, которую может использовать любой. DAO голосует за одобрение сгенерированных работ. Spawning строит «уровень согласия» для AI-тренировки — инструменты opt-in/opt-out.

Starmirror (KW Berlin, 2025): инсталляция, трансформирующая голоса посетителей через Holly+ в реальном времени.

Урок для gen-emerge: Holly+ инвертирует обычную модель: человек становится сырьём, AI — механизмом трансформации. Вопрос согласия/провенанса релевантен: по мере развития идентичности системы нужна рамка работы со ссылками на обучающие данные.

C. Кросс-доменные автономные креативные системы

Проявляются ли те же паттерны конвергенции в музыке, литературе, геймдизайне? Ответ — да.

6.1 AIVA — автономная музыкальная композиция (2016–н.в.)

Первый виртуальный композитор, признанный SACEM. Обучена на 30 000+ классических партитур. Музыка технически профессиональна, но критикуется как «приятная, но производная» — компетентные имитации романтизма XIX века. Это аттрактор медианного вкуса (P2) в музыке.

Кросс-доменная валидация: P2 доменно-агностичен — появляется в музыке точно так же, как в визуальном искусстве. Лекарство — архитектурное разнообразие или явное давление новизны (QD).

6.2 Мультиагентные нарративные системы (2023–2025)

Мультиагентные LLM-системы для автономной генерации нарративов — прямая структурная параллель архитектуре gen-emerge:

  • COLLABSTORY (2024): Агенты LLM в ролях протагониста, антагониста, нарратора, редактора. «Редактор» — прямой аналог adversarial оценки gen-emerge.
  • StoryWriter (2024): Иерархическая система: планировщик → писатели → критик. Зеркалит Proposer → Generator → Evaluator.
  • Эксперименты «1001 ночь»: Персонажи с устойчивыми персонами генерируют нарративы на протяжении сотен эпизодов.
Кросс-доменная валидация: мультиагентные adversarial архитектуры конвергируют независимо: визуальное искусство (gen-emerge), литература (COLLABSTORY), код (AI Scientist). Это универсальное решение проблемы креативной конвергенции.

D. Самосовершенствующиеся AI-системы

Системы, улучшающие собственную архитектуру, а не только выходы. Фронтир автономного интеллекта.

7.1 Sakana AI — The AI Scientist (2024–2025)

Автоматизированная система полного цикла научного исследования: гипотезы → эксперименты → код → анализ → статьи. Версия 2 — статьи уровня воркшопов топ ML-конференций. Опубликовано в Nature.

Урок для gen-emerge: AI Scientist валидирует: мультишаговые автономные конвейеры с оценкой способны производить подлинную новизну. Структурная параллель: гипотеза (промпт) → эксперимент (генерация) → анализ (скоринг) → итерация. «Безопасный инкрементализм» в науке = конвергенция в искусстве.

7.2 Sakana AI — Darwin Gödel Machine (2025)

AI-система, модифицирующая собственный код для улучшения производительности. На SWE-bench: 20% → 50% через самомодификацию. Первое практическое приближение к теоретической машине Гёделя Шмидхубера.

Урок для gen-emerge: DGM открывает радикальную траекторию: арт-система, которая не просто генерирует лучшее искусство, а эволюционирует собственную архитектуру генерации. Это «gen-emerge Фаза N» — система проектирует собственный креативный процесс.

7.3 Open-Ended Evolution: POET, OMNI, QD-Lenia

POET: совместная эволюция сред и агентов. OMNI (ICLR 2024): открытая эволюция в многоцелевых пространствах. QD + Lenia: вечно расширяющаяся библиотека «форм жизни».

Урок для gen-emerge: POET: совместная эволюция вызова и решателя. Gen-emerge мог бы совместно эволюционировать пространство ограничений вместе с генерацией. Не просто заполнение архива, а рост размерностей архива.

E. Индивидуальные AI-художники-технологи

Художники, построившие значимые AI-системы как расширение своей творческой практики.

8.1 Refik Anadol / Large Nature Model (2023–н.в.)

Наиболее коммерчески успешная практика на пересечении AI и крупномасштабного искусства. «Дата-скульптуры» — иммерсивные инсталляции, визуализирующие миллионы точек данных.

  • Unsupervised (MoMA, 2023): первый AI-заказ крупного музея. Живая скульптура из 200+ лет коллекции MoMA.
  • Large Nature Model: open-source модель на миллионах природных изображений и звуков. Не адаптация коммерческой модели, а специально построенная для эстетики природы.
  • Dataland (весна 2026): первый музей, посвящённый AI-искусству.
  • Christie's: коллаборация с Месси — $1.87M.
Урок для gen-emerge: парадигма «данные-как-медиум»: художественное высказывание — в тренировочных данных, не только в архитектуре. Целевая модель (LNM) валидирует доменно-специфичный дизайн агентов gen-emerge. Коммерчески: $1.87M + MoMA + музей = жизнеспособный путь.

8.2 Mario Klingemann (2015–н.в.)

Создатель Botto. «Memories of Passersby I» (2018) — первая AI-работа на Sotheby's ($51 000). Техника «Neural Glitch» — намеренная эксплуатация ошибок нейросетей как эстетической стратегии. Мета-творческая практика: художник создаёт условия для возникновения неожиданного искусства из машинных сбоев.

Урок для gen-emerge: Neural Glitch — художественный аналог новизнового скоринга gen-emerge: ошибки как фичи, не баги. Подход «условия для эмерджентности» философски совпадает с архитектурой gen-emerge.

9. Спектр автономности: новая аналитическая рамка

Ключевой вклад расширенного обзора — Спектр автономности, пятимерная рамка для сравнения креативных AI-систем. Каждое измерение 0–10:

ИзмерениеОпределение0 (минимум)10 (максимум)
A1: Автономия генерацииМожет ли система создавать работы без запуска человеком?Человек запускает каждуюНепрерывная генерация
A2: Автономия оценкиМожет ли система оценить качество собственных выходов?Человек оценивает всеПолная самооценка
A3: Автономия эволюцииМожет ли система менять творческое направление?Фиксированный стильСамомодификация целей
A4: Устойчивость идентичностиРазвивает ли система узнаваемую идентичность?Нет памяти между сессиямиКогерентная идентичность на годы
A5: СамосовершенствованиеМожет ли система улучшать собственную архитектуру?Архитектура фиксированаСамомодифицирующаяся
Ключевой инсайт Спектра автономности

Большинство арт-систем несбалансированы: AARON высоко на генерации (8) и идентичности (7), но 0 на самосовершенствовании. Gen-emerge проектируется для сбалансированной автономности по всем пяти измерениям — 7–8 по четырём. Оставшийся разрыв — A5 (самосовершенствование): способность модифицировать собственную архитектуру. Это траектория DGM и AI Scientist.

10. Расширенный сравнительный анализ

ПроектКат.ПериодГенерацияОбратная связьМеханизм разнообразияКонвергенция
AARONA1973–2016На правилахАвтор вручнуюОбновление правилСтиль автора
SimsA1991–97Генетич. алг.ИнтерактивнаяРазнообразие пользователейУтомление + bias
E.SheepA1999–н.в.Fractal FlameГолосование (450K)Инъекции пастуховМедианный вкус
P.FoolA2001–н.в.Мульти-техникаСамооценка + средаРасширение возможностейBias автора
CloudPainterB2005–н.в.RL + физ. краскаКамера → нейросетьФизическая случайностьЦелевое изображение
PicbreederA2007–2021CPPN-NEATИнтеракт. + ветвл.Ветвление линийУгасание сообщества
D.O.U.G.B2015–н.в.Gesture + RLТело + ЭЭГReal-time человеч. вводГраница стиля худ.
AIVAC2016–н.в.Deep learningПравила + MLМульти-эпохальн. тренировкаРомантич. медиана
CANA2017Модиф. GANAdversarialСтилевая неопредел.Mode collapse
AbrahamA2017–н.в.CLIP+VQGAN→SDDAOДецентрализацияBias модели
ArtbreederA2018–н.в.StyleGANСлайдерыВетвление сообществаGAN-потолок
ObviousA2018GAN (портрет)Курат. отборВыбор датасетаОдин датасет
LSI+QDHFA2020–24SD + MAP-ElitesCLIP + чел.QD-архивНасыщение архива
BottoA2021–н.в.SD + taste modelDAO (5K+)Объём + периодыКонв. вкус. модели
Holly+B2021–н.в.Голос. модельDAO одобрениеОткрытость вводаОдно-голос. простр.
AnadolE2023–н.в.Кастом. нейросетьКурация художникаЦелевая тренировкаПотолок данных
AI ScientistD2024–н.в.LLM pipelineАвтомат. ревьюРазнообр. направленийБезопасн. инкремент.
Мульти-аг. нарр.C2024–н.в.Мульти-LLM ролиАгент-редакторРолевое разнообразиеLLM-гомогенность
DGMD2025Самомодиф. кодБенчмаркЭволюц. поискПереобучение на бенчм.
POET/OMNID2019–24Со-эволюцияСреда адаптируетсяОткрытая сложностьМасштаб вычислений

11. Синтез: обновлённые универсальные паттерны

11.1 Семь паттернов конвергенции (расширено с пяти)

Расширенный обзор 22 проектов выявил два дополнительных паттерна конвергенции:

  • P1–P5: те же что и в исходном обзоре (single-source bias, медианный вкус, цель убивает новизну, объём vs. архитектура, потолок модели).
  • P6: Доменный изоморфизм. Те же паттерны конвергенции проявляются в музыке (AIVA), литературе (COLLABSTORY) и науке (AI Scientist). Решения, доказанные в одном домене, переносятся в другие.
  • P7: Безопасный инкрементализм. Самооценивающие системы избегают риска: AI Scientist пишет «безопасные» статьи, вкусовая модель Botto избегает сюрпризов. → Необходим явный бюджет новизны (Martingale Score).

11.2 Обновлённые механизмы противодействия

#МеханизмПроекты-источникиРеализация в gen-emerge
1Внешняя инъекцияElectric Sheep, Botto, P.Fool (CUBRIC)T4d
2Ветвление / stepping stonesPicbreeder, Artbreeder, POETB11
3Adversarial давление стиляCAN, COLLABSTORY, CloudPainterT4e/T4f, ε
4QD-архивLSI+QDHF, Lenia+QDB6
5Мультимодельное разнообразиеBotto, Artbreeder, мульти-аг. нарративыη, θ
6Калиброванные метрикиQDHF, ArtbreederT2e
7Физическое заземлениеCloudPainter, D.O.U.G., Holly+Будущее: воплощённый медиум
8Со-эволюционное давлениеPOET, OMNI, DGMБудущее: со-эволюция онтологии

11.3 Нерешённые проблемы (обновлено)

  1. Долгосрочное разнообразие без вмешательства. FadeMem + Martingale Score — наиболее обещающий подход, но не доказан в масштабе.
  2. Формализованное забывание. FadeMem (B4) не имеет прецедента.
  3. Мультиагентность с разными моделями в adversarial ролях. ε/η/θ gen-emerge были бы первыми.
  4. Детекция стагнации. Martingale Score (B9) не имеет аналогов.
  5. Самомодифицирующаяся креативная архитектура. DGM достигает в коде; в искусстве не пробовал никто.
  6. Кросс-доменный перенос креативных стратегий. Архитектура gen-emerge теоретически доменно-агностична, но не проверена за пределами визуального искусства.

12. Обновлённый каталог механик

12.1 Генеративные механики

Ночное пакетное производство (AARON)
Режим «ночная генерация → утренний дашборд с сюрпризами».
Стилевая неопределённость как цель (CAN)
«Бонус за стилевую неопределённость» в скоринге.
Петля восприятие-действие-оценка (CloudPainter)
Структурированная случайность на уровне генерации — «счастливые случайности» как механизм разнообразия.
Встраивание в среду (Painting Fool CUBRIC)
Конвейер Snapshot — инпут от среды на более низком уровне верности. CUBRIC показывает: богаче среда → богаче выход.
Данные-как-медиум (Anadol LNM)
Курирование референсного корпуса столь же важно, как и дизайн архитектуры.
Со-эволюция сложности (POET/OMNI)
Онтология ограничений эволюционирует — новые эстетические измерения возникают автоматически.

12.2 Социальные механики и механики идентичности

Пастухи (Electric Sheep)
Формализация роли «пастуха» как периодического экзогенного шока.
13-летнее обязательство (Abraham)
Арт-системам нужно время. Долгосрочная эволюция идентичности — главный художественный вклад системы.
Публичная персона (Botto/Otto)
Идентичность автономного художника выходит за визуальный выход в социальное взаимодействие.
Уровень согласия (Holly+/Spawning)
Прозрачность тренировочных данных — и этическая, и практически полезная для позиционирования.
Воплощённое сотворчество (D.O.U.G.)
Имплицитная обратная связь может дополнить 8 явных каналов gen-emerge.

12.3 Экономические механики

Еженедельный аукцион (Botto)
$5M+ за 4 года. 1 работа/неделю. Дефицит создаёт ценность.
Институциональная траектория (Anadol)
Галерея → музейный заказ (MoMA) → собственный музей (Dataland). Доказанная лестница монетизации.
Контроверсия как нарратив (Obvious)
Уникальная архитектура gen-emerge — ЭТО нарратив, который продаёт. Мультиагентный, adversarial, саморазвивающийся.

13. Позиционирование Gen-Emerge (обновлено)

На основании расширенного обзора 22 проектов в пяти категориях gen-emerge идентифицируется как первая система, одновременно:

  1. Использующая LLM + image model в семантическом конвейере
  2. Работающая в мультиагентной мультимодельной архитектуре с adversarial ролями
  3. Реализующая формализованное забывание (FadeMem)
  4. Применяющая QD-подход к генерации искусства с одним человеком
  5. Включающая детекцию стагнации (Martingale Score)
  6. Развивающая устойчивую художественную идентичность через аккумулированную память
  7. Поддерживающая исследовательский аппарат, контекстуализирующий собственные работы (анализ DOME, данный обзор)
Обновлённое позиционирование

Ближайший аналог — Botto (объём 70К/неделю vs. архитектура gen-emerge). Расширенный обзор показывает: архитектура gen-emerge конвергентна с паттернами, возникающими в других доменах: мультиагентные adversarial роли (валидировано в литературе), QD-подходы (валидировано в эволюционных вычислениях), встраивание в среду (валидировано CUBRIC). Ключевой разрыв — A5: самосовершенствование.

Траектория

Фаза 1 (текущая): Мультиагентная мультимодельная генерация с QD + FadeMem.
Фаза 2 (ближайшая): Со-эволюция онтологии ограничений (по модели POET).
Фаза 3 (долгосрочная): Самомодифицирующаяся креативная архитектура (по модели DGM). Система проектирует собственный креативный процесс.