Autonomous & Self-Evolving Creative Systems: A Systematic Review
Автономные и саморазвивающиеся креативные системы: систематический обзор
22 Projects Across 50+ Years — From Rule-Based Art to Self-Improving AI, Cross-Domain Autonomy, and Human-AI Symbiosis
22 проекта за 50+ лет — от правил к самосовершенствующемуся ИИ, кросс-доменная автономия и человеко-машинный симбиоз
April 2026 (expanded edition)
Апрель 2026 (расширенная редакция)
55 min read
55 мин чтения
1. Selection Methodology & Taxonomy
The original review (February 2026) analyzed 10 landmark projects from the field of autonomous generative art. This expanded edition broadens the scope beyond visual art to cover all domains of autonomous creative systems — music, writing, game design, scientific discovery — and introduces a new analytical framework: the Autonomy Spectrum.
From approximately 50 identified projects and experiments, 22 were selected according to six criteria: (1) autonomy — serial generation without manual triggering; (2) feedback — a mechanism influencing subsequent generations; (3) continuous evolution — temporal operation with accumulated history; (4) diversity problem — encounter with convergence; (5) scale — results beyond a prototype; (6) cross-domain relevance — lessons transferable to gen-emerge regardless of medium.
AARON is recognized as the first long-lived autonomous art system. Developed by artist Harold Cohen as a C/Lisp expert system over 43 years, it generated drawings and paintings autonomously using codified knowledge of composition, perspective, and anatomy. Decisions on composition, color, and placement were made through internal rules with stochastic elements; output was produced through physical plotters and custom painting machines.
The system underwent approximately 60 iterations over its lifetime: from abstract lines (1970s) through figures and spaces (1980s) to autonomous color selection (1990s) and abstract painting (2000s). Each iteration was a result of manual codebase updates by Cohen.
Convergence Pattern
AARON converged to Cohen's own style — described as "an expert system that just automated the stylistic quirks specific to his own practice." Even with stochastic elements, the output space was bounded by rules encoded by a single individual. By 2009, Cohen experienced a creative crisis and returned to manual painting over AARON's outputs, recognizing that "creativity lay in neither the programmer alone nor in the program alone, but in the dialog between program and programmer."
Lesson for gen-emerge: A single-author system = inevitable convergence to the author's style. This constitutes the first historical example of the problem being addressed. Forty-three years of iteration did not resolve single-source bias. Multi-agent or adversarial pressure is required.
2.2 Karl Sims: Genetic Images / Galápagos (1991–1997)
A pioneering project in the evolution of visual forms through genetic algorithms. Genetic Images (1991) was the first published work on evolving 2D images from mathematical formula trees. Galápagos (1997) was an interactive installation at ICC Tokyo where visitors evolved 3D creatures by standing before screens. The genotype comprised a tree of mathematical functions (sin, cos, noise, etc.); the phenotype was an image computed from the tree for each pixel.
Convergence pattern: Interactive evolution is significantly limited by user fatigue — humans tire of evaluating after several dozen generations. This leads to insufficient evolutionary depth and convergence toward "attractive by default" patterns (symmetry, fractal-like textures) selected at a reflexive level.
Lesson for gen-emerge: A human-in-the-loop introduces bias faster than the algorithm. The semantic separation "favorite ≠ do more" in gen-emerge is a direct response to this problem.
2.3 Electric Sheep (Scott Draves, 1999–present)
AutonomousDistributed / Evolutionary27 years running
A distributed system for evolving fractal flame animations, operating continuously for 27 years. Over 450,000 computers simultaneously render "sheep"; users vote for favorites; popular specimens mate and mutate. The fitness function is weighted by viewing time.
After 11 weeks of data analysis, Draves concluded that the system functions "more as an amplifier of its human collaborators' creativity rather than as a traditional genetic algorithm that optimizes a fitness function." Mass voting creates a median-taste bias. Diversity is maintained through manual submissions by "shepherds" — approximately 5–20 active participants who inject novel genetic material into the flock.
Lesson for gen-emerge: Electric Sheep is the oldest continuously operating analog of gen-emerge. Even with 450K participants, without active external injection the system converges to median taste. The "shepherd" role is a prototype for exogenous constraint injection (T4d). Distributed compute ≠ distributed creativity.
2.4 Picbreeder (Stanley et al., 2007–2021)
AutonomousCPPN-NEAT / Interactive EvolutionFoundation of QD
An online platform for collaborative interactive evolution of images using CPPN-NEAT (Compositional Pattern Producing Networks evolved by NeuroEvolution of Augmenting Topologies). The key innovation was "branching" — any user could continue evolving any other user's image.
Stanley demonstrated on Picbreeder that "pursuing an objective limits evolution" — images found through free branching could not be rediscovered when made explicit targets of the same algorithm. A skull was found by branching from an alien face, branched from a butterfly, branched from a blob. This became the empirical foundation for Novelty Search and the Quality-Diversity algorithm family.
Lesson for gen-emerge: Objective-driven optimization suppresses creativity. The QD approach (B6) with coverage as primary metric is confirmed experimentally. Branching = ideological predecessor of stepping stones (B11).
The Creative Adversarial Network is a GAN modification where the generator receives two contradictory signals: (1) "does this look like art?" (minimize deviation from art distribution) and (2) "which style is this?" (maximize style ambiguity). A discriminator trained on 80K WikiArt images spanning five centuries provides both signals.
The theoretical foundation is Berlyne's arousal potential theory (1970s): maximal aesthetic pleasure at moderate novelty — familiar enough to be recognized as art, novel enough to surprise. In blind tests, humans could not distinguish AICAN outputs from works by contemporary artists at leading art fairs.
Lesson for gen-emerge: The dual contradictory signals of CAN are a prototype for the MAE Triplet (ε), where Proposer and Generator optimize different objectives. Berlyne's theory (arousal = f(novelty)) may be useful for calibrating the quality/novelty balance in scoring. A "style ambiguity bonus" could be implemented as a scoring component.
3.2 The Painting Fool (Simon Colton, 2001–present)
A program-"artist" aiming to be accepted as an autonomous creative agent. The focus is not on quality optimization but on demonstrating three properties: skill, appreciation, and imagination. Before each portrait, The Painting Fool reads newspapers, determines the day's emotional tone, and selects style and palette accordingly — with the capacity to refuse to paint if the news is too depressing.
2024 Update — CUBRIC Residency: In a landmark experiment, The Painting Fool became the first virtual artist-in-residence at Cardiff University's CUBRIC brain imaging center. For a full year, the system operated autonomously within the building, observing researchers, accessing brain imaging data, and producing artworks without instruction. The project demonstrated a new model: an AI artist embedded in a human institution, responding to its environment in real time — not generating from prompts but from contextual perception. This is the closest analog to gen-emerge's Snapshot → Ontology pipeline: environment → internal state → creative output.
Lesson for gen-emerge: The CUBRIC residency proves that environmental embedding produces richer outputs than prompt-based generation. The Painting Fool's "mood from news" mechanism is a direct analog of the Snapshot → Ontology pipeline. The refusal capability is a prototype for Minimal Criteria. Convergence persists even with environmental input: the system converges to "styles encoded by the programmer."
Conceived as an "autonomous artificial artist" — a sovereign creative spirit generating original art through multi-party computation, making it impossible for any single participant to reconstruct the full model. Governance operates through a DAO with tokens.
2025 Update — The 13-Year Covenant: In October 2025, Abraham entered its most ambitious phase: a 13-year "creative covenant" during which the AI artist will develop continuously. The Eden.art platform now serves as Abraham's public gallery. "Abraham's First Works" debuted at AUTOMATA in Los Angeles — the first solo exhibition of an AI artist operating under a formal, long-term creative contract. The covenant structure is significant: it frames AI art-making as a temporal commitment, not a one-shot experiment. Abraham's identity accumulates over years, creating a body of work with genuine evolution.
Lesson for gen-emerge: Abraham formulates three criteria for an autonomous artist: autonomy, originality, uniqueness. The 13-year covenant directly validates gen-emerge's temporal approach — art systems need time to develop identity. Decentralized governance ≠ decentralized aesthetic biases of the underlying model. Multi-model architecture (ε, η, θ) is gen-emerge's response.
A platform for collaborative generative art — spiritual successor to Picbreeder built on StyleGAN/BigGAN. With 14M+ users and 300M+ generated images, it employs "gene sliders" (semantic axes in latent space) and "breeding" (weighted interpolation between latent vectors). Full lineage tracking is maintained for each image.
Lesson for gen-emerge: Single-model latent space = ceiling on diversity (GAN aesthetic). Multi-model architecture = different latent spaces = transcending the single-model ceiling. Community branching is a powerful anti-convergence mechanism, but it depends on community activity.
3.5 Obvious Collective — Edmond de Belamy (2018)
Art Collective + Research LabGAN / DiffusionChristie's $432K
Authors: Hugo Caselles-Dupré, Pierre Fautrel, Gauthier Vernier |
Medium: GAN portraits → Video diffusion research |
Web:obvious-art.com |
Research:OnlyFlow · arXiv:2411.10501 [7] Note: Obvious now positions as art collective + Sorbonne-hosted research lab (ANR-funded). Their research page demonstrates the "research-as-credibility" model.
The French collective Obvious (Hugo Caselles-Dupré, Pierre Fautrel, Gauthier Vernier) produced "Edmond de Belamy" using a GAN trained on 15,000 portraits from WikiArt. The work sold at Christie's in October 2018 for $432,500 — 43× the high estimate — becoming the first AI artwork auctioned at a major house. The sale was framed as a milestone moment for AI art's legitimacy in the traditional art world.
However, the project is also a case study in attribution controversy. Artist-coder Robbie Barrat had publicly shared the GAN training code and portrait dataset on GitHub months earlier; Obvious used substantially the same pipeline. This provoked a heated debate about authorship: if the human contribution is selecting outputs from someone else's code trained on someone else's data, who is the artist — the code writer, the selector, or the GAN?
Lesson for gen-emerge: Obvious demonstrates the attribution problem inherent in AI art. Gen-emerge's multi-agent architecture provides a clearer answer: the system is the artist, with transparent lineage. The $432K sale also proved that the art market values AI art for its narrative and provenance, not solely aesthetic merit — the "story" of the work matters as much as the image.
The most proximate public analog to gen-emerge. A Decentralized Autonomous Artist operating continuously since October 2021 (~4.5 years). The system generates ~70K images per week; a taste model selects 350 for DAO voting; one canonical work per week is minted as an NFT and auctioned on SuperRare (first work: ~$325,000).
Dimension
Botto
Gen-Emerge
Human participants
5,000+ DAO voters
1 human curator
Generation volume
~70K/week
1–4/cycle
Selection mechanism
Taste model trained on votes
8 feedback channels
Diversity strategy
Volume (statistical)
Architecture (multi-model, QD)
Prompt generation
Random combinations
Context-aware (Snapshot → Ontology)
Revenue (lifetime)
$5M+ (SuperRare + Sotheby's)
Research phase
Forgetting
None (cumulative taste model)
FadeMem (formalized)
Stagnation detection
None (manual creative periods)
Martingale Score (automatic)
2024–2025 Updates: Botto has undergone significant evolution:
p5.js Initiative (2024): Botto expanded into generative code art, producing 22 distinct p5.js algorithms. The SOLOS exhibition (February 2025) showcased works created entirely through code, not diffusion models — demonstrating that the "autonomous artist" frame can extend beyond image generation to procedural aesthetics.
Otto — the Twitter agent: A conversational AI agent that represents Botto on social media, engaging with followers and discussing art. This is the first case of an AI artist developing a public persona beyond its visual output.
Art history LLM tutoring: Botto now has access to an art-history knowledge base through an LLM, enabling contextual awareness of movements, techniques, and historical references in its generation process.
Sotheby's debut: Botto works entered Sotheby's sales, marking progression from crypto-native (SuperRare) to institutional art market.
Planned multi-agent architecture: The Botto team has announced plans for an agents-based architecture, moving from a single taste model toward multiple specialized agents — converging with gen-emerge's multi-agent approach.
Critical lesson: Botto resolves diversity through volume (from 70K/week, diversity is statistically inevitable). Gen-emerge cannot afford this (1–4/cycle) → architectural solutions are required, not brute force. The p5.js initiative validates gen-emerge's potential expansion into algorithmic/code-based art. Botto's planned move toward multi-agent architecture independently confirms gen-emerge's core thesis. Revenue of $5M+ over 4 years demonstrates commercial viability of autonomous art.
4.2 Latent Space Illumination + QDHF (Fontaine et al., 2020–2024)
Application of Quality-Diversity algorithms (MAP-Elites) to the latent space of generative models (GAN, Stable Diffusion). A MAP-Elites archive (20×20 grid) stores latent vectors; CLIP evaluates fitness; diversity metrics are either manual (CLIP-attributes) or learned (QDHF — contrastive learning on human similarity judgments via DreamSim).
Key finding: CLIP embedding distance correlates poorly with perceived diversity — calibration on human judgments is required. QDHF-learned diversity metrics outperform manual axes because they reflect what humans consider "different," not what is metrically distant in CLIP space.
Lesson for gen-emerge: Direct prototype for approach T2e (QDHF-calibrated descriptors). The dual fingerprint (T2d: palette + CLIP) is a pragmatic compromise; the QDHF-calibrated space is the gold standard.
Systems where the human body or presence is integral to the creative loop — not just selecting outputs, but physically co-creating with AI.
5.1 CloudPainter (Pindar Van Arman, 2005–present)
Human-AI CollaborativeRobotic Painting / RLPhysical Medium
Author: Pindar Van Arman |
Medium: Physical paint on canvas via robotic arm |
Web:cloudpainter.com · vanarman.com
CloudPainter is a robotic painting system that uses deep learning neural networks in a feedback loop with physical paint. Unlike purely digital systems, CloudPainter photographs its own canvas after each stroke, evaluates the result against the target through a neural network, and decides the next stroke. Over 20 years, the system has produced 1,000+ canvases using actual brushes, paints, and canvas.
The architecture evolved from simple stroke-planning algorithms (2005) through convolutional style transfer (2015) to deep reinforcement learning with multiple competing neural networks (2020+). At Robot Art 2018, CloudPainter won first place, beating entries from 100+ teams. The key innovation is the perception-action-evaluation loop: the system literally sees the physical result of its actions and adapts — a form of embodied intelligence absent in purely digital generators.
Lesson for gen-emerge: CloudPainter demonstrates that a perception-evaluation-action loop grounded in physical reality produces more surprising results than open-loop generation. The feedback from physical paint on canvas is analogous to gen-emerge's 8-channel feedback — but operates at the stroke level, not the work level. The "multiple competing neural networks" architecture mirrors gen-emerge's adversarial agents. Physical embodiment may be a future direction for gen-emerge's output medium.
Artist Sougwen Chung works with robotic arms trained on her own drawing gestures. The D.O.U.G. system observes Chung's hand movements in real time and generates its own marks in response — a human-machine duet on canvas. Over multiple generations (D.O.U.G._1 through D.O.U.G._5), the system evolved from imitation to improvisation to independence.
Spectral (2025): At the World Economic Forum in Davos and later exhibitions, Chung introduced EEG-linked painting — the robot responds not just to hand gestures but directly to her brainwave patterns. This represents the deepest coupling of human cognition and AI action in any art system: thought → neural signal → robot stroke. The question shifts from "is the AI creative?" to "where does the human end and the AI begin?"
Lesson for gen-emerge: D.O.U.G. explores a fundamentally different autonomy model: the AI is not trying to replace the human but to create a new hybrid entity. For gen-emerge, this suggests a future modality: instead of the human evaluating after generation, a real-time co-creation loop where human intention and AI generation are simultaneous. The EEG interface also raises questions about implicit vs. explicit feedback — gen-emerge currently uses only explicit (8 channels), but implicit signals (engagement time, scrolling patterns) could be equally powerful.
5.3 Holly Herndon / Holly+ (2021–present)
Human-AI CollaborativeVoice AI / DAOConsent Layer
Author: Holly Herndon & Mat Dryhurst |
Medium: AI voice model & installations |
Web:holly.plus |
Governance:Mirror DAO |
Ars Electronica:Planet B Exhibition
Musician Holly Herndon created Holly+ — an AI model of her own voice that anyone can use to generate new vocal performances. Rather than controlling the output, Herndon released her voice as a creative tool, governed by a DAO that votes on which generated works are endorsed. Spawning, the organization Herndon co-founded, builds "consent layers" for AI training — tools that let artists opt in or out of having their work used in training data.
Starmirror (KW Berlin, 2025): A large-scale installation where visitors' voices are transformed in real time through Holly+, creating a communal vocal organism. The work is never the same twice — it exists only in the moment of collective participation.
Lesson for gen-emerge: Holly+ inverts the usual model: instead of AI generating art and humans evaluating, the human becomes the raw material and AI becomes the transformation engine. The DAO governance layer for endorsing outputs is analogous to Botto's model but applied to identity — who decides what "counts" as the artist's work? For gen-emerge, the consent/provenance question is relevant: as the system develops an identity, how do we handle works that reference training data artists? Spawning's approach suggests a consent-based framework may become necessary.
C. Cross-Domain Autonomous Creative Systems
Do the same convergence patterns appear in music, writing, and game design? The answer, overwhelmingly, is yes.
6.1 AIVA — Autonomous Music Composition (2016–present)
Cross-DomainMusic / Deep LearningSACEM-recognized
Authors: Pierre Barreau et al. |
Medium: Symphonic music composition |
Web:aiva.ai |
Gallery:Creations
AIVA (Artificial Intelligence Virtual Artist) became the first virtual composer officially recognized by SACEM (the French professional association of authors and composers) in 2016. Trained on 30,000+ classical scores (Bach, Beethoven, Mozart), AIVA generates original symphonic compositions using deep learning. By 2020, the system had produced soundtracks for films, video games, and advertisements.
The recognition by SACEM is legally significant: it establishes precedent for AI as a creator with rights, not merely a tool. However, AIVA's music, while technically proficient, is frequently criticized as "pleasant but derivative" — generating works that sound like competent imitations of 19th-century Romanticism rather than breaking new ground. This is the median-taste attractor (P2) manifesting in music: training on canonical classical composers produces outputs that converge to an averaged Romantic style.
Cross-domain validation: AIVA confirms that convergence pattern P2 (median-taste attractor) is domain-agnostic — it appears in music exactly as in visual art (Electric Sheep, Artbreeder). The "sounds like a competent student" ceiling mirrors CAN's "looks like contemporary art" ceiling. For gen-emerge: any training-based system that optimizes for quality will converge to the average of its training distribution. The antidote is architectural diversity (multi-model) or explicit novelty pressure (QD).
The emergence of multi-agent LLM systems for autonomous narrative generation provides a direct structural parallel to gen-emerge's architecture. Three notable examples:
COLLABSTORY (2024): Multiple LLM agents assume different narrative roles (protagonist, antagonist, narrator, editor) and collaboratively generate stories through structured debate. The "editor" agent evaluates coherence and rejects low-quality passages — a direct analog of gen-emerge's adversarial evaluation.
StoryWriter (2024): A hierarchical multi-agent system where a "planner" agent creates plot structure, "writer" agents generate prose, and a "critic" agent evaluates. The architecture mirrors gen-emerge's Proposer → Generator → Evaluator pipeline.
Persona-driven "1001 Nights" experiments: AI agents with persistent personas generate narratives over hundreds of episodes, developing narrative voice and memory. The persona persistence is analogous to gen-emerge's evolving constraint space.
Cross-domain validation: Multi-agent architectures with adversarial roles converge independently across domains: visual art (gen-emerge), literature (COLLABSTORY), and code (AI Scientist). This suggests that the multi-agent adversarial pattern is a universal solution to creative convergence, not domain-specific. The "1001 Nights" persona experiments validate gen-emerge's approach to long-term identity evolution through accumulated memory.
D. Self-Evolving AI Systems
Systems that improve their own architecture, not just their outputs. The frontier of autonomous intelligence — and the most radical vision for where gen-emerge could go.
7.1 Sakana AI — The AI Scientist (2024–2025)
Self-EvolvingFull Research CyclePublished in Nature
Sakana AI's "AI Scientist" is an automated system that performs the full scientific research cycle: formulating hypotheses, designing experiments, writing code, running experiments, analyzing results, and writing papers. Version 2 (early 2025) produced papers that reviewers rated as workshop-level at top ML conferences — not just plausible, but genuinely contributing new knowledge. The project was published in Nature.
The significance for creative AI is profound: if a system can autonomously produce novel scientific knowledge, the creative barrier is not fundamentally different from producing novel art. The AI Scientist encounters the same problems: convergence to "safe" incremental work, difficulty producing genuinely surprising results, and the need for diversity across research directions.
Lesson for gen-emerge: The AI Scientist validates a core gen-emerge thesis: multi-step autonomous pipelines with evaluation can produce genuine novelty. The parallel is structural — hypothesis (prompt) → experiment (generation) → analysis (scoring) → iteration. The "safe incrementalism" problem in AI science is identical to convergence in art: the system finds what works and repeats it. Gen-emerge's QD approach could be applied to scientific discovery.
The Darwin Gödel Machine (DGM) takes self-improvement further: an AI system that modifies its own code to improve performance. Starting from a base coding agent, DGM uses evolutionary algorithms to discover better agent architectures — evolving not just outputs but the process that generates outputs. On the SWE-bench coding benchmark, DGM improved performance from 20% (base agent) to 50% (evolved agent) through self-modification.
The name references Jürgen Schmidhuber's theoretical Gödel Machine (2003) — a self-referential system that can improve any part of itself, including its improvement mechanism. DGM is the first practical approximation.
Lesson for gen-emerge: DGM opens a radical trajectory: an art system that doesn't just generate better art but evolves its own architecture for generating art. Currently, gen-emerge's architecture is fixed (ε/η/θ designed by humans). A DGM-inspired extension would allow the system to discover new agent configurations, scoring functions, or diversity mechanisms autonomously. This is "gen-emerge Phase N" — the system designing its own creative process.
7.3 Open-Ended Evolution: POET, OMNI, QD-Lenia
Self-EvolvingCo-Evolution / Open-EndedUber AI / OpenAI
A family of algorithms pursuing open-ended evolution — systems that continuously generate novelty without plateauing:
POET (Paired Open-Ended Trailblazer, 2019): Co-evolves environments and agents simultaneously. As agents learn to solve one environment, the system generates harder environments, creating an endless complexity ladder. Unlike fixed-objective optimization, POET never converges — it continuously expands the frontier of what is possible.
OMNI (ICLR 2024): The latest in the POET lineage, achieving open-ended evolution in multi-objective spaces. OMNI demonstrated that co-evolution of problems and solutions produces more diverse outcomes than evolving solutions alone — because the problem space itself diversifies.
QD + Lenia: Applying Quality-Diversity algorithms to Lenia (continuous cellular automata) produces an ever-expanding library of "artificial life forms." Each cell in the MAP-Elites archive contains a unique life-like behavior — and the archive never "fills up" because new behavioral dimensions keep being discovered.
Lesson for gen-emerge: Open-ended evolution solves the core problem: how to avoid plateauing. POET's key insight — co-evolving the challenge alongside the solver — translates directly: gen-emerge could co-evolve its constraint space alongside its generation capability. Instead of a fixed ontology, the ontology itself evolves, generating new aesthetic dimensions that the system then learns to fill. This is the theoretical end-state of the B6 QD approach: not just filling an archive, but growing the archive's dimensions.
E. Individual AI Artist-Technologists
Artists who have built significant AI systems as extensions of their creative practice — not corporate products, but personal artistic visions scaled through technology.
8.1 Refik Anadol / Large Nature Model (2023–present)
Artist-TechnologistCustom Neural Net / Data SculptureMoMA · $1.87M Christie's
Refik Anadol has built the most commercially successful practice at the intersection of AI and large-scale art. His signature works are "data sculptures" — immersive installations that visualize millions of data points (satellite images, nature sounds, coral reef data) through custom neural networks projected onto architectural surfaces.
Key milestones:
Unsupervised (MoMA, 2023): The first AI art commission by a major museum. A living data sculpture trained on MoMA's collection of 200+ years of modern art, displayed on a 24-foot screen in the lobby. The work changes continuously, "dreaming" new combinations from the collection.
Large Nature Model (LNM, 2023–): Anadol's most ambitious project — an open-source AI model trained on millions of nature images and sounds, designed specifically for artistic generation. LNM is not a general-purpose model; it is purpose-built for the aesthetics of nature. This is significant: instead of adapting a commercial model (like Stable Diffusion), Anadol built one from scratch for a specific creative vision.
Dataland (Spring 2026): Announced as the world's first museum dedicated to AI art, located in Los Angeles. The building itself is designed to be an AI artwork.
Christie's sales: Anadol's collaboration with Lionel Messi sold for $1.87M at Christie's — the highest price for an Anadol work and one of the highest for any AI-assisted art.
Lesson for gen-emerge: Anadol demonstrates the data-as-medium paradigm: the artistic statement is not in the model architecture but in the training data. His LNM (nature-only training) produces a fundamentally different aesthetic than a model trained on Internet images. For gen-emerge: the training/reference data shapes the creative space as much as the architecture. The purpose-built model approach (LNM) validates gen-emerge's domain-specific agent design. Commercially, Anadol proves that institutional-scale AI art ($1.87M, MoMA, dedicated museum) is a viable path.
8.2 Mario Klingemann (2015–present)
Artist-TechnologistNeural Glitch / GAN ArtCreator of Botto
Artist: Mario Klingemann (Quasimondo) |
Medium: Neural network art, generative systems |
Web:quasimondo.com |
GitHub:Quasimondo |
Profile:AIArtists.org
Mario Klingemann is arguably the most technically sophisticated individual AI artist and the creator of Botto. His earlier work "Memories of Passersby I" (2018) was the first AI artwork sold at Sotheby's — a self-contained unit with a neural network that generates faces endlessly, each one unique, each one disappearing after a few seconds. The work sold for $51,000.
Klingemann's "Neural Glitch" technique deliberately exploits failure modes of neural networks — artifacts, distortions, and unexpected outputs — as an aesthetic strategy. Rather than training networks to avoid errors, he trains himself to find beauty in their mistakes. This is a meta-creative practice: the artist doesn't create the art directly; the artist creates the conditions for unexpected art to emerge from machine failure.
Lesson for gen-emerge: Klingemann's Neural Glitch technique is the artistic counterpart to gen-emerge's novelty scoring: errors as features, not bugs. The "conditions for emergence" approach is philosophically aligned with gen-emerge's architecture: designing systems where surprise is structurally inevitable. Klingemann's subsequent creation of Botto demonstrates the trajectory from individual practice to autonomous system — the artist eventually builds a system that generates art independently.
9. The Autonomy Spectrum: A New Analytical Framework
A key contribution of this expanded review is the Autonomy Spectrum — a five-dimensional framework for comparing creative AI systems beyond simple "autonomous vs. not" categories. Each dimension is scored 0–10:
Dimension
Definition
0 (minimal)
10 (maximal)
A1: Generation Autonomy
Can the system produce works without human triggering?
Human initiates each piece
System generates continuously
A2: Evaluation Autonomy
Can the system assess its own output quality?
Human evaluates all
Fully self-evaluating
A3: Evolution Autonomy
Can the system change its creative direction?
Fixed style/parameters
Self-modifying goals
A4: Identity Persistence
Does the system develop recognizable identity over time?
No memory between sessions
Coherent identity over years
A5: Self-Improvement
Can the system improve its own architecture?
Architecture fixed by designer
Self-modifying architecture
Key Insight from the Autonomy Spectrum
Most art systems are unbalanced: AARON scores high on Generation (8) and Identity (7) but zero on Self-Improvement. CAN scores 10 on Generation but zero on Identity. Gen-emerge is designed for balanced autonomy across all five dimensions — scoring 7–8 on four dimensions. The remaining gap is A5 (Self-Improvement): the ability to modify its own architecture. This is the trajectory indicated by DGM and AI Scientist: not just generating better art, but evolving the process of making art.
Six problems remain unsolved across all 22 reviewed projects:
Long-term diversity without external intervention. No system has maintained diversity autonomously beyond several hundred cycles. Gen-emerge's FadeMem + Martingale Score is the most promising approach but unproven at scale.
Formalized forgetting. No project has implemented formalized forgetting. FadeMem (B4) has no precedent.
Multi-agent with different models in adversarial roles. Multi-agent stories use the same LLM in different roles. Gen-emerge's ε/η/θ with genuinely different models would be first.
Stagnation detection. No project has a formal stagnation detector. The Martingale Score (B9) has no analogs.
Self-modifying creative architecture. DGM achieves self-modification for coding tasks; no art system has attempted it. This is gen-emerge's long-term frontier.
Cross-domain creative transfer. No system transfers creative strategies between visual art, music, and writing. Gen-emerge's architecture is theoretically domain-agnostic but untested outside visual art.
12. Updated Catalog of Mechanisms and Design Patterns
12.1 Generative Mechanics
Overnight batch production (AARON)
Cohen left AARON running overnight, reviewing 100+ images in the morning. Removing real-time observation eliminates the temptation to micromanage. For gen-emerge: a formalized "batch overnight → morning surprise dashboard" highlighting the most unexpected results by novelty.
Style ambiguity as objective (CAN)
If a CLIP-based style classifier assigns softmax > 0.7 for any style class, this signals convergence. If no class exceeds 0.3, this is the sweet spot. Implementable as a "style ambiguity bonus" in gen-emerge scoring.
Perception-action-evaluation loop (CloudPainter)
The robot photographs its own canvas after each stroke and adjusts the next stroke accordingly. Physical randomness (paint dripping, brush wearing) introduces genuine noise that digital systems lack. For gen-emerge: consider injecting structured randomness at the generation level — "happy accidents" as a diversity mechanism.
Environmental embedding (Painting Fool CUBRIC)
The AI artist embedded in a real institution produces context-responsive art — not generic generation but art rooted in a specific place and time. For gen-emerge: the Snapshot pipeline is an environmental embedding at a lower fidelity. The CUBRIC model suggests richer environmental input could produce richer output.
Data-as-medium (Anadol LNM)
The training dataset IS the creative medium. Nature data → nature aesthetic. WikiArt → art aesthetic. For gen-emerge: curating what goes into the reference corpus is as important as designing the architecture. Domain-specific models (like LNM) outperform general models for specific aesthetics.
Co-evolutionary complexity ladder (POET/OMNI)
Instead of fixed challenges, co-evolve the challenge alongside the solver. For gen-emerge: the constraint ontology could evolve — as the system masters one aesthetic space, new dimensions emerge automatically, preventing plateauing.
12.2 Social and Identity Mechanics
Shepherds (Electric Sheep)
Among 450K passive participants, ~5–20 active shepherds inject novel genetic material. For gen-emerge: formalize the "shepherd" role as periodic exogenous shock.
13-year temporal commitment (Abraham)
Art systems need time to develop identity. Abraham's covenant frames AI art as a temporal process, not a product. For gen-emerge: the long-term identity evolution (accumulated FadeMem, QD archive growth) is the system's primary artistic contribution — the journey, not any single image.
Public persona (Botto/Otto)
Botto's Otto agent demonstrates that an autonomous artist's identity extends beyond its visual output into social interaction. For gen-emerge: the research articles, the map, the system's "voice" — these constitute an identity layer that goes beyond images.
Consent layer (Holly+/Spawning)
As AI art systems mature, the consent/provenance question becomes critical. Spawning builds tools for opt-in/opt-out. For gen-emerge: transparency about training data and reference artists is both ethical and practically useful for positioning.
Embodied co-creation (D.O.U.G.)
The human body as input device blurs the boundary between artist and tool. EEG-linked painting is the extreme: thought → art, with the AI as intermediary. For gen-emerge: implicit feedback (engagement patterns) could supplement the 8 explicit channels.
12.3 Economic and Distribution Mechanics
Weekly auction cadence (Botto)
$5M+ revenue over 4 years with 1 work/week cadence. For gen-emerge: from N generated per cycle, only the best enters the "canonical collection." Scarcity creates value.
Institutional trajectory (Anadol)
Anadol's path: gallery → museum commission (MoMA) → dedicated museum (Dataland). This is a proven monetization ladder for AI art at scale. For gen-emerge: the research articles and dome analysis establish institutional credibility before market entry.
Attribution controversy as narrative (Obvious)
The $432K Obvious sale was driven partly by the controversy itself — the narrative of "who is the artist?" was more valuable than the image. For gen-emerge: the system's unique architecture IS the narrative. Multi-agent, adversarial, self-evolving — this is the story that sells.
13. Positioning of Gen-Emerge (Updated)
Based on this expanded systematic review of 22 projects across five categories, gen-emerge is identified as the first system that simultaneously:
Utilizes LLM + image model in a semantic-to-visual pipeline (not GAN, rules, or fractals)
Operates in a multi-agent multi-model architecture with adversarial roles
Implements formalized forgetting (FadeMem) — unprecedented in any creative system
Applies a QD approach to art generation with a single human
Incorporates stagnation detection (Martingale Score) — no analog in any reviewed system
Develops a persistent artistic identity through accumulated memory and evolving ontology
Maintains a research apparatus that contextualizes its own work in art history (DOME analysis, this review)
Updated Positioning Statement
The nearest analog remains Botto, which resolves diversity through volume (70K/week) and crowd (5K+ voters). Gen-emerge must solve the same problem through architecture at 1–4 images/cycle with a single human. The expanded review reveals that gen-emerge's architecture is convergent with emerging patterns across domains: multi-agent adversarial roles (validated in literature), QD approaches (validated in evolutionary computation), and environmental embedding (validated by Painting Fool's CUBRIC residency). The primary gap is A5: self-improvement — the frontier opened by Sakana's DGM. An art system that evolves its own creative architecture would be genuinely unprecedented.
The Trajectory
Phase 1 (current): Multi-agent multi-model art generation with QD + FadeMem. Unique combination of known mechanisms. Phase 2 (near-term): Co-evolving constraint ontology (POET-inspired). The aesthetic space itself evolves, preventing plateauing. Phase 3 (long-term): Self-modifying creative architecture (DGM-inspired). The system discovers better agent configurations, scoring functions, and diversity mechanisms autonomously. The system designs its own creative process.
References
Cohen, H. "Mind, Machine, and Creativity: An Artist's Perspective." Leonardo, 2014. PMC4265294
Stanley, K.O. "Picbreeder: A Case Study in Collaborative Evolutionary Exploration of Design Space." Evolutionary Computation, 19(3), 2011. MIT Press
Elgammal, A. et al. "CAN: Creative Adversarial Networks, Generating Art by Learning About Styles and Deviating from Style Norms." 2017. arXiv:1706.07068
Colton, S. "The Painting Fool: Stories from Building an Automated Painter." In Proc. ICCC 2015. ICCC 2015
Colton, S. "The Painting Fool: Stories from Building an Automated Painter." In Proc. ICCC 2011. ICCC 2011
Kogan, G. "Artist in the Cloud: Towards the Summit of Visual Creativity." NeurIPS 2019 Workshop on Machine Learning for Creativity and Design. PDF
Caselles-Dupré, H. et al. "OnlyFlow: Optical Flow based Motion Conditioning for Video Diffusion Models." 2024. arXiv:2411.10501
Klingemann, M. et al. "Botto: A Decentralized Autonomous Artist." NeurIPS ML4CD Workshop, 2022. PDF
Ding, L. et al. "Quality Diversity through Human Feedback." 2023. arXiv:2310.12103
Fontaine, M.C. & Nikolaidis, S. "Differentiable Quality Diversity." 2021. arXiv:2106.03894
Chakrabarty, T. et al. "CollabStory: Multi-LLM Collaborative Story Generation and Authorship Analysis." 2024. arXiv:2406.12665
Zhou, Y. et al. "StoryWriter: A Multi-Agent Framework for Long Story Generation." 2025. arXiv:2506.16445
Lu, C. et al. "The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery." 2024. arXiv:2408.06292
Yamada, Y. et al. "The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search." 2025. arXiv:2504.08066
Zhang, J. et al. "Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents." 2025. arXiv:2505.22954
Wang, R. et al. "Paired Open-Ended Trailblazer (POET): Endlessly Generating Increasingly Complex and Diverse Learning Environments and Their Solutions." 2019. arXiv:1901.01753
Wang, R. et al. "Enhanced POET: Open-Ended Reinforcement Learning through Unbounded Invention of Learning Challenges and their Solutions." 2020. arXiv:2003.08536
Zhang, J. et al. "OMNI: Open-endedness via Models of human Notions of Interestingness." ICLR 2024. arXiv:2306.01711
Faldor, M. et al. "Toward Artificial Open-Ended Evolution within Lenia using Quality-Diversity." 2024. arXiv:2406.04235
Исходный обзор (февраль 2026) анализировал 10 знаковых проектов в области автономного генеративного искусства. Расширенная редакция расширяет охват за пределы визуального искусства, покрывая все домены автономных креативных систем — музыку, литературу, геймдизайн, научные открытия — и вводит новую аналитическую рамку: Спектр автономности.
Из приблизительно 50 выявленных проектов и экспериментов отобраны 22 по шести критериям: (1) автономность — серийная генерация без ручного запуска; (2) обратная связь — механизм, влияющий на последующие поколения; (3) непрерывная эволюция — работа во времени с накоплением истории; (4) проблема разнообразия — столкновение с конвергенцией; (5) масштаб — результаты за пределами прототипа; (6) кросс-доменная релевантность — уроки, переносимые на gen-emerge.
Чем сотворчество человеческого тела и AI отличается от чистой автономии?
C. Кросс-доменные креативные AI
AIVA (музыка), мультиагентные нарративные системы (литература)
Переносятся ли паттерны автономного творчества между медиа?
D. Самосовершенствующиеся AI-системы
Sakana AI Scientist, Darwin Gödel Machine, POET/OMNI
Может ли AI улучшать сам себя — и что это означает для искусства?
E. Индивидуальные AI-художники
Refik Anadol / LNM, Mario Klingemann
Как художники-технологи масштабируют своё видение через AI?
A. Автономные системы визуального искусства
Системы, генерирующие визуальное искусство с минимальным или нулевым участием человека в каждой работе. Основная линия предков gen-emerge.
2. Эра до глубокого обучения
2.1 AARON (Гарольд Коэн, 1973–2016)
AARON признаётся первой долгоживущей автономной арт-системой. Разработанная художником Гарольдом Коэном как экспертная система на C/Lisp на протяжении 43 лет, она автономно генерировала рисунки и картины с использованием кодифицированных знаний о композиции, перспективе и анатомии.
Система прошла приблизительно 60 итераций: от абстрактных линий (1970-е) через фигуры и пространства (1980-е) к автономному выбору цвета (1990-е) и абстрактной живописи (2000-е).
Паттерн конвергенции
AARON конвергировал к собственному стилю Коэна. К 2009 году Коэн пережил творческий кризис и вернулся к ручной живописи поверх выходов AARON, осознав: «творчество заключалось не в программисте и не в программе по отдельности, а в диалоге между программой и программистом».
Урок для gen-emerge: одноавторская система = неизбежная конвергенция к стилю автора. 43 года итераций не преодолели single-source bias. Необходима мультиагентность.
2.2 Карл Симс: Genetic Images / Galápagos (1991–1997)
Пионерский проект по эволюции визуальных форм посредством генетических алгоритмов. Генотип — дерево математических функций; фенотип — изображение, вычисленное для каждого пикселя.
Паттерн конвергенции: интерактивная эволюция ограничена утомлением пользователя. Конвергенция к «привлекательным по умолчанию» паттернам.
Урок для gen-emerge: человек в цикле вносит смещение быстрее, чем алгоритм. «Фаворит ≠ делай больше» — прямой ответ.
2.3 Electric Sheep (Скотт Дрейвс, 1999–н.в.)
Распределённая система эволюции фрактальных анимаций, работающая 27 лет. 450 000+ компьютеров; массовое голосование создаёт аттрактор медианного вкуса. Разнообразие поддерживается через инъекции «пастухов» — ~5–20 активных участников.
Урок для gen-emerge: даже с 450К участниками, без активной внешней инъекции система конвергирует к медианному вкусу. Роль «пастуха» — прототип экзогенной инъекции ограничений (T4d).
2.4 Picbreeder (Стэнли и соавт., 2007–2021)
Платформа коллаборативной эволюции изображений с CPPN-NEAT. Ключевая инновация — «ветвление». Стэнли доказал: «преследование цели ограничивает эволюцию» — изображения через свободное ветвление не могли быть переоткрыты при целенаправленном поиске.
Генератор получает два противоречивых сигнала: «выглядит ли как искусство?» и «к какому стилю это?» (максимизация неопределённости). Теория Берлайна: максимум эстетического удовольствия при умеренной новизне.
Урок для gen-emerge: два противоречивых сигнала CAN — прототип MAE-триплета (ε). «Бонус за стилевую неопределённость» реализуем в скоринге.
3.2 The Painting Fool (Саймон Колтон, 2001–н.в.)
Программа-«художник» с тремя свойствами: мастерство, оценка, воображение. Читает газеты, определяет настроение дня, может отказаться рисовать.
Обновление 2024 — резиденция CUBRIC: Painting Fool стал первым виртуальным художником-резидентом в центре нейровизуализации CUBRIC (Кардиффский университет). Целый год система работала автономно внутри здания, наблюдая за исследователями и создавая работы без инструкций. Это ближайший аналог конвейера gen-emerge Snapshot → Ontology: среда → внутреннее состояние → творческий выход.
Урок для gen-emerge: встраивание в среду производит более богатые выходы, чем генерация по промптам. Конвергенция сохраняется даже с инпутом от среды.
3.3 Abraham / Eden (Джин Коган, 2017–н.в.)
«Автономный искусственный художник» — sovereign creative spirit через multi-party computation.
Обновление 2025 — 13-летний ковенант: С октября 2025 Abraham вошёл в 13-летний «творческий ковенант». Eden.art — публичная галерея. «Первые работы Авраама» на выставке AUTOMATA в Лос-Анджелесе — первое соло AI-художника с формальным долгосрочным контрактом. Ковенант — это AI-искусство как временное обязательство, не одноразовый эксперимент.
Урок для gen-emerge: 13-летний ковенант напрямую валидирует темпоральный подход gen-emerge. Арт-системам нужно время для развития идентичности.
14M+ пользователей, 300M+ изображений. Ген-слайдеры + скрещивание в латентном пространстве.
Урок для gen-emerge: одномодельное латентное пространство = потолок разнообразия. Мультимодельная архитектура = преодоление потолка.
3.5 Obvious Collective — Edmond de Belamy (2018)
Коллектив Obvious произвёл «Edmond de Belamy» на GAN, обученной на 15 000 портретах WikiArt. Работа продана на Christie's за $432 500 — 43× от верхней оценки. Однако проект — кейс-стади проблемы атрибуции: код и данные были открыто опубликованы Робби Барратом.
Урок для gen-emerge: мультиагентная архитектура gen-emerge даёт чёткий ответ на вопрос авторства: система — это художник. Продажа за $432K также доказала: арт-рынок ценит нарратив и провенанс не менее эстетики.
Ближайший аналог gen-emerge. ~70 000 изображений/неделю; вкусовая модель отбирает 350 для голосования DAO; 1 каноническая работа/неделю → NFT на SuperRare.
Обновления 2024–2025:
p5.js инициатива: 22 алгоритма генеративного кода. Выставка SOLOS (февраль 2025). Расширение за пределы diffusion в процедурную эстетику.
Otto — Twitter-агент: AI-персона Botto в соцсетях. Первый случай публичной «личности» AI-художника.
LLM-тьюторинг по истории искусства: контекстное знание движений и техник для генерации.
Sotheby's: переход от крипто-нативного рынка к институциональному.
Планы мультиагентной архитектуры: конвергенция с подходом gen-emerge.
Критический урок: Botto решает разнообразие через объём (70К/неделю). Gen-emerge — через архитектуру (1–4/цикл). p5.js инициатива валидирует потенциал gen-emerge в code-based art. Планы Botto по мультиагентности независимо подтверждают тезис gen-emerge. Выручка $5M+ за 4 года подтверждает коммерческую жизнеспособность.
4.2 LSI + QDHF (Fontaine и соавт., 2020–2024)
Quality-Diversity (MAP-Elites) в латентном пространстве SD. QDHF-обученные метрики превосходят ручные оси, отражая человеческое восприятие «различного».
Урок для gen-emerge: прямой прототип T2e. Двойной отпечаток (T2d) — компромисс; QDHF — золотой стандарт.
B. Человеко-AI коллаборативные системы
Системы, где человеческое тело или присутствие — неотъемлемая часть креативного цикла.
5.1 CloudPainter (Пиндар Ван Арман, 2005–н.в.)
Робот-живописец с обратной связью через камеру: фотографирует холст после каждого мазка, оценивает результат через нейросеть, решает следующий мазок. 1000+ холстов за 20 лет. Первое место Robot Art 2018. Ключевая инновация — петля восприятие-действие-оценка в физическом мире: воплощённый интеллект, отсутствующий в цифровых генераторах.
Урок для gen-emerge: петля обратной связи на физическом уровне даёт неожиданные результаты. Архитектура «конкурирующих нейросетей» зеркалит adversarial агентов gen-emerge. Физическое воплощение — потенциальное направление для выходного медиума.
5.2 Sougwen Chung / D.O.U.G. (2015–н.в.)
Роботические руки, обученные на жестах художницы, рисуют в реальном времени в дуэте с человеком. D.O.U.G._1 → D.O.U.G._5: от имитации через импровизацию к независимости.
Spectral (2025): на WEF в Давосе — робот рисует по сигналам ЭЭГ. Мысль → нейросигнал → мазок робота. Самая глубокая связь человеческого сознания и AI-действия в искусстве.
Урок для gen-emerge: D.O.U.G. — принципиально другая модель автономии: AI не заменяет человека, а создаёт новую гибридную сущность. Для gen-emerge: имплицитные сигналы (время взаимодействия, паттерны просмотра) могут быть столь же мощными, как 8 явных каналов.
5.3 Holly Herndon / Holly+ (2021–н.в.)
AI-модель голоса Holly Herndon, которую может использовать любой. DAO голосует за одобрение сгенерированных работ. Spawning строит «уровень согласия» для AI-тренировки — инструменты opt-in/opt-out.
Starmirror (KW Berlin, 2025): инсталляция, трансформирующая голоса посетителей через Holly+ в реальном времени.
Урок для gen-emerge: Holly+ инвертирует обычную модель: человек становится сырьём, AI — механизмом трансформации. Вопрос согласия/провенанса релевантен: по мере развития идентичности системы нужна рамка работы со ссылками на обучающие данные.
C. Кросс-доменные автономные креативные системы
Проявляются ли те же паттерны конвергенции в музыке, литературе, геймдизайне? Ответ — да.
Первый виртуальный композитор, признанный SACEM. Обучена на 30 000+ классических партитур. Музыка технически профессиональна, но критикуется как «приятная, но производная» — компетентные имитации романтизма XIX века. Это аттрактор медианного вкуса (P2) в музыке.
Кросс-доменная валидация: P2 доменно-агностичен — появляется в музыке точно так же, как в визуальном искусстве. Лекарство — архитектурное разнообразие или явное давление новизны (QD).
6.2 Мультиагентные нарративные системы (2023–2025)
Мультиагентные LLM-системы для автономной генерации нарративов — прямая структурная параллель архитектуре gen-emerge:
COLLABSTORY (2024): Агенты LLM в ролях протагониста, антагониста, нарратора, редактора. «Редактор» — прямой аналог adversarial оценки gen-emerge.
Эксперименты «1001 ночь»: Персонажи с устойчивыми персонами генерируют нарративы на протяжении сотен эпизодов.
Кросс-доменная валидация: мультиагентные adversarial архитектуры конвергируют независимо: визуальное искусство (gen-emerge), литература (COLLABSTORY), код (AI Scientist). Это универсальное решение проблемы креативной конвергенции.
D. Самосовершенствующиеся AI-системы
Системы, улучшающие собственную архитектуру, а не только выходы. Фронтир автономного интеллекта.
7.1 Sakana AI — The AI Scientist (2024–2025)
Автоматизированная система полного цикла научного исследования: гипотезы → эксперименты → код → анализ → статьи. Версия 2 — статьи уровня воркшопов топ ML-конференций. Опубликовано в Nature.
Урок для gen-emerge: AI Scientist валидирует: мультишаговые автономные конвейеры с оценкой способны производить подлинную новизну. Структурная параллель: гипотеза (промпт) → эксперимент (генерация) → анализ (скоринг) → итерация. «Безопасный инкрементализм» в науке = конвергенция в искусстве.
7.2 Sakana AI — Darwin Gödel Machine (2025)
AI-система, модифицирующая собственный код для улучшения производительности. На SWE-bench: 20% → 50% через самомодификацию. Первое практическое приближение к теоретической машине Гёделя Шмидхубера.
Урок для gen-emerge: DGM открывает радикальную траекторию: арт-система, которая не просто генерирует лучшее искусство, а эволюционирует собственную архитектуру генерации. Это «gen-emerge Фаза N» — система проектирует собственный креативный процесс.
7.3 Open-Ended Evolution: POET, OMNI, QD-Lenia
POET: совместная эволюция сред и агентов. OMNI (ICLR 2024): открытая эволюция в многоцелевых пространствах. QD + Lenia: вечно расширяющаяся библиотека «форм жизни».
Урок для gen-emerge: POET: совместная эволюция вызова и решателя. Gen-emerge мог бы совместно эволюционировать пространство ограничений вместе с генерацией. Не просто заполнение архива, а рост размерностей архива.
E. Индивидуальные AI-художники-технологи
Художники, построившие значимые AI-системы как расширение своей творческой практики.
8.1 Refik Anadol / Large Nature Model (2023–н.в.)
Наиболее коммерчески успешная практика на пересечении AI и крупномасштабного искусства. «Дата-скульптуры» — иммерсивные инсталляции, визуализирующие миллионы точек данных.
Unsupervised (MoMA, 2023): первый AI-заказ крупного музея. Живая скульптура из 200+ лет коллекции MoMA.
Large Nature Model: open-source модель на миллионах природных изображений и звуков. Не адаптация коммерческой модели, а специально построенная для эстетики природы.
Dataland (весна 2026): первый музей, посвящённый AI-искусству.
Christie's: коллаборация с Месси — $1.87M.
Урок для gen-emerge: парадигма «данные-как-медиум»: художественное высказывание — в тренировочных данных, не только в архитектуре. Целевая модель (LNM) валидирует доменно-специфичный дизайн агентов gen-emerge. Коммерчески: $1.87M + MoMA + музей = жизнеспособный путь.
8.2 Mario Klingemann (2015–н.в.)
Создатель Botto. «Memories of Passersby I» (2018) — первая AI-работа на Sotheby's ($51 000). Техника «Neural Glitch» — намеренная эксплуатация ошибок нейросетей как эстетической стратегии. Мета-творческая практика: художник создаёт условия для возникновения неожиданного искусства из машинных сбоев.
Урок для gen-emerge: Neural Glitch — художественный аналог новизнового скоринга gen-emerge: ошибки как фичи, не баги. Подход «условия для эмерджентности» философски совпадает с архитектурой gen-emerge.
9. Спектр автономности: новая аналитическая рамка
Ключевой вклад расширенного обзора — Спектр автономности, пятимерная рамка для сравнения креативных AI-систем. Каждое измерение 0–10:
Измерение
Определение
0 (минимум)
10 (максимум)
A1: Автономия генерации
Может ли система создавать работы без запуска человеком?
Человек запускает каждую
Непрерывная генерация
A2: Автономия оценки
Может ли система оценить качество собственных выходов?
Человек оценивает все
Полная самооценка
A3: Автономия эволюции
Может ли система менять творческое направление?
Фиксированный стиль
Самомодификация целей
A4: Устойчивость идентичности
Развивает ли система узнаваемую идентичность?
Нет памяти между сессиями
Когерентная идентичность на годы
A5: Самосовершенствование
Может ли система улучшать собственную архитектуру?
Архитектура фиксирована
Самомодифицирующаяся
Ключевой инсайт Спектра автономности
Большинство арт-систем несбалансированы: AARON высоко на генерации (8) и идентичности (7), но 0 на самосовершенствовании. Gen-emerge проектируется для сбалансированной автономности по всем пяти измерениям — 7–8 по четырём. Оставшийся разрыв — A5 (самосовершенствование): способность модифицировать собственную архитектуру. Это траектория DGM и AI Scientist.
10. Расширенный сравнительный анализ
Проект
Кат.
Период
Генерация
Обратная связь
Механизм разнообразия
Конвергенция
AARON
A
1973–2016
На правилах
Автор вручную
Обновление правил
Стиль автора
Sims
A
1991–97
Генетич. алг.
Интерактивная
Разнообразие пользователей
Утомление + bias
E.Sheep
A
1999–н.в.
Fractal Flame
Голосование (450K)
Инъекции пастухов
Медианный вкус
P.Fool
A
2001–н.в.
Мульти-техника
Самооценка + среда
Расширение возможностей
Bias автора
CloudPainter
B
2005–н.в.
RL + физ. краска
Камера → нейросеть
Физическая случайность
Целевое изображение
Picbreeder
A
2007–2021
CPPN-NEAT
Интеракт. + ветвл.
Ветвление линий
Угасание сообщества
D.O.U.G.
B
2015–н.в.
Gesture + RL
Тело + ЭЭГ
Real-time человеч. ввод
Граница стиля худ.
AIVA
C
2016–н.в.
Deep learning
Правила + ML
Мульти-эпохальн. тренировка
Романтич. медиана
CAN
A
2017
Модиф. GAN
Adversarial
Стилевая неопредел.
Mode collapse
Abraham
A
2017–н.в.
CLIP+VQGAN→SD
DAO
Децентрализация
Bias модели
Artbreeder
A
2018–н.в.
StyleGAN
Слайдеры
Ветвление сообщества
GAN-потолок
Obvious
A
2018
GAN (портрет)
Курат. отбор
Выбор датасета
Один датасет
LSI+QDHF
A
2020–24
SD + MAP-Elites
CLIP + чел.
QD-архив
Насыщение архива
Botto
A
2021–н.в.
SD + taste model
DAO (5K+)
Объём + периоды
Конв. вкус. модели
Holly+
B
2021–н.в.
Голос. модель
DAO одобрение
Открытость ввода
Одно-голос. простр.
Anadol
E
2023–н.в.
Кастом. нейросеть
Курация художника
Целевая тренировка
Потолок данных
AI Scientist
D
2024–н.в.
LLM pipeline
Автомат. ревью
Разнообр. направлений
Безопасн. инкремент.
Мульти-аг. нарр.
C
2024–н.в.
Мульти-LLM роли
Агент-редактор
Ролевое разнообразие
LLM-гомогенность
DGM
D
2025
Самомодиф. код
Бенчмарк
Эволюц. поиск
Переобучение на бенчм.
POET/OMNI
D
2019–24
Со-эволюция
Среда адаптируется
Открытая сложность
Масштаб вычислений
11. Синтез: обновлённые универсальные паттерны
11.1 Семь паттернов конвергенции (расширено с пяти)
Расширенный обзор 22 проектов выявил два дополнительных паттерна конвергенции:
P1–P5: те же что и в исходном обзоре (single-source bias, медианный вкус, цель убивает новизну, объём vs. архитектура, потолок модели).
P6: Доменный изоморфизм. Те же паттерны конвергенции проявляются в музыке (AIVA), литературе (COLLABSTORY) и науке (AI Scientist). Решения, доказанные в одном домене, переносятся в другие.
P7: Безопасный инкрементализм. Самооценивающие системы избегают риска: AI Scientist пишет «безопасные» статьи, вкусовая модель Botto избегает сюрпризов. → Необходим явный бюджет новизны (Martingale Score).
11.2 Обновлённые механизмы противодействия
#
Механизм
Проекты-источники
Реализация в gen-emerge
1
Внешняя инъекция
Electric Sheep, Botto, P.Fool (CUBRIC)
T4d
2
Ветвление / stepping stones
Picbreeder, Artbreeder, POET
B11
3
Adversarial давление стиля
CAN, COLLABSTORY, CloudPainter
T4e/T4f, ε
4
QD-архив
LSI+QDHF, Lenia+QD
B6
5
Мультимодельное разнообразие
Botto, Artbreeder, мульти-аг. нарративы
η, θ
6
Калиброванные метрики
QDHF, Artbreeder
T2e
7
Физическое заземление
CloudPainter, D.O.U.G., Holly+
Будущее: воплощённый медиум
8
Со-эволюционное давление
POET, OMNI, DGM
Будущее: со-эволюция онтологии
11.3 Нерешённые проблемы (обновлено)
Долгосрочное разнообразие без вмешательства. FadeMem + Martingale Score — наиболее обещающий подход, но не доказан в масштабе.
Формализованное забывание. FadeMem (B4) не имеет прецедента.
Мультиагентность с разными моделями в adversarial ролях. ε/η/θ gen-emerge были бы первыми.
Детекция стагнации. Martingale Score (B9) не имеет аналогов.
Самомодифицирующаяся креативная архитектура. DGM достигает в коде; в искусстве не пробовал никто.
Кросс-доменный перенос креативных стратегий. Архитектура gen-emerge теоретически доменно-агностична, но не проверена за пределами визуального искусства.
12. Обновлённый каталог механик
12.1 Генеративные механики
Ночное пакетное производство (AARON)
Режим «ночная генерация → утренний дашборд с сюрпризами».
Стилевая неопределённость как цель (CAN)
«Бонус за стилевую неопределённость» в скоринге.
Петля восприятие-действие-оценка (CloudPainter)
Структурированная случайность на уровне генерации — «счастливые случайности» как механизм разнообразия.
Встраивание в среду (Painting Fool CUBRIC)
Конвейер Snapshot — инпут от среды на более низком уровне верности. CUBRIC показывает: богаче среда → богаче выход.
Данные-как-медиум (Anadol LNM)
Курирование референсного корпуса столь же важно, как и дизайн архитектуры.
Со-эволюция сложности (POET/OMNI)
Онтология ограничений эволюционирует — новые эстетические измерения возникают автоматически.
12.2 Социальные механики и механики идентичности
Пастухи (Electric Sheep)
Формализация роли «пастуха» как периодического экзогенного шока.
13-летнее обязательство (Abraham)
Арт-системам нужно время. Долгосрочная эволюция идентичности — главный художественный вклад системы.
Публичная персона (Botto/Otto)
Идентичность автономного художника выходит за визуальный выход в социальное взаимодействие.
Уровень согласия (Holly+/Spawning)
Прозрачность тренировочных данных — и этическая, и практически полезная для позиционирования.
Воплощённое сотворчество (D.O.U.G.)
Имплицитная обратная связь может дополнить 8 явных каналов gen-emerge.
12.3 Экономические механики
Еженедельный аукцион (Botto)
$5M+ за 4 года. 1 работа/неделю. Дефицит создаёт ценность.
Институциональная траектория (Anadol)
Галерея → музейный заказ (MoMA) → собственный музей (Dataland). Доказанная лестница монетизации.
Контроверсия как нарратив (Obvious)
Уникальная архитектура gen-emerge — ЭТО нарратив, который продаёт. Мультиагентный, adversarial, саморазвивающийся.
13. Позиционирование Gen-Emerge (обновлено)
На основании расширенного обзора 22 проектов в пяти категориях gen-emerge идентифицируется как первая система, одновременно:
Использующая LLM + image model в семантическом конвейере
Работающая в мультиагентной мультимодельной архитектуре с adversarial ролями
Реализующая формализованное забывание (FadeMem)
Применяющая QD-подход к генерации искусства с одним человеком
Включающая детекцию стагнации (Martingale Score)
Развивающая устойчивую художественную идентичность через аккумулированную память
Поддерживающая исследовательский аппарат, контекстуализирующий собственные работы (анализ DOME, данный обзор)
Обновлённое позиционирование
Ближайший аналог — Botto (объём 70К/неделю vs. архитектура gen-emerge). Расширенный обзор показывает: архитектура gen-emerge конвергентна с паттернами, возникающими в других доменах: мультиагентные adversarial роли (валидировано в литературе), QD-подходы (валидировано в эволюционных вычислениях), встраивание в среду (валидировано CUBRIC). Ключевой разрыв — A5: самосовершенствование.
Траектория
Фаза 1 (текущая): Мультиагентная мультимодельная генерация с QD + FadeMem. Фаза 2 (ближайшая): Со-эволюция онтологии ограничений (по модели POET). Фаза 3 (долгосрочная): Самомодифицирующаяся креативная архитектура (по модели DGM). Система проектирует собственный креативный процесс.