DAC 2009

From Robert-Depot
Jump to: navigation, search

Not_Robert: Collaboration and Co-production in Language Systems

DAC 2009, Theme: Cognition and creativity


This paper explores the application of language processing technologies to the creation of interactive systems, particularly the problem of establishing the computer as a meaningful contributor to simple networks composed of artist, computer, and viewer. Throughout, this paper considers the ways in which effects of identification and interpretation are amplified and modulated through narrative framing. Two projects, a chatbot and a language processor, are considered as case studies of simple interactive systems, and as waypoints in a larger investigation of our desire (and predisposition) to ascribe meaning to events, experience, images, and objects. This proclivity is shown to have interesting ramifications with the accumulation of personal information on-line—both in public (Flickr, Facebook, Twitter) and in private (email, cellphone logs, bank accounts, health records)—suggesting the application of language processing and data-mining techniques to create custom, hyper-personal interactive experiences.

The first project, Megahal Grandmommy, is an n-gram Markov chain chatbot, trained as a surrogate for a family member suffering from Alzheimer’s disease. Shifting the computational goal from plausible simulation of conversation to plausible simulation of disease, this piece harnesses the inherent dysfunction of the model as a descriptive asset in the service of creative intent. The role and operation of narrative in this piece is discussed: how the viewer comes to understand the parameters of a narrative space (the artist’s identity, the program’s character and history) through clues uncovered in conversation with the software, and how that frame is essential to maintain the experience of the piece. Applying statistical language tools to form what is a very simple computational mechanism, this project nonetheless creates a rich interactive experience, exploiting the viewer’s predisposition to attach significance to the computer’s utterances on account of the character it performs (the artist’s grandmother).

Having identified language, specifically the formulation of ideas into language, as the most promising site for intervention in the creative process, this paper frames the problem of imagination and choice in studio practice as a series of operations on sets of data (ideas, images, and materials) represented in the space of language. This choice of representation is drawn from the idea of indexical language, as an approach to the problem of representing materially diverse assets in a common space—in this case a multi-dimensional vector space as described in Natural Language Processing (NLP) and Computational Linguistics (CL) literature. Using tools from those fields, specifically vector representations and relatedness operations, I describe and implement a language processing system, applying it to the domains of video composition and an art-idea machine, creating formulae for studio art objects. Less reliant on narrative framing than the chatbot project, the effort here is a decomposition of the creative process into data and operations, searching for new points of human-computer collaboration. Given a large, rich set of material, can a computer make meaningful creative decisions? If so, what kind?

Finally, this paper discusses the application of statistical language processing tools to the massive sets of personal material accumulated online to create hyper-personal interactive experience. With this online accretion of personal material, our contemporary self-representation of identity has become in some sense the aggregate of these digital-material traces, begging for exploitation via the technologies of index, analysis, and transformation (as described in the previous section) to produce new, personal interactive experience. Initial possibilities are described and discussed.


Text as Best Index


  • online culture produces massive, language-annotated media datasets, (e-mails, twitter, text message, any direct messaging), and (video, audio, photos)
  • Text as best index
  • Computational NLP as tool.

What will the program do?

  • Speculative exploration (in the realm of ideas) ... grow something beyond its bounds. (Diagram 1) ideas ?= language.
  • retrieval (humans do that). search engines. key-phrase. (semantically) related search.
  • summary. key terms. key phrases (of the form "VERB to VERB", "NOUN modifier", "modifier NOUN", etc.). in engineering, the field of IR (Information Retrieval).
  • ordering/sequencing. (the production of cinematic experience?) ruminatory... wandering... associative": "ocean"->"sky"->"blue"->"red"->"nail polish"->
  • art object as parameterizable object, template: <title><ostensible subject><dimensions><medium><material>

semantic gaussian blur

  • nearest neighbor replacement
  • analagous to associative free-thinking

Concerns with my approaches

  • ...but randomness is not good enough! don't remove the human from the loop, HI remains key criteria/filter. when do we like randomness (what point is made)? when do we not?
  • ...is it more interesting with intent? giving a system intent AI/ugh. If not intent, how about peroccupations/fixations.

Megahal Grandmommy

Language Brain