Error message
Theory, DH, and Noticing
There was a renewed flurry of thinking and observing about the relationship between theory and DH, especially around the "less yack/more hack" approach to THATCamps. I'd like to pick up a bit on the comment that I left on "When DH Was In Vogue; or, THATCamp Theory" , and respond a little to Jean Bauer's "Who You Calling Untheoretical?".
In my comment on "When DH Was In Vogue", I was worried about the perception that the THATCamp Bootcamps represented a colonizing of the humanisties by computer science, and argue that it's really the reverse. The Bootcamps are about the humanities pushing into computer science. Picking up from there, I feel a real tension about that. After all, the principles of writing code seem antithetical to much of how the humanities works. Coders like to keep things simple. We like to produce the most efficient algorithms, with very well-defined inputs and outputs. Especially with test-driven development, the code we write should have very precisely-defined functionality and purposes, and once written shouldn't change (or break).
Roughly, that's good coding, and good project management. Breaking those principles leads to scope creep, and to not getting projects done. A plugin for Omeka that I started years ago is still unfinished, because each time I built a new piece of functionality, I noticed that it could be expanded upon to produce new, more interesting results. Parts of the input (RSS/Atom feeds) could be interpreted in a variety of ways, and so I wrote code to make the various meanings explicit. There are assumptions and ambiguities built into the input formats that I wanted to unpack and explicate for the user.
"expand upon", "make various meanings explicit", "unpack and explicate". These are the words of a dissertation writer. I was writing code like I was writing my dissertation, which led to what a computer scientist or project manager will call scope creep.
That's why I'm so impressed with Jean's ability to build awesome things, with theory and the humanities in mind, and actually, you know, finish them.
But I'm not terribly surprised by the reaction that she describes, of people asking, "But where's the theory?" I'm not surprised, because, in general, I don't think theorists know what to notice yet. Our graduate programs are naturally still dominated by fairly traditional "texts", and theory rooted in them. And our theory has done a good job of complicating various notions of "text" (note to non-humanist coders: complicating ideas is a good and productive thing to theorists). Whatever our theoretical approach to whatever we are calling a text, we know what to notice in them. That's what much of our grad school training in theory is about -- what do we need to notice in the text, and how does our theory help us describe what we have noticed?
This is not surprising either, because unlike any of the various theoretical notions of text floating around, most people don't have an experiential grounding in database as text or API as text. As I said in my comment on "When DH Was In Vogue", one doesn't need to be a poet to do literary criticism on poetry, though the poet and critic will share a common knowledge base. But (unless you are a medievalist or classicist or doing comparative lit.) you don't need any special training to be able to begin noticing things in the text. You already know the language, and so you can begin your training in noticing right away.
Not so in the digital world. Indeed, a good user interface is designed specifically so that you don't have to deal with the inner workings of the application. In general, people should not see the internal structures of an application -- the database, the public and private methods in the core code. And so, there is no ability to even begin noticing what's notable.
This is where crit-code studies seems promising. There might be strong analogies to crit code training and methods and comparative literature, but that's something for someone more knowledgeable about both of them to figure out.
I'm not sure I'd go so far as to say that to do theory in/on DH one needs to learn to code or design a database. But one does need some training to be able to start noticing the difference between two data models that at surface appear to describe the same things. And, coders should be ready to learn what useful things theorists can offer that, despite a first appearance of scope creep, might just be valuable things to consider building into the code.
As a concrete example, I sometimes fret over the centrality of the item in Omeka. An "item" is the fundamental unit of information, and we have lots of ways to describe them with metadata, mostly dublin core. But "items" can become complicated very quickly. Some of my fellow CHNMers are struggling with how to put scrapbooks into Omeka. Is the scrapbook the item, or is each individual thing in the scrapbook an item? Where do pages in the scrapbook fit in? I'm guessing that there is theory that could be useful here, and could lead to an Omeka plugin that is designed to implement a theoretical approach to scrapbooks that lets us work with a more complex notion of "item" in our data model. I need some theory to help me notice things about scrapbooks, and to help me notice things about Omeka's notion of "item".
Producing a plugin that complicates Omeka's model of "item" by consciously building theory into the code would be a great code hack in harmony with theory yack.
Add comment
"Any medium powerful enough to extend man's reach is powerful enough to topple his world. To get the medium's magic to work for one's aims rather than against them is to attain literacy."
-- Alan Kay, "Computer Software", Scientific American, September 1984
I'm patrick_mj on Twitter
Comments
Love this post!
Patrick,
I completely love this post. I think your idea about complicating the "item" of Omeka is really really awesome. And I have nothing to add, just that I hope you help us figure out what THATCamp Theory will be/could be all about. You've already started. :)
Pragmatism v theory?
If I understand the example correctly, it is representative of many similar cases where the issue is how deeply to code a collection. I think the common assumption is that users would find metadata or RDF triples valuable all the way down to the smallest meme item level. However, resources constrain us, so the depth of coding is dictated by purely pragmatic concerns. Would theory change that?
I think the theory would --
I think the theory would -- and should -- come into play in defining RDF Classes and Properties (or other metadata). That's where there is already theory at work, but is often left implicit. A study of the evolution of BIBO, for example, would be a great target for theory. RDF offers a nice possibility to define metadata specifically from the perspective of a particular theory.
I like that you've brought up
I like that you've brought up the complications with the "item." Often this is a discussion that we have at Intro to Omeka workshops and demonstrations. I think that the item is challenges most often because of Omeka's flexibility as a web publishing system. While designed for cultural heritage objects, you can also use the system and adapt the data model for doing other types of analysis. I think that if you want to theorize the "item," it should be thought of having multiple theories, then, which are based on context.
For example, in a museum, the item is most closely aligned with an "object." An object is a thing that is assigned an accession number and all of its parts carry that # as its base ID. That thing is classified with controlled vocabulary, often from guides such as Nomenclature for Museum Cataloging. So in a museum, a scrapbook is the object and its pages are part of the original item. For a researcher/scholar who uses material culture approaches to examining that scrapbook she would look at it as a whole entity and the process, materials, intentions that went into creating that object. For example, here is John Lennon's stamp album, the album is the object/item w/one accession number, and each page is represented with a different image file that is associated with the item: http://arago.si.edu/index.asp?con=2&cmd=1&id=173567&pg=1&mode=1&tid=2040893&
In another context, if the scrapbook is the only thing one is examining for intense literary analysis, then each Scrapbook page might be an item with its own descriptive metadata.
Scale is another consideration. If you are working with one piece of evidence or text--ie a Scrapbook, you can more easily represent and describe each page as a separate item. If you have a collection of 1000 or 10K, 100K albums, scrapbooks, and other things you probably do not wish to break down and separate pieces of objects into separate parts, digitally, by making each items.
I'm kind of getting off track, but you see where I'm going. The item is challenging but I kind of like that it is flexible and can be what someone wants it to be based on their disciplinary, scholarly, professional needs, conventions, and practices.