<?xml version="1.0" encoding="utf-8"?>
<?oxygen RNGSchema="../../common/schema/DHQauthor-TEI.rng" type="xml"?>
<?oxygen SCHSchema="../../common/schema/dhqTEI-ready.sch"?>
<TEI xmlns="http://www.tei-c.org/ns/1.0"
     xmlns:dhq="http://www.digitalhumanities.org/ns/dhq"
     xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
     xmlns:cc="http://web.resource.org/cc/">
   <teiHeader>
      <fileDesc>
         <titleStmt>
            <title>Large-Scale Humanities Computing Projects: Snakes Eating Tails, or Every End is a New
      Beginning? </title>
            <author>William A. Kretzschmar, Jr.</author>
            <dhq:authorInfo>
               <dhq:author_name>William A. <dhq:family>Kretzschmar, Jr.</dhq:family>
               </dhq:author_name>
               <dhq:affiliation>University of Georgia</dhq:affiliation>
               <email>kretzsch@uga.edu</email>
               <dhq:bio>
                  <p>William A. Kretzschmar, Jr. is Harry and Jane Willson Professor in Humanities  at the University of Georgia, where he teaches English and Linguistics.  His major publications include <title rend="italic">The Linguistics of Speech</title> (Cambridge U Press, 2009); the <title rend="italic">Oxford Dictionary of Pronunciation for Current English</title> (with Clive Upton and Rafal Konopka; Oxford U Press, 2001); <title rend="italic">Introduction to Quantitative Analysis of Linguistic Survey Data</title> (with Edgar Schneider, Sage Publications, 1996); <title rend="italic">Handbook of the Linguistic Atlas of the Middle and South Atlantic States</title> (with Virginia McDavid, Theodore Lerud, and Ellen Johnson; U Chicago Press, 1994). He served as editor of the <title rend="italic">Journal of English Linguistics</title> for 15 years and now serves as board member for various professional journals, atlases, and dictionaries, including preparation of American pronunciations for the new online Oxford English Dictionary.  He is also the Editor of the American Linguistic Atlas Project, the oldest and largest national research project to study how people speak differently in different parts of the country.</p>
               </dhq:bio>
            </dhq:authorInfo>
         </titleStmt>
         <publicationStmt>
            <idno type="DHQarticle-id">000038</idno>
            <idno type="volume">003</idno>
            <idno type="issue">2</idno>
            <dhq:articleType>article</dhq:articleType>
            <date when="2009-06-18">18 June 2009</date>
            <availability>
               <cc:License xmlns="http://digitalhumanities.org/DHQ/namespace"
                           rdf:about="http://creativecommons.org/licenses/by-nc-nd/2.5/"/>
            </availability>
         </publicationStmt>
         <sourceDesc>
            <p>Authored for DHQ; migrated from original DHQauthor format</p>
         </sourceDesc>
      </fileDesc>
      <encodingDesc>
         <classDecl>
            <taxonomy xml:id="dhq_keywords">
               <bibl>DHQ classification scheme; full list available in the <ref target="http://www.digitalhumanities.org/dhq/taxonomy.xml">DHQ keyword taxonomy</ref>
               </bibl>
            </taxonomy>
            <taxonomy xml:id="authorial_keywords">
               <bibl>Keywords supplied by author; no controlled vocabulary</bibl>
            </taxonomy>
         </classDecl>
      </encodingDesc>
      <profileDesc>
         <langUsage>
            <language ident="en"/>
         </langUsage>
      </profileDesc>
      <revisionDesc>
          <change>Encoded by the author</change>
      </revisionDesc>
   </teiHeader>
   <text>
      <front>
         <dhq:abstract>
            <p>The word <q>finish</q> can mean two things that have quite different implications for large-scale humanities computing projects: <quote rend="inline">to bring to completion; to make or perform completely; to complete</quote> and also <quote rend="inline">to perfect finally or in detail; to put the final and completing touches to (a thing).</quote> The word <q>finish</q> is just not part of the deal for the Linguistic Atlas Project in either sense.  However, granting agencies must ask <q>what do you want money for this time?</q> and, from this viewpoint, the Atlas Project consists of a series of particular tasks or experiments, each one of which is capable of being <q>finished</q> in both senses of the word. This paper discusses the reality of funding, deadlines, and deliverables, as they relate to the sequence of tasks that make up the larger Atlas Project. There are no once-and-done, permanent solutions.  The largest humanities computing projects require continuing care and maintenance, and the best way forward is to create some sort of stable institutional setting for large projects that will provide continuity and baseline resources for the work.</p>
         </dhq:abstract>
         <dhq:teaser>
            <p>Completion of large-scale humanities computing projects is an illusion: they need to be  rebuilt continuously in order to survive.</p>
         </dhq:teaser>
      </front>
      <body>
         <head>Large-Scale Humanities Computing Projects: Snakes Eating Tails, or Every End is a New
      Beginning</head>
         <p>One of the motivating questions for this cluster of essays is <q>What does it mean to
        finish something?</q> As it happens, the word <q>finish</q> can mean two
      things that have quite different implications for large-scale humanities computing projects.
      On one hand, according to the <emph>OED</emph>,
      <q>finish</q> can mean <q>To bring to completion; to make or perform completely; to
      complete.</q> On the other hand, the word can also mean <q>To perfect finally or in detail; to put
      the final and completing touches to (a thing).</q> In my own work of this kind, the American
        Linguistic Atlas project (<ref target="http://www.lap.uga.edu">http://www.lap.uga.edu</ref>), we do neither of these
      things. We cannot come to an end of the work because we are witnesses and archivists of how
      Americans talk, and they keep talking differently across time and space. Neither do I think
      that our humanities-computing representation of our research is capable of being finally
      perfected, of achieving some perfect state, because technology keeps changing and the demands
      placed upon our research keep changing. If we view the entirety of the Linguistic Atlas
      Project as a <q>large-scale humanities computing project,</q> the word
        <q>finish</q> is just not part of the deal. And we are not alone. While the
      creation of, say, a variorum edition may seem like a project that can be finished in both
      senses, actually we need to make new editions all the time, since our idea of how to make the
      best edition changes as trends in scholarship change, especially now in the digital age when
      new technical possibilities keep emerging.</p>
         <p>However, it is quite reasonable to ask, as our granting agencies must ask, <q>What do you want
      money for this time?</q> or <q>Did you accomplish what we gave you money to do?</q> From this
      viewpoint, the Atlas Project, as an example to stand in for any large scale project, consists
      of a series of particular tasks or experiments, each one of which is capable of being
        <q>finished</q> in both senses of the word. It is these separate tasks that are
      fundable, and for which we can claim to have done what we said we would do. In this paper, I
      would like to discuss the sequence of tasks that make up the larger Atlas Project in order to
      show the special character of work done deliberately as part of a sequence for a large-scale
      project, as opposed to work proposed as a singular task. The contextualization of the separate
      tasks leads to special cases of what it means to <q>finish</q> the work in either
      sense. The point of what follows is not the Atlas Project itself, but instead the way that
      individual tasks respond to the technical and academic situation at the time. Our own
      technical work on the Atlas has responded to the microcomputer revolution, to the emergence of
      the Web, and to the development of text-encoding and computer multimedia. Our work has helped
      to drive changes in the academic study of language variation, from traditional dialect maps to
      use of rapid visualization methods and statistical processing for both text and audio data.
      The scope of these changes show how our work and thinking over the years have had to change
      and must continue to change, just as they must for other large-scale projects like major
      editions and dictionaries, so that we can avoid the charge of being the snake that eats its
      own tail. </p>
         <p>The Linguistic Atlas Project, per se, has been notable over many years for its twin goals of
      interactivity for research (including the use of GIS) and making its data sets accessible and
      available to other researchers and to the public. I first programmed a GIS system for our
      Linguistic Atlas data in 1990, presented at the ACH/ALLC meeting in Tucson in 1991 <ref target="#figure01">Figure 1</ref>; <ptr target="#kirk1991"/>; <ptr target="#kirk1992"/>;
        <ptr target="#kretzschmar1992"/>. The program, called LAMSASplot, took advantage of work we did
      with funding from NEH to keyboard words and phrases from our survey interviews so that they
      would be available for computer-assisted analysis. The LAMSASplot system was widely used for
      teaching and research on American English in the early 1990s, and it immediately led to
      breakthroughs in how we were able to think about language variation data <ptr target="#kretzschmar1994"/>, <ptr target="#kretzschmar1996a"/>; <ptr target="#lee1993"/>. The
      two most important design elements of that first system were the central column, which shows
      the frequency of occurrence of a word of interest, both by speaker and by community (most
      communities have more than one speaker), and the rightmost column, the GIS implementation.
      That column is composed of two layers, a standard base map layer that shows the state
      outlines, and a second layer consisting of plotted points at community locations that is
      generated according to the evidence for the word selected.</p>
         <figure xml:id="figure01">
            <graphic url="resources/images/figure01.jpg"/>
            <figDesc>Screenshot of the LAMSASplot</figDesc>
            <dhq:caption>LAMSASplot on the Mac platform, c. 1990</dhq:caption>
         </figure>
         <p>The LAMSAS GIS programs were prepared on Macintosh computers using the Foxbase relational
      database package. Foxbase permitted us to associate graphical coordinates with our linguistic
      data within a relational structure. Generation of these plots took only 90 seconds on the
      desktop computers of the day, at least a 250-fold improvement over the hours required for the
      charting that had to be done by hand by the pioneers in the field, Hans Kurath and Raven
      McDavid. The other advantage of LAMSASplot is that it made frequency counts and plotted one
      variant at a time. This practice was in sharp contrast to the method used by Kurath and
      McDavid for charting language data, using isoglosses, lines which showed the limit of
      occurrence of particular variants. As our pail map suggests here, lines just cannot do justice
      to the geographical distributions of linguistic features. Our GIS solution thus launched a new
      line of analysis for dialectologists based on differential frequency in feature distributions.
      The LAMSASplot humanities computing application not only used available technology to automate
      mapmaking, it provided a theoretical advance in how the data was analyzed. The Atlas project
      was thus one of many involved in the burgeoning field of computer-assisted scientific
      visualization, not just in the humanities, but the particular point of talking about the Atlas
      is to show how such large trends have particular effects in particular areas of study.</p>
         <p>We did not finish keyboarding all of our LAMSAS data with our NEH funding, which had been
      proposed as the first of a sequence of grants. We could show immediate benefits of getting the
      process started, in part through LAMSASplot, but unfortunately NEH did not fund our proposals
      to continue digitization of the data. There were various reasons offered by the panelists over
      the years, but I suspect that the refusals came down to the fact that the panelists preferred
      to fund new work over our historical interviews. The immediate analytical task for the Atlas
      was thus <q>finished</q> in both senses, both completed and refined, but only the
      immediate task; we were shut out from long-term funding to <q>finish</q> the whole
      project. </p>
         <figure xml:id="figure02">
            <graphic url="resources/images/figure02.jpg"/>
            <figDesc>Map of states used in Linguistic Atlas Projects</figDesc>
            <dhq:caption>Atlas Web Site, Mark 1</dhq:caption>
         </figure>
         <p>As the next task in our humanities computing implementation, we then ported the system to the
      Web, which I first demonstrated in 1996 (<ptr target="#kretzschmar1996b"/>; <ref target="#figure02">Figure 2</ref> shows the site as of 1998). We had been working on an
      interactive ftp/gopher system as early as 1993, but when Web technology became available we
      saw that it enabled perfectly what we had been attempting from another direction. The Web
      allowed us to make our data available to a wide public, with many additional interactive GIS
      features for locating speakers and information for the LAMSAS survey, and with static pages
      for the other regional surveys. We added additional data for our African American LAMSAS
      speakers plus data from speakers of Sea Island Creole (Gullah) in 1998 with funding from NSF.
      The GIS maps from LAMSASplot were ported to the Web, as in <ref target="#figure03">Figure
        3</ref> for the phrase <emph>quarter of</emph> in telling time (as opposed to <emph>quarter till</emph>, <emph>quarter to</emph>),
      where they now could be created by users almost instantaneously with a server-side script,
      another improvement in processing time of perhaps fiftyfold. We also offered maps to show
      where all of the speakers in a state resided (<ref target="#figure04">Figure 4</ref>). These
      maps were clickable to allow users to access all of the information about any given speaker
      and the speaker's survey responses (<ref target="#figure05">Figure 5</ref>). Besides maps, we
      enabled browsing and searches of the responses and of speaker information (<ref target="#figure06">Figure 6</ref>, <ref target="#figure07">Figure 7</ref>). We also changed
      the way that we stored our data: flat, comma-delimited files instead of a proprietary
      relational format. </p>
         <figure xml:id="figure03">
            <graphic url="resources/images/figure03.jpg"/>
            <figDesc>Map of interviewee distribution</figDesc>
            <dhq:caption>LAMSAS Web: <emph>quarter of</emph>
            </dhq:caption>
         </figure>
         <figure xml:id="figure04">
            <graphic url="resources/images/figure04.jpg"/>
            <figDesc>Distribution of interviewed speakers in Maryland</figDesc>
            <dhq:caption>LAMSAS Web: state map of speakers in Maryland</dhq:caption>
         </figure>
         <figure xml:id="figure05">
            <graphic url="resources/images/figure05.jpg"/>
            <figDesc>Sample data set for interviewee</figDesc>
            <dhq:caption>LAMSAS Web: Baltimore speaker (from clicking MD13E</dhq:caption>
         </figure>
         <figure xml:id="figure06">
            <graphic url="resources/images/figure06.jpg"/>
            <figDesc>Screenshot of the LAMSAS site</figDesc>
            <dhq:caption>LAMSAS Web: Browse Screen</dhq:caption>
         </figure>
         <figure xml:id="figure07">
            <graphic url="resources/images/figure07.jpg"/>
            <figDesc>Screenshot of LAMSAS informant database</figDesc>
            <dhq:caption>LAMSAS Web: Speakers (click of Figure 6, plain format informants</dhq:caption>
         </figure>
         <p>The Atlas Web was a significant advance for both teaching and research (<ptr target="#kretzschmar1997"/>, <ptr target="#kretzschmar2002a"/>), in line with the goals for an
      electronic atlas first set forth nearly a decade earlier <ptr target="#kretzschmar1988"/>. 
      The plotted maps and much of the data were the same as what was available for the Mac
      LAMSASplot package — besides the African American data, we had keyboarded some additional
      files, a few at a time as resources permit, as we are still doing — but instead of an
      incremental improvement, the Atlas Web site was actually another sea change for the project.
      For the first time the whole Atlas project was represented together, not just as separate
      regional parts, and for the first time we could offer real public access to as much data as we
      could digitize. Again, the example of the Atlas just shows the particular significance in our
      area of our participation in the main international trend for Web representation of
      information. </p>
         <p>These were good things, revolutionary in their way, but the fact is that we <emph>had</emph>
      to do something: the world had changed from mainframes and desktops to UNIX servers. We had
      actually been using both mainframe processing (for intensive statistics jobs) as well as the
      desktop Macs and PCs. If we had not acted, the proprietary software that we were using would
      have gone out of use (FoxBase was sold, and the database for our PCs, RBase, also
      disappeared), and so we would have lost our investment in preparation of proprietary data
      files. We would have missed the Web revolution. So, not to change would have meant that the
      Atlas project would be stuck on paper, where it had been before we developed any computer
      applications at all. We did benefit from the NSF grant I mentioned, and from a small internal
      University of Georgia grant that allowed us to set up our own server, both limited tasks that
      we could propose as achievable within a short time and for which we could show results. We
      still could not get funding to do the whole project, and if we had waited for that we would
      still be waiting. Every large humanities computing project faced similar problems: adapt, or
      become a footnote in academic history.</p>
         <figure xml:id="figure08">
            <graphic url="resources/images/figure08.jpg"/>
            <figDesc>Linguistic Atlas Projects homepage</figDesc>
            <dhq:caption>LAMSAS Web, Mark 2, c. 2005</dhq:caption>
         </figure>
         <p>After a while, we wanted to do new things, so we began work on a major revision of the Web
      site that came on line in 2003 (<ref target="#figure08">Figure 8</ref>, shown as of 2005). Our
      earlier GIS visualizations were just not as flexible as they needed to be to satisfy the
      demands of sociolinguistics, which had come to expect association of linguistic features with
      particular social variables (e.g. sex, age, social status). We kept the interactive plotted
      maps and layered GIS access to information, but added even more interactive choices such as
      more flexible searches and tallies of the speakers and language data set. Browse screens are
      now composed on the fly from separate data files, as in <ref target="#figure09">Figure 9</ref>
      from the quarter of data and speaker characteristics, so that the linguistic information is
      immediately associated with the social characteristics of those who said it. For the first
      time we enabled searching by pronunciation features, not just words or strings, and we offered
      the ability to do sub-searches in sequence so that users could zero-in on features and groups
      of interest (<ref target="#figure10">Figure 10</ref>). In order to accomplish these goals, we
      again changed our manner of storing and manipulating data from a homemade phonetic font to
      Unicode, and from flat files and scripted regular expressions to a MySQL implementation.
      However, we kept <q>The Old Site</q> as a link on the new one, so that long-time
      users would find what was familiar to them, and also for users who did not want the greater
      complexity that came with greater flexibility of use (see the link on the bottom left corner
      of <ref target="#figure08">Figure 8</ref>). We could not just move it, however, because
        <q>The Old Site</q> had to be compatible with Unicode and with the extensive
      Python scripting that ran functions on the New Site. The task of importing the Mac-based GIS
      system to the Web was complete by 1996, but was finished, in that second sense, in 2003 with
      the platform change and the new touches of the more flexible site. Again, other large-scale
      humanities computing projects faced similar problems; we all had to adapt to new scripting
      languages and environments, new tools like MySQL and Unicode, and new scholarly trends and
      demands in our particular areas. </p>
         <p>The transition to the Web and the further refinement of our Web tools in the late 1990s and
      early years of the new century were not accomplished by means of external funding from federal
      sources. We did win awards to record the African American data and to conduct new kinds of
      research (e.g., NSF awards to explore neural-network analysis of Atlas data, and to conduct a
      pilot project for random-sample survey research in Atlanta), and some of these resources were
      applied to computer work, but no award specifically funded our Web innovations. Development of
      our Web site was primarily accomplished by two graduate assistants, Rafal Konopka and Eric
      Rochester, who worked on Atlas computer projects for twelve years between them, first Konopka
      through our initial Web implementation, then Rochester through the first major revision. They
      held graduate assistantships, sometimes fully funded for research but more often cobbled
      together from different sources including both research and teaching. The consistent, stable
      element in this funding came from the Hans Kurath Fund of the American Dialect Society. The
      Kurath Fund is an endowment for the Atlas created by Raven McDavid and maintained by the
      American Dialect Society <ptr target="#kretzschmar2003"/>. The Dialect Society hold title to
      the Atlas research materials, and its Executive Council ratifies editors and advisory board
      members for the project. While the Kurath Fund could not support a complete graduate
      assistantship, it could pay enough so that other funding could round out a position.
      Similarly, while the Kurath Fund could not pay all of the operational expenses of the project,
      it could provide a key piece of equipment from time to time, repairs, or specialized supplies.
      The University of Georgia had agreed to provide space for the project and some operational
      support, in conjunction with the author's faculty appointment (a more permanent agreement may
      in time be possible). Thus the funding that allowed continuous development of the Atlas Web
      site has come from multiple sources, and still does. We are always in the position of putting
      together the pieces of the funding structure year by year. The central role of the American
      Dialect Society, however, provides the essential stability that has made continuous operation
      of the editorial site possible.</p>
         <figure xml:id="figure09">
            <graphic url="resources/images/figure09.jpg"/>
            <figDesc>Screenshot of search engine results</figDesc>
            <dhq:caption>LAMSAS Web 2: Browse Screen</dhq:caption>
         </figure>
         <figure xml:id="figure10">
            <graphic url="resources/images/figure10.jpg"/>
            <figDesc>Screenshot of search engine fields</figDesc>
            <dhq:caption>LAMSAS Web: Sub-Search</dhq:caption>
         </figure>
         <figure xml:id="figure11">
            <graphic url="resources/images/figure11.jpg"/>
            <figDesc>Screenshot of similar work in the Netherlands</figDesc>
            <dhq:caption>LAMSAS Web: Work by Others</dhq:caption>
         </figure>
         <p>The additional changes for the LAMSAS Web Mark 2 may seem more incremental than
      revolutionary, but again we feel that they responded to a changing environment that could not
      be ignored, new scholarly imperatives. For the first time our linguistic survey data was now
      fully available for sociolinguistic research. Dialectology is a venerable pursuit in
      linguistics, but it had been overtaken by sociolinguistics in recent decades and declared
      essentially to be irrelevant by many sociolinguists. The changes that we made to our Web site
      integrated social and geographical analysis of language variation in a single visualization
      that, in itself, showed graphically that the two approaches were not only compatible but
      inseparable. In addition, we also began to link our site with other sites, so that the Atlas
      Project could be seen in its connections with other online resources. Finally, we began to
      post completed analyses on the site in addition to raw data and associated information; some
      of these are results from our own funded research, others, such as those from John Nerbonne in
      Groningen (<ref target="#figure11">Figure 11</ref>), are from the research of people who used
      our freely-available data. In this way we could make and defend the claim that our Atlas
      survey research was a member of a wider community of those studying language variation. We did
      respond to emerging computational opportunities in the New Site, like Unicode and Python and
      MySQL, but the most significant part of the new changes spoke to issues of linguistic theory
      and analysis. And again, nobody funded completion of digitization of all of the data from the
      Atlas.</p>
         <figure xml:id="figure12">
            <graphic url="resources/images/figure12.jpg"/>
            <figDesc>Information flowchart for linguistic interviews</figDesc>
            <dhq:caption>Future LAMSAS Web</dhq:caption>
         </figure>
         <p> Still, the larger Atlas Project is nowhere near at an end. We are now rethinking what the
      site should do, from a text-based system to one that features audio and stored images along
      with text and GIS. This change has become possible only in the last two years, as much greater
      network-attached storage has become available (measured in Tb, before long Pb). Because of our
      archival audio files, we are now one of the largest clients at the University of Georgia
      institutional storage array (which we share with bio-informaticists, physicists, and others
      usually considered to be power users). We now conceive of our new interviews as conversational
      corpora, in which text transcriptions serve as time-linked indices to audio files <ptr target="#kretzschmar2005a"/>; <ptr target="#kretzschmar2006"/>. <ref target="#figure12">Figure
        12</ref> charts the flow of information that we envision for the next-generation site. While
      many users will want to listen to our speakers, others will want to perform acoustical
      analyses, now a strong trend in language variation research, as we ourselves now perform them
      (e.g., <ptr target="#kretzschmar2004"/>; <ptr target="#kretzschmar2005b"/>). Our next task is to
      integrate sound with text and to enable acoustical research functions, while maintaining our
      interactive GIS functions — a whole new set of tools and problems from the previous task <ptr target="#kretzschmar2002b"/>. As the flowchart suggests, we now envision eight different
      outputs from a new Web presence: full text transcripts, linked sound, acoustical phonetic data
      in lists and plots, tallies of feature variants in lists and on maps, and technical
      statistical results in both lists and maps. While any single task from this list is possible,
      some will be difficult to automate, and the hardest thing of all will be to integrate all
      eight outputs. We have not yet made these changes; they have been designed for some time now,
      and await the right people and circumstances and resources to make them real (our major
      revision cycle seems to be 5 to 10 years). When we can do that, we will have integrated our
      existing modes of study for language variation, surveys and sociolinguistics, with corpus
      linguistics, the latest major contemporary approach to language variation. We will have
      provided not just words and phrases extracted from interviews, but a full complement of
      information in our interviews, from sound recording to transcripts to analyses of acoustical
      characteristics. </p>
         <p>So, will we ever be <q>finished</q>? Yes, with the GIS of LAMSASplot and of
        <q>The Old Site,</q> though still (as always) tweaking the current site. We have
      plans to make yet another thorough revision. But we will never be finished either with keeping
      older sites and available or with creating new visualizations for the information we keep, as
      new technical possibilities and research demands appear. We can complete particular tasks, and
      often we can even <q>finish</q> particular tasks in the sense of polishing them for
      improved use. Yet one research proposal does not make the whole research program. While we can
      often fund and succeed with individual tasks, we must always see tasks as part of the larger
      process that probably will not be funded completely but still must continue for future users.
      We would be the snake eating its own tail if all we did was keep polishing eternally a single
      task (or worse, the dog that pointlessly chases his own tail), but we do well to make every
      end a new beginning as new technological possibilities become available, and new theories and
      styles of analyses take hold. </p>
         <p>After twenty-five years of trying to apply humanities computing to the problems of the
      Linguistic Atlas, it has become clear that there are no once-and-done, permanent solutions.
      The largest humanities computing projects are likely to require continuing care and
      maintenance, if not more radical representation and reinterpretation in light of the advance
      of scholarship, and yet they seem unlikely ever to be funded comprehensively for these tasks.
      The best way forward is to create some sort of stable institutional setting for large projects
      that will provide continuity and baseline resources for the work. This we have done for the
      Linguistic Atlas, through the American Dialect Society and our association with the University
      of Georgia. Stable institutional settings allow for additional resources to be sought for
      individual tasks, as the need for each one becomes apparent. In my general area, the
      Dictionary of American Regional English project at Wisconsin and the Dictionary of Old English
      project at Toronto, both innovative users of humanities computing, have found long-term
      funding through NEH and have also had strong institutional support. We can hope that Wisconsin
      and Toronto will not end their association with these projects when their first editions are
      completed. A good model for what might happen has occurred at Michigan, where the completion
      of the (print) <emph>Middle English Dictionary</emph> was followed by the creation of the
        electronic <ref target="http://quod.lib.umich.edu/m/mec/">
               <title>Middle English Compendium</title>
            </ref> 
            <ptr target="#mcsparran2000"/> in
        conjunction with the University of Michigan Library. Conversion was assisted by a grant from NEH.
      We are now attempting something of the kind at Georgia, with our new grant from NEH which will
      allow us to make digital conversions of the existing audio-taped interviews from the
      Linguistic Atlas, and which will also help us to make a new public audio archive of these
      materials. </p>
         <p>Thus, at the end of the day, I recommend that, besides grants for new tasks and new work,
      granting agencies should consider assisting in the formation of enduring institutional
      structures to support large-scale projects in humanities computing. Some existing programs,
      like NEH Challenge Grants, may already be helpful for this purpose. The establishment of
      common interests between the library community and researchers in humanities computing,
      already well started in the ACH and ALLC and exemplified by projects like the <emph>Middle
        English Compendium</emph>, is an excellent trend to continue. We will probably never be able
      to fund big projects like mine at the level needed to finish them completely. But that is OK,
      because the idea of finishing big humanities computing projects once and for all is just an
      illusion. We will always need to address changes in technology and changes in our disciplines,
      even for subject materials that are historical and unchanging. We need, therefore, to accept
      the rule for Talmudic scholarship that my colleague Virginia McDavid once passed along to me:
      <quote rend="inline">You do not have to finish the work, but neither may you desist from it.</quote> We just have to keep
      at it, and find the resources we need along the way.</p>
      </body>
      <back>
         <listBibl>
            <bibl xml:id="kirk1991" label="Kirk and Kretzschmar 1991">
      Kirk, John, and William A. Kretzschmar, Jr. <title rend="italic">The Analysis and Interpretation of Dialect
        Databases by Interactive Mapping.</title> ACH/ALLC Conference, Tempe, 1991.
    </bibl>
            <bibl xml:id="kirk1992" label="Kirk and Kretzschmar 1992">
      Kirk, John, and William A. Kretzschmar, Jr. 1992. <title rend="quotes">Interactive Linguistic Mapping of
        Dialect Features</title>. <title rend="italic">Literary and Linguistic Computing</title> 7:
        168-75.
    </bibl>
            <bibl xml:id="kretzschmar1988" label="Kretzschmar 1988">
      Kretzschmar, William A. Jr. 1988. <title rend="quotes">Computers and the American Linguistic Atlas</title>. In
          <title rend="italic">Methods in Dialectology: Proceedings of the Sixth International
          Conference on Methods in Dialectology</title>, edited by Alan Thomas, 200-24. Clevedon:
        Multilingual Matters.
    </bibl>
            <bibl xml:id="kretzschmar1992" label="Kretzschmar 1992">
      Kretzschmar, William A. Jr. 1992. <title rend="quotes">Interactive Computer Mapping for the Linguistic Atlas
        of the Middle and South Atlantic States (LAMSAS)</title>. In <title rend="italic">Old English and
          New: Essays in Language and Linguistics in Honor of Frederic G. Cassidy</title>, edited by
        N. Doane, J. Hall, and R. Ringler, 400-14. New York: Garland.
    </bibl>
            <bibl xml:id="kretzschmar1994" label="Kretzschmar 1994">
      Kretzschmar, William A. Jr. 1994. <title rend="quotes">Linguistic Theory and Computer Modeling of Linguistic
        Survey Data.</title> ACH/ALLC, Paris, 1994.
    </bibl>
            <bibl xml:id="kretzschmar1996a" label="Kretzschmar 1996a">
      Kretzschmar, William A. Jr. 1996a. <title rend="quotes">Quantitative Areal Analysis of Dialect Features</title>.
          <title rend="italic">Language Variation and Change</title> 8: 13-39.
    </bibl>
            <bibl xml:id="kretzschmar1996b" label="Kretzschmar 1996b">
      Kretzschmar, William A. Jr. 1996b. The LAMSAS Internet Site, NWAVE, Las Vegas.
    </bibl>
            <bibl xml:id="kretzschmar1997" label="Kretzschmar 1997">
      Kretzschmar, William A. Jr. 1997. <title rend="quotes">Computer-Assisted Study of American English Lexical
        Data</title>. In <title rend="italic">From AElfric to the New York Times: Studies in English Corpus
          Linguistics</title>, edited by Udo Fries, Viviane Müller, and Peter Schneider, 239-47.
        Amsterdam, Atlanta: Rodopi.
    </bibl>
            <bibl xml:id="kretzschmar2002a" label="Kretzschmar 2002a">
      Kretzschmar, William A. Jr. 2002a. <title rend="quotes">Teaching American English Online</title>. <title rend="italic">Journal of English Linguistics</title> 30: 318-327.
    </bibl>
            <bibl xml:id="kretzschmar2002b" label="Kretzschmar 2002b">
      Kretzschmar, William A. Jr. 2002b. TEI and Linguistic Interviews. <ptr target="http://www.tei-c.org/"/>.
    </bibl>
            <bibl xml:id="kretzschmar2003" label="Kretzschmar 2003">
      Kretzschmar, William A. Jr. 2003. <title rend="quotes">Linguistic Atlases of the US and Canada</title>. In <title rend="italic">Needed Research in American Dialects</title>, ed. by Dennis Preston.
          <title>Publications of the American Dialect Society</title> 88: 25-48.
    </bibl>
            <bibl xml:id="kretzschmar2006" label="Kretzschmar 2006">
      Kretzschmar, William A. Jr., Jean Anderson, Joan Beal, Karen Corrigan, Lisa-Lena
        Opas-Hanninen, and Bartek Plichta. 2006. <title rend="quotes">Collaboration on Corpora for Regional and Social
        Analysis</title>. <title rend="italic">Journal of English Linguistics</title> 34: 172-205.
    </bibl>
            <bibl xml:id="kretzschmar2005a" label="Kretzschmar 2005a">
      Kretzschmar, William A. Jr., Betsy Barry, and Nicole Kong. 2005. Publication of Full
        Interviews from the Atlanta Survey Project. ADS/LSA 2005, Oakland.
    </bibl>
            <bibl xml:id="kretzschmar2005b" label="Kretzschmar 2005b">
      Kretzschmar, William A. Jr., MiRan Kim, and Nicole Kong. 2005. Vowel Formant
        Characteristics from the Atlanta Survey Project. ADS/LSA 2005, Oakland.
    </bibl>
            <bibl xml:id="kretzschmar2004" label="Kretzschmar 2004">
      Kretzschmar, William A., Jr., Sonja Lanehart, Betsy Barry, Iyabo Osiapem, and MiRan
        Kim. 2004. Atlanta in Black and White: A New Random Sample of Urban Speech. NWAVE 2004, Ann
        Arbor.
    </bibl>
            <bibl xml:id="mcsparran2000" label="McSparran 2000">
      McSparran, Frances. 2000. <title rend="italic">Middle English Compendium</title>.
        http://quod.lib.umich.edu/m/mec/ 1954-2007. <title rend="italic">Middle English
          Dictionary</title>. Ed. by Hans Kurath, Sherman Kuhn, Robert Lewis, et al. Ann Arbor:
        University of Michigan Press.
    </bibl>
            <bibl xml:id="lee1993" label="Lee and Kretzschmar 1993"> Lee, Jay, and William A. Kretzschmar Jr.  1993.  <title rend="quotes">Spatial Analysis of Linguistic Data with GIS Functions</title>. <title rend="italic">International Journal of Geographical 
          Information Systems</title> 7: 541-60. 
      </bibl>
         </listBibl>
      </back>
   </text>
</TEI>
