Introduction
Khmer
        Dictionary
        Analysis
        Corpus
        Bitext
        Images

Classical Southeast Asian Texts and Reference
Southeast Asia's golden age of epigraphy extends from roughly the 5th through the 15th centuries. The SEAclassics Library of epigraphic texts, Indic and epigraphic dictionaries, and research-oriented software tools will, for the first time, make this widely scattered body of work accessible to the international scholarly community.
   The SEAclassics Library will make a significant contribution to scholarly practice, presenting these texts as a unified regional resource, using a consistent transcription system, and allowing sub-corpora to be defined by time or location as well as language or culture. For the first time it will be possible to track references to goods, geographic features, or cultural practices, or to trace specific terms derived from Sanskrit or Pali - not to mention the developement of language itself - throughout the region's interlocking cultural spaces.
Project Scope
SEAclassics' initial focus will be on the mainland Southeast Asian inscriptions, including Cham, Mon, Khmer, Pyu, Burmese, and Tai texts. These texts are seldom available in the US, even in print form, and material is often unavailable in English. We will create a uniform reference framework that provides access to:
 -  on-line text corpora, based on the best available published transcriptions (and transliterate those, like the Sukhothai texts, typically available only in modern local orthography).
 -  lexicographic references to the inscriptions, including Jenner's Angkorian and pre-Angkorian dictionaries, and Shorto's dictionary of the Mon inscriptions (and translate those, like the Sukhothai glossaries, that are not available in English).
 -  the standard Indic references, including Turner's authoritative Comparative Dictionary of Indo-Aryan Languages, Burrow & Emeneau's Dravidian Etymological Dictionary, and Monier-Williams' and Rhys-Davids' classic Sanskrit and Pali dictionaries.
Project Implementation
SEAclassics will be standards-based, and built along lines similar to EpiDoc-based resources like the US Epigraphy project, or the ad hoc Perseus project. However, we will develop considerably more sophisticated tools for:
 -  corpus analysis, providing functionality for searches by the content of word definitions (e.g. all 'gold' words), or for reverse searches by Sanskrit or Pali root, or for comparing the content of sub-corpora (e.g. comparing the lexicon of Thailand's contested Inscription One against the rest of the Sukhothai corpus).
 -  linguistic analysis, including tools for collocate analysis, exicography, and study of larger-scale syntactic features.
 -  dictionary reference, including intelligent approximation and 'assisted search' algorithms, and the ability to search Indic references using modern SEA orthographies.
   We do not anticipate wide access to images of the inscriptions at this point, but will build in functionality necessary to manage an image library tied to the searchable text corpus. Nor are reliable English readings of the texts are generally available; we will, however, create bitext corpus and search functionality.
Project Audience
Epigraphic inscriptions help tell the story of the Southeast Asian art, ecology, and economy, and of the region's political, religious, and cultural development. SEAclassics resources will find an international scholarly audience, including researchers in linguistics (including historical and comparative lingusitics), lexicography (both modern and historical), epigraphy and philology, history, political science, anthropology, art history, and religion.
   There is a tremendous need for the SEAclassics library, particularly within the United States. Despite the efforts of eight SEA National Research Centers, and collaborative groups like CORMOSEA (the national SEA libraries council) and CAPSEA (the Library of Congress-organized cooperative acquisitions program for SEA), Southeast Asian Studies has been in steady decline since the 1970's. By helping to level the extraordinarily steep learning curve required to do research - even modern SEA has ten or more different, widely used non-Roman writing systems - the SEAclassics library wll help prompt a new kind of regionally oriented SEA scholarship.
   We do not wish to oversell SEAclassics - we will not re-evaluate the existing transcriptions or write new dictionaries. We will, however, make a century of scholarship, now scattered and available only in paper form, as accessible as equivalent resources for Greek and Latin. In addition, we will make a substantial contribution to accurately documenting both classical and modern Indic etymology throughout the region.
Advisory Board
Preliminary contacts make it clear that we will have no difficulty attracting leading international scholars to the SEAclassics advisory / consulting board. For example, Prof. Philip Jenner, has agreed to let the SEAclassics Library distribute his pre-Angkor and Angkorian dictionaries, and has offered to help move the project forward in any way possible. There has been an equally enthusiastic response to the prospect of improved access to Sanskrit/Pali/Dravidian lexicographic resources, above and beyond their relevance to Southeast Asia. A complete board, with experts in all relevant areas, will be in place before we make any formal funding requests.
Institutional Context
The Center for Research in Computational Linguistics (CRCL Inc., a US 501(c)3 nonprofit) is equally well-placed. CRCL has extensive experience building research-ready on-line scholarly resources. For example, CRCL is implementor and co-sponsor (with the US Dep't of Education) of the SEAlang Library of modern bilingual dictionaries and mono/bilingual text corpora, and prepared HL Shorto's unpublished Dictionary of Comparative Mon-Khmer (Dr. Paul Sidwell, Editor) for print and electronic publication.
   The SEAlang Library will also play a significant supporting role. Between 2006 - 2009, ten leading Southeast Asian bilingual dictionaries, including Haas for Thai, Headley '77 and '97 for Khmer, Shorto for Mon, etc. have been made freely accessible on-line. These resources will facilitate use of the SEAclassics Library, which will in turn allow for much-needed improvements to modern reference materials. We also have every reason to expect that the international cooperation we received on the SEAlang Library, including support from both institutions (including the Ecole francais d'Etreme Orient, by far the best-established institution in Southeast Asia) and professional organizations (including the Committee on Research Materials for Southeast Asia) will extend to the SEAclassics Library as well.