Tingle's Technology Notes

I am the Technical Lead for CDL Digital Special Collections, these are some notes of mine, views expressed are my own or the original author's and do not represent my employer. http://btingle.bitbucket.org/
Recent Tweets @tingletech
Posts I Like
Who I Follow
top 100 tags
Posts tagged "Chicago DHCS"

Chicago Digital Humanities Plenary Session

Jon Orwant (Google): More Stuff and more things to do with it. A talk with responses by Neil Fraistat (Maryland), and Jeremy York (Michigan)

Brian Tingle’s notes from Jon’s talk [google doc reader] [publish to the web version]

Two stacks

  1. Computation Stack
  2. Linguistic Stack

Computation Stack

Computation

Query

Data

Semantic Stack

ideas

facts, sentences, quotations

parts of speech

structure

letters, punctuation, symbols

text, pictures

pages

Move computing to the data (too much data to move to each researcher)

ex post facto criteria for google DH grants

Is it feasible?

  1. unrealistic requirements for clean data
  2. AI-ex-machin

Is it worthwhile?

  1. someone else doing it?
  2. is it too easy?
  3. is it broad enough?

Can we provide needed assess?

Does it enable other researchers?

Does it illustrate a new style?

  1. refactoring the victorian
  2. what walter holten did

Move up a level in a stack.

Improve level in a stack (reuse-able data set)

Or reasonable using one level as a proxy for a higher level (i.e. structure as a proxy for ideas)

Moving up a level by creating tools (intralanguage translations {early english to modern english})

during the comments from the others this was all I wrote down

corpra space design in bamboo

DAVILA is an open source relational database schema visualization and annotation tool. It is written in Processing using the toxiclibs physics library and released under the GPLv3.

Currently MorphAdorner provides methods for adorning text with standard spellings, parts of speech and lemmata. MorphAdorner also provides facilities for tokenizing text, recognizing sentence boundaries, and extracting names and places.

“Abbot: uses small XSLT stylesheet to dynamically write a big one (9,000 lines!) kind of an XSLT generator” (@trevormunoz )

“MONK is a digital environment designed to help humanities scholars discover and analyze patterns in the texts they study (see the table at the bottom of this page for detailed information on what is in the MONK datastore, and terms of availability). The MONK project has been generously supported by the Andrew W. Mellon Foundation, from 2007-2009, and InCommon integration has been supported in 2009 by the CIC Library Directors. All code produced by the project is open source. MONK has a publicly available instance with texts contributed by Indiana University, the University of North Carolina at Chapel Hill, the University of Virginia, and Martin Mueller at Northwestern University.”

“IDP is a ground-breaking international collaboration to make information and images of all manuscripts, paintings, textiles and artefacts from Dunhuang and archaeological sites of the Eastern Silk Road freely available on the Internet and to encourage their use through educational and research programmes.”