Several Project Bamboo members are currently in Seattle, WA at the 2012 Modern Language Association (MLA) Convention. Those attending include Neil Fraistat (University of Maryland), who is participating in a panel discussion on “#alt-ac: The Future of ‘Alternative Academic’ Careers”; Harriett Green (University of Illinois, Urbana-Champaign), whose paper “Collaborative Economies: Tools and Strategies for Scholars and Libraries” highlights Project Bamboo and the study she is conducting on scholars and digital collections; and Quinn Dombrowski (University of Chicago) who is representing Bamboo DiRT. Neil and Quinn also participated in a day-long DHCommons pre-conference workshop. Say hello to Neil (@fraistat), Harriett (@greenharr), and Quinn (@quinnanya) at MLA, and follow their conversations on Twitter.
Archive for the ‘News’ Category
How May Digital Collections Serve Scholarly Needs?
As part of the Collections Interoperability working group, we are investigating the question of scholars’ needs with digital collections: What kind of functionalities, features, and/or services do humanities scholars need in digital collections, in order for the collections to be useful in research?
The reason we ask is twofold: First, we’d like to know what types of digital collections should be prepared and incorporated into the Bamboo research platform. While there are a few all-encompassing general digital collections, such as the Hathi Trust Digital Library, there are many more digital collections with limited content or specialized focuses, and it is hard to determine how to select collections for incorporation into Bamboo.
Secondly, a larger question faces libraries and digital libraries about effective collection development strategies for digital collections: How can we build digital libraries that aren’t simply mass collections of materials or are based on libraries’ classifications, but that directly address scholars’ research needs?
To explore this question, we decided to launch a study that would create a needs assessment for scholars and digital collections. Over the summer, I worked with Indiana University librarian Angela Courtney to contact humanities librarians, digital humanities coordinators, and academic technologists at the twelve member institutions of the CIC academic consortium and participating Bamboo partner institutions. We ultimately convinced nine librarians and staffers to work with us on conducting a survey and interviews with their humanities faculty. The participating institutions are the University of Illinois at Urbana-Champaign, Indiana University, Northwestern University, University of Illinois at Chicago, University of Nebraska–Lincoln, Michigan State University, University of Iowa, University of Chicago, University of Minnesota, Penn State University, and the University of Maryland.
After months of IRB wrangling, writing the test instruments, pre-testing, and consultations, we launched the study in late October. A survey has been distributed to randomly selected faculty members in all of the English and history departments at the aforementioned institutions, and will run through December. Interviews will be conducted in November and December with select faculty members from fine arts and performing arts departments on the campuses who are involved in digital scholarship. Follow-up interviews will also be conducted with survey respondents who indicated a willingness to be interviewed.
We anticipate that this study will enable us to gain new insights into the transformations occurring in humanities research with the advent of digitized materials. An update will be forthcoming as results are analyzed this winter, and we’re excited for what the data will tell us!
Harriett Green is English and Digital Humanities Librarian at University of Illinois at Urbana-Champaign.
Bamboo Affiliates: Opening Up New Avenues for Collaboration
Project Bamboo has established an Affiliates program as a way to involve non-partner institutions who have similar interests and goals to Project Bamboo. We hope that these partnerships will mutually serve and benefit each party, creating long-term sustainability for the future.
NINES (Nineteenth-century Scholarship Online) supported by the University of Virginia and the Advanced Research Consortium (ARC) supported by the Initiative for Digital Humanities, Media, and Culture at Texas A&M University, taken together, serve as one example of alignment with Project Bamboo. The University of Alabama is also currently aligned with Project Bamboo. NINES/ARC members act as feedback channels for Bamboo by providing guidance towards scholars’ needs for Bamboo tool builders. The University of Alabama is participating in the pilot planning for Bamboo and is also contributing to the development of Bamboo DiRT. Alabama will be an early adopter of Bamboo DiRT, to meet the needs of their faculty and students in coordination with the Digital Humanities Center in the University Libraries. To learn more on Bamboo DiRT please read the recent post by Quinn Dombrowski, who is assisting the development of Bamboo DiRT.
I recently spoke to Laura Mandell, professor of English at Texas A&M University and director of ARC, on how ARC and Project Bamboo may mutually serve their communities. As a ground-up, scholarly-driven organization, Mandell sees ARC as being able to “bring the scholarly user base to Bamboo” and using Project Bamboo “to hook people up to the larger Digital Humanities community.” ARC will assist Project Bamboo in building a network by disseminating and providing test beds for technology developed through Bamboo.
Mandell says, “ARC and Bamboo are working together to shape the datasets that scholars have access to, so that they meet the highest standards and are findable. As things become digitized, search and research are not as separate as they were in the era of the book. We need to bring scholarly expertise back into the conversation; we’re working towards the same goal and hoping to meet in the middle.” Project Bamboo recognizes the advantages that both technologists and humanists bring to the table. Through the development of accessible, digital tools that focus on texts, we hope to incorporate the strengths of text-analysis, corpus search, and visualization, to allow a scholar to discover and curate content in new and exciting ways.
At the University of Alabama, Thomas C. Wilson, Associate Dean for Library Technology, has led a series of discussion initiatives with faculty and technologists on how scholarship needs may be better met at the university level. Faculty have voiced requests for a “sand-box type environment to experiment, play, learn and process.” Wilson says, “We are lacking a space where individual scholars can go, in an ad hoc way, connect with a collection, and apply tools to the collections.” This is where Project Bamboo comes in: we are in the process of building a coordinated infrastructure for scholars to take advantage of digital tool and service functionality in a research environment. By giving scholars access to collections, Bamboo is setting up a virtual space for scholars to “experiment, play, learn and process.”
Bamboo looks not only to work with faculty, but all players across the humanities — whether they be librarians, students, or the lone scholar. The strength of the digital humanities field lies in its collaborations. Project Bamboo seeks to assist these various forms of collaboration — from faculty member to librarian, teacher to student, scholar to collection, and scholar to scholar — in order to transform the future of humanities scholarship in the digital age. The Bamboo Affiliates program is one visible and concrete commitment to collaboration going forward.
Jim Muehlenberg, Assistant Director for Academic Technology at University of Wisconsin-Madison, is leading the Bamboo Affiliates program. Muehlenberg says, “We look forward to expanding the Affiliates model to humanists at smaller liberal arts colleges, many of which were active participants in the Bamboo Planning Project; we’ve been in conversation with leaders in the NITLE organization to find an effective approach to reach these scholars. The Bamboo Affiliates model should serve us well as we complete the current phase of Bamboo and enter into Phase II, and will be a cornerstone towards the project’s sustainability into the future.”
Over the course of the next few months, we will be releasing a series of demonstrators, that highlight what tools and services we are building to meet scholars’ needs. Please stay tuned to the project wiki and our website to see how you may join the Bamboo community.
Diggable Data, Scalable Reading, and New Humanities Scholarship
Later this week I will be attending the 2nd International Culture and Computing Conference at the University of Kyoto and presenting the paper “Diggable Data, Scalable Reading, and New Humanities Scholarship.” Digital Humanities is rapidly gaining a foothold in Japanese academic scholarship, and this conference features a strand devoted to new methodologies, ideas and outcomes that arise from the application of digital methods.
In this paper, co-written with Neil Fraistat, we address two interrelated gains of the digital turn in humanities scholarship – one political and the other intellectual. First is the popularizing of humanities scholarship through the opening of opportunities for transmission of sources and outputs of digital scholarship. The paper then looks at some approaches to big data in the humanities, critiquing their value and pointing to some of the methodological questions they raise. It then goes on to argue for digital textual scholarship that can move from the massive to the particular, borrowing a phrase from Martin Mueller to argue for ‘scalable reading’, ultimately explaining how Project Bamboo will support the opening up of scholarship and scalable reading through digital means.
In a 1935 article in the Yale Review the historian Robert C. Binkley wrote, “Micro-copying is a technique that will… give the reader exactly what he wants, and bring it to him wherever he wants to use it.” Binkley was an advocate of democratizing scholarship through the application of the new media technologies of the first half of the twentieth century. Similar arguments are often made today by advocates of the digital humanities. There are strong parallels between Binkley’s approach and the gains in public humanities that have arisen from the digitization of the artifacts of human culture. The ease of transmission and the relatively low-cost of delivery that digitized works allow has a democratizing effect on scholarship, engaging a much broader public in a range of scholarly activities.
The Google Ngram Viewer brought the idea of using computation to study culture to many who had previously been unaware of its potentials. The Ngram Viewer is based around a very simple idea: type in two or more words and you get a comparison of their occurrence in the Google Books corpora over time. A range of questions of interest to humanities scholars is possible: When did a word enter common usage? When did words fall out of favor? What is the historical trajectory of a concept in, for example, nineteenth-century politics? How much were people writing about a literary figure, or a work of fiction?
The paper critiques this and several other approaches to the use of big data in the analysis of texts – including Franco Moretti’s ‘Distant Reading’ of literary history – and then builds on these to argue for ‘scalable’ textual scholarship. Scalability in this context utilizes new computational approaches that allow for the interrogation of massive text objects far beyond the capability of the individual reader, while simultaneously allowing for traditional forms of close reading. Rather than only providing the opportunity for abstraction of many texts it should be possible for scholars to investigate closely the component parts that the computer utilized in obtaining the abstraction. For every step away from the text the scholar will be provided with the means to step back into the text and see the passage, stanza or phrase that is represented in the abstraction.
editor’s note: This post originally appeared on the Maryland Institute for Technology in the Humanities blog on October 18, 2011.
Bamboo DiRT: Connecting Scholars with Tools and Collections
During the Bamboo Planning Project, workshop participants expressed interest in developing a directory of tools, services, and collections that provides relevant metadata (cost, platform, etc.) as well as information about how other scholars have combined these resources to achieve their project or pedagogical goals. However, participants also noted that an information silo would be antithetical to the philosophical approach of Project Bamboo and would quickly encounter data curation challenges.
In response to this feedback from the planning workshops, Project Bamboo is developing a tool, service, and collection registry application that accommodates both the individual scholar looking for information and other platforms that could access and/or ingest the information. This application can serve as a resource discovery and tip-sharing tool for scholars, and a source of feedback for developers. For its user-facing side, this application builds upon the well-known Digital Research Tools (DiRT) wiki, a partnership reflected in its tentative name, Bamboo DiRT.
While the tools, services and collections developed by Project Bamboo are represented in Bamboo DiRT, this application also aims to capture the broader ecosystem of resources used by digital humanists and includes entries drawn from the DiRT wiki, Humanist listhost, DH Answers, and other discussion fora. Each entry includes as much information as possible about the resource, including a prose description, supported platform(s), cost, screenshots, and technical information. While Bamboo DiRT is not itself a documentation repository, it contains fields for links to end-user, API, and general technical documentation. Authenticated users can indicate that they use a particular resource (following the model of the “like” button) and add tips for other users of that resource.
The development goals for Bamboo DiRT include a robust API that will lay the groundwork for integrating Bamboo DiRT information with the Bamboo Work Spaces and affiliated digital humanities websites built on common platforms (Drupal and WordPress). For example, projects listed on DHCommons that use a particular tool could be automatically listed alongside the tool’s entry on Bamboo DiRT. CUNY Academic Commons users could search Bamboo DiRT within the Commons, and the tools they’ve indicated they use could be listed in their profile. By participating in a rich ecosystem of data exchange, Bamboo DiRT will help users build upon each others’ workflows, whether or not they involve Bamboo tools and services.
Integrating Data Preservation & Citation Services
As part of Project Bamboo, a team from UC Berkeley Information Services and Technology is working with the University of California Curation Center (UC3) and Alfresco Professional Services to make available exciting new data management services for arts and humanities scholars. When fully realized, scholars will be able to easily migrate research data from Work Spaces to the UC3 Merritt repository for long-term access and preservation. Data moved to the Merritt repository will be assigned a persistent identifier (DOI), which can be used to cite data with confidence.
Scholars need to know that the underlying data supporting their publications will be around for the long term. With new tools and standards for data publication, their data can be reused and verified, its impact measured, and their contributions recognized and rewarded. Many funding agencies are now mandating, where sensible, broad public dissemination of the products of research. This solution should go a long way towards meeting the needs of the academic community, funders and other stakeholders.
We are currently working towards a technical proof of concept, with the objective of being able to move content (data and metadata) from the Alfresco-based Bamboo Work Space to the UC3 Merritt repository. We expect to have this technical proof of concept ready by late October 2011. We will then work towards a more functional beta release (December 2011), which we expect to pilot during the spring 2012 semester. Our code and overall approach will serve as a template for connecting to other web-based scholarly services.
To read further on Alfresco-based Bamboo work please visit:
ECM Work Spaces REST Integration Page: https://wiki.projectbamboo.org/x/diB4AQ
Contact: Noah Wittman (wittman<at>berkeley<dot>edu)
Describing Collections and Collection Services for the BTP
Digital information held by libraries, museums and archives is typically isolated in individual repositories making cross-repository searching difficult, if not impossible. Users of digital resources including humanities scholars, however, often search for information or resources pertinent to their field of endeavor irrespective of where the data is held. The establishment of collection description registries, such as Research Data Australia, goes someway towards solving this problem by aggregating descriptions of datasets held by individual repositories in a structured and coherent manner to promote the reuse of data.
In the Bamboo Technology Project environment, there is a need for data to be discoverable for computer-mediated use and reuse across collections. This requires that collection descriptions include machine actionable descriptions of collection services, as well as human readable descriptions of data holdings.
Describing Collections and Collection Services for the BTP suggests that a greater use of semantic web technologies, including RDF encoding for Registry Interchange Format – Collection and Services (RIF-CS), would simplify computer mediated use and reuse of data, particularly the automatic linking of services and data. Given the centrality of both collection interoperability, as well as data use and reuse to the Bamboo Technology Project, the adoption of an RDF encoding of an established schema such as RIF-CS for data collections would greatly aid the process of discovery as applied to both data and services.
editor’s note: Describing Collections and Collection Services for the BTP, authored by Timothy W. Cole (University of Illinois), Myung-Ja Han (University of Illinois), Doug Moncur (The Australian National University), and Harriett E. Green (University of Illinois), was presented by MJ Han and Doug Moncur at the recent DCMI International Conference on Dublin Core and Metadata Applications, The Hague, September 22, 2011.
Texts and the Citizen Scholar: Our Vision for the Next Phase
In mid-September, Project Bamboo partners gathered at the Maryland Institute for Technology in the Humanities for the next stage of planning for Phase Two of the project. In this workshop we took significant steps toward solidifying our Phase Two goals, mapping out a work plan, and preparing the proposal that will be submitted to the Mellon Foundation.
Our work in Phase II will be focused on building an infrastructure to support the exploration and curation of digital texts from a range of collections that play a key role in the research of scholars from across the humanities. This will include collections that we have been working with in the current phase — such as the HathiTrust Research Center, the Perseus Digital Library, and the Text Creation Partnership — but we will also be reaching out to other content providers in order to maximize the scholarly value of the tools and services we are developing and deploying.
Project Bamboo is also committed to reaching a wider audience in this next phase, ranging from the professional humanist to the citizen scholar. To this end we will develop applications which will be available for use by anyone who wants to explore, analyze, and enhance these text collections. We strongly believe that students at all levels and scholars from outside of the academy can play an important role in the ongoing task of preserving the record of human culture — including the content of these key text collections.
While we recognize the challenges involved in this vision for Phase Two, we will be building on the network of partnerships, systems, and architecture that we have developed during the current phase. Stay tuned to this blog for more news about our ongoing work and our plans for the future!
Partners Meet to Strategize Future Development
Project Bamboo partners gathered in late July in Evanston, Illinois to complete plans for the current work (Phase One) and begin planning for the next phase. Currently in the eleventh month of our eighteen-month initial phase, team members reviewed development progress, planned future pilot testing, and began to map out deployment strategies for the Bamboo ecosystem technologies.
Project Bamboo designed Phase One with four audiences in mind: traditional humanities scholars; humanities scholars who are deeply engaged with digital technologies; librarians and content stewards who seek to share content and support the humanities; and information scientists, tool builders, and enterprise technologists who want to further the evolution of tools and shared infrastructure for the humanities. These various groups will continue to be our target audiences as the project moves forward into the next phase of development.
In Phase One, we intentionally invested in a broad set of work with the goal of (1) integrating these initiatives into a larger ecosystem of environments, content, and tools, and (2) evaluating these diversified investments to see what was paying off and what is most needed for the next major phase of work. In Phase Two, we will continue to build out these initiatives and place significant focus on their integration. Decisions from the Evanston meeting include:
● During the next phase of work, Project Bamboo will begin to implement the architectural designs that came out of Corpora Space.
● As we move further into technical development, Project Bamboo plans to upload several demonstrators to demos.projectbamboo.org this Autumn. Stay tuned to the blog for announcements of these releases.
● Building a stronger and larger consortium is a major goal of Project Bamboo, and we will continue to evolve opportunities for groups to join as affiliates. By partnering with collections, libraries, humanities organizations, universities and others, we hope to broaden our user base and build long term stability. In addition to our partnerships with HathiTrust Research Center, Perseus Digital Library, and the Text Creation Partnership, the Advanced Research Consortium (ARC), at Texas A&M University, and the University of Alabama have recently joined as affiliates.
In the meantime, we invite you to explore our Places-Text demonstrator, featured here on this blog, and to follow the latest project updates on Twitter @projectbamboo.
Discover, Annotate, Curate: A Follow-Up on the Corpora Space ToolMixer
Corpora Space is designed to be the facet of Project Bamboo that directly engages the scholarly user. Currently in its design phase, Corpora Space will provide users with a rich research environment in which they will be able to use a range of tools to discover, annotate, collate, and curate texts across several large-scale structured humanities collections. In the first phase of Corpora Space implementation, which will begin in Spring 2012, we will connect a range of scholarly tools to several major digital textual corpora. The collections that have been selected for the first iteration of Corpora Space are HathiTrust, Text Creation Partnership (TCP) of Early English Books Online (EEBO) and Eighteenth Century Collections Online (ECCO), and Perseus Digital Library, giving scholars access to 450 years of print culture in English from 1473 to 1923, along side a selection of the Classical tradition on which that culture is based.
The principles behind the types of tools which Corpora Space will connect to these collections owe much to John Unsworth’s concept of ‘Scholarly Primitives’, which speaks to the needs of researchers, and how digital tools might meet these needs.
As part of the Corpora Space design process, the Maryland Institute for Technology in the Humanities (MITH) hosted ToolMixer on June 6-7th, a workshop that brought together scholars and tool builders (often one and the same in the digital humanities) to begin an ongoing conversation on how to make possible the vision of Corpora Space. Tool builders presented on several tools, which are currently listed on the project wiki.
In addition to exposing tools on the web and linking them to large-scale collections for the scholar to easily utilize, Corpora Space seeks to build a flexible system which will allow these disparate tools to work together in scholarly workflows. With this desire in mind, scholars, tool developers, and Bamboo partners at ToolMixer brainstormed potential workflows composed of tools presented at the workshop. The group discussions also allowed us to address potential problems which arise when attempting to make these tools work together.
One group put forward a hypothetical scholar interested in the editorial and publishing history of the first English novel. After identifying the relevant editions of Robinson Crusoe in the HathiTrust corpus, the texts would be run through Abbot and MorphAdorner, in order to transform the texts into a standard format that would allow subsequent tools to function. Once this step has been accomplished, and the texts are fully encoded, Juxta would align the texts for comparison, and PhiloLogic would index the results for searching and browsing. The addition of a curation tool to this workflow would allow users to identify errors and provide the source collection with corrected and encoded versions of the texts, thereby gradually improving the quality of the texts as scholars perform research tasks on them.
Over the course of the coming months, Bamboo partners will be evaluating a range of different tools and choosing which ones will be part of the first iteration of Corpora Space. Prototyping of tools and the architecture will begin in late 2011 and implementation in early 2012.