Annotation and User Guidelines: Archival Gossip Collection

This page provides explanations of scholarly and technical terms (in alphabetical order) central to understanding how ArchicalGossip.com/Collection has been designed.

For a complementary overview of the specific ontology on which we rely to order items, please see this page.

Archival Documents

The archival documents that we collected during several research stays in the United States are embedded as PDFs with the plugin PDF Embed. For the transcriptions, we maintained the page breaks for users to easily look up the original text passages.

Content Management System Information

Omeka Classic 3.0.2 embedded in WordPress

php

theme: northfield master

Exhibits

Based on our research questions, we assigned the items to different exhibits such as ‘Gossip and/as Foreign Correspondence’ or ‘Cushman’s Networks.’

Gender

We avoided ascribing binary gender categories to historical agents. We are still using pronouns to avoid repetitions of names in the description text fields. The letters often give away whether a person perceived themselves as a woman or a man. These categories, however, had and have multiple dimensions in terms of behavior, societal ideals, gender performances, etc. In particular in Cushman’s circles, there were many women that did not comply with the feminine ideal of Victorian women. Cushman built her success on this deviation from the norm, as her performances as Romeo illustrate. Please refer to items marked with the “gender bending”-tag for further annotated primary material.

Geocoding

Geocoding data is included to enable dynamically mapping letter correspondence in a later stage of research. Additionally, we use the Geolocation plugin provided by Omeka in order to display the location where a letter was written, and an article or monograph was published. In doing so, we can trace the geographical space that historical agents like Charlotte Cushman acted in or were talked about. The geolocation is displayed on the item page and in a browseable map, which is one of the main menu options.

for longitude and latitude data, we refer to https://www.mapdevelopers.com/geocode_tool.php
format: place, state, country; longitude & latitude (according to 21st-century standards and data)
we distinguish between location of recipient and sender where possible (e.g. letters)

Item Search

Users are enabled to search the items for their specific interests. Users should use the “advanced search”-option. Additionally, there is a “refine search”-option enabled through the Facet Plugin.

Metadata

We adhere to the Dublin Core standards and customized some additional item-type-specific information such as text type or annotation information.

Relationships

The relationship plugin AvantRelationships provides the opportunity to visually display relevant relationships among items. At the end of the metadata information, users are able to click on the respective items to be directed to that specific item entry.

We have customized the relationships but refrained from defining them too narrowly. Hence, we can specify the relationship “in a relationship with” semantically in the description box of items of the type “person” to capture the historical specificities of intimacies between women in Cushman’s circles, for instance. The plugin emphasizes the networks of the historical agents, human and non-human, presented in the collection(s).

Secondary Sources

A list of secondary sources can be found on the blog’s website (WordPress). The sources are assigned to different research categories such as ‘gender/sexuality’ and ‘19^th century literature’ to show the research areas that the project is interested in as well as to guide the user’s search. For the item pages, the quotes and notes taken from secondary sources operate on a last name basis sticking to MLA8 citation guidelines.

For

Culkin, Kate. Harriet Hosmer: A Cultural Biography. U of Massachusetts P, 2010.

the item page would show

(Culkin 349).

Primary sources are also listed on the blog’s website.

Please note that these lists are a work in progress.

Spelling

In our transcriptions, we stick to the original spelling conventions: e.g. “every one” or “some thing” are not marked with [sic]. For inconsistencies in spelling, such as a missing apostrophe for the genitive case or negated auxiliaries, [sic] annotations are used in the transcription text.

For Cushman’s handwriting, u/n, beg/by, our/one, leaving/having are frequent words that are difficult to distinguish. Transcribed words guessed from the context are marked with [?]. In particular, Cushman used random dots/periods in her handwriting. We refrained from replacing them with commata or deleting them.

Transcription Annotations

When transcribing the original document, we aim at reproducing the annotations that we encounter on the page. For instance, if the author of the letter or diary entry underlined a passage, we indicate that in the transcription. If the archivist added information such as the year that the letter was presumably written and sent, we indicate that in the annotation metadata box that we have defined and set up in Omeka (Item Type Metadata).

[inserted]

insertions: indicated by [inserted] following the words that are inserted; indicated if more than one word is inserted, e.g. [last five words inserted] or if a symbol marks a reference to another page, e.g. X [references insertion on 2505]

+++

gaps indicated by +++ if word cannot be deciphered and/or is blackened

[sic]

if wrong spelling is used in the original document

[p. 1]/[page 1]/[3399] etc.

separates document pages to enhance clarity, indicates if the archival document is marked with a number, e.g. LoC, CCP 1:3399 which means that the letter is from the Library of Congress, Charlotte Cushman Collection, Box 1, page number 3399; the page number was assigned by the respective archive; if such a number is missing, we have inserted [page 1] etc. if we separated the transcriptions page by page

[3399 reverse]

the reverse of a page that was assigned a number by an archive does not show a number, to enhance clarity we marked the transcription of such a page by [reverse 3399]

[?]

researcher who transcribed the word is unsure about the transcription

no line breaks indicated
letter head (place, date, day of the week) not transcribed as information, it can be found in metadata fields
signature is transcribed
[…]

used to indicate partial transcription if only the relevant parts are trancribed

Transkribus

Initially, we described documents without any technical support. Starting April 2020, we have been using the expert tool Transkribus. At first, we trained our own model based on our own transcriptions of more than 180 pages. This model is repeatedly improved and modified. Transkribus uses handwritten text recognition and layout analysis. At the moment, it is still for free, which may change in the near future. It is an excellent tool that helps with line-by-line description and automatic transcriptions. Currently, the CER for our model is slightly above 9%. Thus, the training is a work in progress.

If the transcripts are provided by another institution or researcher, those instances are marked in the description box of the respective item entry.

Archival Gossip