Please share this call with anyone in your networks who may be interested!
The World History Center and the Department of History at the University of Pittsburgh seek applicants for a two-year postdoctoral fellowship in Digital World History beginning Fall 2023. Candidates must have completed a Ph.D. in history or a related discipline before June 2023.
We seek candidates who have a background in history or a related discipline and who also have expertise in one or more digital methodologies, including data development, scripting languages, and linked data. Experience working in the spatial humanities with tools such as QGIS and ArcGIS is required. Applicants from information science who have demonstrable expertise in history are also welcome to apply. The successful candidate should also have experience in digital humanities pedagogy and teaching.
In addition to engaging in their own research, the postdoctoral fellow will collaborate on the Center Director’s current research initiatives, including the NEH-funded World Historical Gazetteer. The successful candidate will supervise 1-2 student workers and oversee the transformation of data into Linked Places format. Experience working with data using relational databases, scripting languages such as Python or R, and SQL is highly desirable. Experience developing web maps is also desirable.
The Fellow will serve as the student supervisor and lead instructor for the Digital Atlas Design Internship (https://www.worldhistory.pitt.edu/education/digital-atlas-design-internship). This will require regular meetings with students and faculty advisors. The Fellow is also expected to advise faculty and graduate students about digital methodologies and tools, consult with the system administrators who support the Center’s servers, and participate actively in the Center’s activities, events, and intellectual community.
Salary and benefits are competitive. Please apply on Pitt Talent Center by uploading a letter of application and a full CV. We will request letters of reference and a writing sample from semi-finalist candidates. The deadline for applications is March 20, 2023.
The grant will allow the project team to develop infrastructure, content, and community for Version 3 of the WHG. The index will more than double in size; the suite of tools will evolve to better support teachers, contributors, and end users; and the team will expand opportunities to involve diverse and global communities of board members, scholars, learners and developers.
The NEH’s Digital Humanities Advancement Grants program (DHAG) supports innovative, experimental, and/or computationally challenging digital projects, leading to work that can scale to enhance scholarly research, teaching, and public programming in the humanities. In support of its efforts to advance national information infrastructures in libraries and archives, the Institute of Museum and Library Services (IMLS) provides funding through this program.
ABOUT THE NATIONAL ENDOWMENT FOR THE HUMANITIES
Created in 1965 as an independent federal agency, the National Endowment for the Humanities supports research and learning in history, literature, philosophy, and other areas of the humanities by funding selected, peer-reviewed proposals from around the nation. Additional information about the National Endowment for the Humanities and its grant programs is available at: www.neh.gov.
On 13 September, I gave an invited talk, titled “Connecting Places with World Historical Gazetteer” at the Royal Dutch Academies Humanities Cluster offices in Amsterdam (KNAW-HuC). The slides are provided here, with some annotation.
The WHG has been featured in the Information Ecosystems Cookbook, an Open Educational Resource (OER) supported by a Sawyer Seminar grant from the Andrew W. Mellon Foundation and facilitated by the Visual Media Workshop at the University of Pittsburgh. The InfoEco Cookbook is comprised of modules organized by curricular pathways: Data in Context, Datafication, Data Structures, Platform Studies, and Producing and Using Datasets.
In the WHG module, “GIS Data: Using Linked Open Data and the World Historical Gazetteer to Map Spatial Data,” users learn how to make a gazetteer and how to index it in the World Historical Gazetteer. The module has four main sections. A “do” section asks learners to create a geospatial dataset by offering step-by-step instructions. A “watch” section includes a video that demonstrates how to reconcile data against the WHG index. An “explore” section provides alternative GIS tools and datasets for learners to engage with. And finally, a “guiding realizations” section describes the overarching concepts and challenges of the module.
The module was designed with different ways to do the lesson. One option is to follow along with an active facilitator guiding learners through the sections. Alternatively, one can do the lesson individually and asynchronously. The WHG’s InfoEco Cookbook module is a great way for interested users to learn how to format, upload, and reconcile a dataset into the World Historical Gazetteer!
The University of Pittsburgh’s World History Center is seeking applicants for a two-year postdoctoral fellowship in Digital World History beginning Fall 2022. The postdoctoral fellow will collaborate closely with the World Historical Gazetteer team and should have expertise in one or more digital methodologies, including data development, scripting languages, and linked data. Please share this opportunity widely! See below for application information.
Proficiency in digital mapping and spatial analysis is particularly desirable. Applicants from information science who have demonstrable expertise in history are also welcome to apply. In addition to engaging in their own research, the successful candidate will collaborate on the Center Director’s current research initiatives, including the World Historical Gazetteer. The successful candidate should also have experience in digital humanities pedagogy and teaching.
The Fellow will serve as the student supervisor and lead instructor for the Digital Atlas Design Internship. This will require regular meetings with students and faculty advisors. The Fellow is also expected to advise faculty and graduate students about digital methodologies and tools, consult with the system administrators who support the Center’s servers, and participate actively in the Center’s activities, events, and intellectual community.
Salary and benefits are competitive. Candidates must have completed their Ph.D. before June 2022. Please apply at the Pitt Talent Center by uploading a cover letter and a full CV. We will request letters of references and a writing sample from semi-finalist candidates. Deadline for applications is March 20th, 2022. Please direct any questions to WHC@PITT.EDU
We are pleased to be co-sponsoring with several colleagues a workshop/activity at the upcoming Linked Pasts VII Symposium. The activity will take place in two two-hour sessions on Wednesday and Thursday, the 15th and 16th of December (both starting at 16:00 CET). If you would like to participate, and/or be kept informed of details as the dates approach, please register your interest in this online form so we can be in touch with you.
Conveners: Tomasz Panecki, Bogumil Szady, Grzegorz Myrda (Polish HGIS); Karl Grossner, Ruth Mostern (World Historical Gazetteer)
Reconciliation in the Linked Pasts context is the task of aligning records concerning named historical entities contained in one dataset with those of another, typically for places or people. Often it is a research dataset being reconciled against some authoritative resource, but sometimes aligning “peer” datasets is the goal. We perform reconciliation in order to augment our dataset with attributes gained from another (e.g. geographic coordinates and concordance identifiers in the case of places), and to link our records (and by extension our research) with that of colleagues concerned with the same places and people.
The conveners of this activity have identified three particular issues we are interested to focus on for place data, in discussion and in a related exercise. The first is the quality of reference datasets (e.g. Wikidata/DBpedia, TGN, etc.) and how their granularity influences the process of alignment. The second issue is the consistency of matching decisions between multiple reviewers. Although algorithms may help users to match place names with their modern counterparts, choices are often ambiguous. The third issue is the sustainability of results and whether such linked data products can be treated as reference works, which is associated with the problem of how the reconciliation of place names relates to the identification of geographic phenomena over time.
The activity will begin with 10-minute presentations from four discussants who have various perspectives on reconciliation from their own work. Their geographic areas of concern include historical Polish territories, the broad Middle East, Russia, and the UK. Following that, in a demo/exercise participants will jointly perform a reconciliation exercise on a portion of a 100-row example dataset assembled for this purpose, using the World Historical Gazetteer platform. Afterwards, free discussion will close out the first session. Participants will be asked to upload the sample data file into their own private space in WHG in between sessions, and perform the reconciliation task on all 100 records (this should take 30 minutes or less).
In the second session, we’ll review those records that had the most disagreement between reviewers in matching results, then have extended discussion amongst all participants about reconciliation in general and possible next steps. We anticipate the experience will foster interesting and useful discussion that will inform a best practices white paper to be co-authored by the conveners following Linked Pasts 7.
In recent months we asked several contributors to pause their data preparation in the WHG system while these improvements were made. We can finally “re-open the doors” so to speak, so we invite those efforts to resume, and again encourage new contributions and collaborations. We will respond quickly to any bug reports or general inquiries about using the platform.
Over the next several months we will be adding quite a bit more data that is already in the queue. Although much user interaction with the WHG platform is self-guided and semi-automated, we have found that contributions move most smoothly with staff support. WHG staff stand ready to help with data conversion strategies and with the planning of contributions generally. Please do get in touch with us (whg at pitt dot edu) or with any individual WHG project team members individually.
The Site Guide and several tutorials on the WHG site describe its features and their use in some detail. The following briefly summarizes what is new since Version 1.
Registered users can now create “collections,” linking sets of existing public datasets within the system for purposes of presentation and combined search. This new feature aims at supporting the development of “focus regions” within WHG by collaborative groups with overlapping region/period interests.
Previously, search capability was limited to records fully accessioned into the WHG “union index,” and returned sets of one or more “closely matched” attestations of a place. This kept from view public datasets that had not yet been indexed. An option to search all public data within the WHG database—indexed or not—has been added to give a more complete view of the data we hold.
We have adopted the term “linking” to refer to all tasks of reconciliation and alignment—to external the external sources Wikidata, Getty TGN held in our sytem and to our own WHG union index. All of these require a “Review” step, where the prospective matches discovered in the task are presented for closeMatch/no match/defer decisions. The progress of this process, which can sometimes extend over time and involve multiple people, is now tracked in the Dataset Browse screen available to the dataset contibutor (“owner”) and designated collaborators. The choice to “defer” is also new since v1.2; it permits maintaining a separate queue of records, allowing users to move quickly through the easier decisions and set aside those requiring more attention, or review by others.
Views and downloads of public data
We now provide summary descriptions and mapped browsing for all datasets, collections, and individual place records that have been flagged as public. Public datasets can now be downloaded, according to CC-BY-4.0 license terms.
We have implemented the MapLibreGL technology for our Dataset and Collection maps, dramatically enhancing the speed of rendering large numbers of features.
Local Wikidata index
Since Version 1.2 in May, we have maintained a local index of about 3.6 million Wikidata place records, making reconciliation tasks for that resource 3x faster than the earlier SPARQL queries over the web–processing about 150-180 records per minute.
More reliable upload validation
Accounting for every possible anomaly or error in upload files is tricky. We have significantly improved the validation algorithm, trapping more errors with more user-friendly responses.
Site documentation has been edited and extended, and a number of display problems were fixed. SSL protocol (https) has been implemented for secure transfer.
After three years of development, we are pleased to announce the launch of Version 1 of the World Historical Gazetteer (WHG), at whgazetteer.org. Version 1 follows six beta releases over the past year or so. The WHG presently indexes 1.8 million modern place references and approximately 60,000 temporally scoped records.
In addition to filtered search and API access to data, we have developed a suite of tools that allow you to upload place datasets into a private workspace, augment them by reconciling them against the Getty Thesaurus of Geographical Names and Wikidata, publish them as Linked Open Data, and contribute them for accessioning to the WHG index.
We have a long list of planned improvements and a queue of in-progress data contributions. More data is very welcome of course, and your feedback is essential! You can use the site contact form, create an issue on GitHub, or simply write to us at email@example.com.
We have completed a Site Guide that describes the purposes, functionality, and data of the present system. The Tutorials and About sections of the site provide additional information. We will continue keeping our nearly 500 Twitter followers current with news of new data, new features, and bug fixes. We also plan to keep our blog updated with relevant announcements and discussion of the project’s future course..
We are pleased to announce this major step in our project and we look forward to your
The World Historical Gazetteer Team
Ruth Mostern, Karl Grossner, and Susan Grunewald
This post aims to clarify the relationships between a few of the models now in development for various uses by members of the historical linked data community, particularly with regard to geography (place)—namely Linked Places, Linked Traces, and Linked Art. Figure 1 provides a conceptual overview (click to magnify).
In several key respects the World Historical Gazetteer project (WHG; now in beta release 0.3) builds upon software and data development work produced by the Pelagios project—particularly the historical gazetteer infrastructure underlying its Peripleo and Recogito software applications.
Peripleo is a pilot application (no longer in active development) built to demonstrate a few key linked-data-for-history functions: a) search of a central index aggregating historical gazetteer records published as Linked Data, b) the annnotation of web-published records about historical objects with identifiers for relevant places (mostly coins and inscriptions in this case), and c) the display of search results for both in a map interface. WHG performs those functions also, along with some others.
Recogito is an annotation platform that among other things makes use of that historical gazetteer index by facilitating association of place references tagged in textual sources with the identifiers, coordinates, and name variants found in the indexed gazetteer records.
I have collaborated with Pelagios developer Rainer Simon and a few other interested folks to develop a Linked Places model and format particularly for contributions to the Pelagios and WHG platforms. The Pelagios and WHG indexes will have considerable overlap in coverage, but we anticipate that of WHG will over time become considerably broader in space and time—due primarily to its built-in semi-automated data development and contribution pipeline and stated goal of global breadth.
Because both projects have interest in annotations, we have also begun jointly developing a Linked Traces format—more precisely a set of implementation patterns using the W3C Web Annotation format standard for digital history and GLAM applications.
With that introduction, what follows are some details about Linked Places and Linked Traces, and thoughts about their immediate and potential uses. Also, given the concurrent development of the Linked Art model and ontology, some thoughts about how all of these might in time relate to each other in practice. Figure 1 above should provide useful reference.
Linked Places: model and data format
The Linked Places model and interconnection format (LPF) were developed to meet the particular requirements of the WHG and Pelagios platforms: a common data structure that both could ingest routinely without the need to accommodate on a case-by-case basis the enormous variety of data models in use by digital historical projects, large and small. LPF is a set of extensions to GeoJSON-LD, itself a Linked Data enabling extension to the most widely implmented test-based format for representing geographic features, GeoJSON (an IETF standard).
LPF also adds a standard means for adding time to GeoJSON features, introducing “when” objects to permit temporally scoping of a) an entire Feature, and/or b) its individual names, place types, geometry, and relations to other places, in any combination.
Uploads to WHG (and accessioning to both Pelagios and WHG indexes) require creating a serialization (i.e. transforming export) of place data from whatever form it is maintained in to LPF. We have also developed an abbreviated delimited text file format (LP-TSV) to meet the needs of contributors with relatively simple records.
Figure 1 summarizes the LPF conceptual model.
Linked Art: model and format
A global consortium of organizations involved in the domains of art, cultural heritage and archaeology—principally large museums and universities—are jointly developing Linked Art, a “shared model based on Linked Open Data to describe Art,” along with software implementations of it. The conceptual model is being formalized in an ontology with a subset of CIDOC-CRM entities and relations, and expressed as a data format using JSON-LD, a syntax of RDF.
From the perspective of WHG, Linked Art is a format many prospective users of our platform may adopt to describe objects in their collections. Both WHG and Pelagios are agnostic as to what formats our users and data partners use, and as mentioned above, users will have to perform a serialization to Linked Places format to interact with our platforms.
Figure 1 shows how Place appears in the Linked Art model. The points of contact with Linked Places are identifiers. One kind of identifier in Linked Art is a URI to a linked data gazetteer resource. A serialization of Places from a Linked Art dataset to LPF should include as many such identifiers as can be managed. WHG can aid discovery of those URIs via reconciliation services to Getty TGN and Wikidata.
All that said, place data from Linked Art collections are unlikely to be good candidates for contributions to WHG; the great majority of places will already be indexed. Rather, it is Linked Traces data that will be more relevant.
Linked Traces: model and format
WHG is following on from the Peripleo pilot in experimentally indexing not only place data, but what we are calling trace data: “annotations of web-published records about historical objects with identifiers for relevant places.” We say “experimentally” because it seems likely that the most useful web interfaces to trace data will be distinct from those for place data. Certainly there will be significant scaling issues.
In order to continue exploring the linking of places and associated traces, we (Rainer Simon and I) have also initiated development of a Linked Traces format, as a potential standard for use by the WHG and Recogito platforms. Linked Traces is turning out to be a set of implementation patterns for the W3C Web Annotation format (WA).
Annotating records of “anything” with URIs for web-published place records is but one use case for WA. For example, in Recogito, users annotate texts with references to not only places, but also people, events, and relations between all three.
Figure 1 indicates the way that a set of one or more place records can form the body of an annotation. The JSON form of the body in that example corresponds to an early draft of a “Linked Traces place pattern” in development. The working group’s activities are paused at the moment, but WHG is developing some exemplar data according to that draft, to be explored in our Version 1 release, slated for late spring 2020.