We are pleased to announce the release of Version 2 of World Historical Gazetteer! New features have been added, and we’ve made several significant improvements to usability. This work was made possible by the continued support of our home institution, the World History Center at the University of Pittsburgh, and especially by the collaboration and support of the Humanities Cluster of KNAW.
In recent months we asked several contributors to pause their data preparation in the WHG system while these improvements were made. We can finally “re-open the doors” so to speak, so we invite those efforts to resume, and again encourage new contributions and collaborations. We will respond quickly to any bug reports or general inquiries about using the platform.
Over the next several months we will be adding quite a bit more data that is already in the queue. Although much user interaction with the WHG platform is self-guided and semi-automated, we have found that contributions move most smoothly with staff support. WHG staff stand ready to help with data conversion strategies and with the planning of contributions generally. Please do get in touch with us (whg at pitt dot edu) or with any individual member of the WHG project team.
The Site Guide and several tutorials on the WHG site describe its features and their use in some detail. The following briefly summarizes what is new since Version 1.
Registered users can now create “collections,” linking sets of existing public datasets within the system for purposes of presentation and combined search. This new feature aims at supporting the development of “focus regions” within WHG by collaborative groups with overlapping region/period interests.
Previously, search capability was limited to records fully accessioned into the WHG “union index,” and returned sets of one or more “closely matched” attestations of a place. This kept from view public datasets that had not yet been indexed. An option to search all public data within the WHG database—indexed or not—has been added to give a more complete view of the data we hold.
We have adopted the term “linking” to refer to all tasks of reconciliation and alignment, whether to the external sources Wikidata and Getty TGN held in our system or to our own WHG union index. All of these require a “Review” step, where the prospective matches discovered in the task are presented for closeMatch/no match/defer decisions. The progress of this process, which can sometimes extend over time and involve multiple people, is now tracked in the Dataset Browse screen available to the dataset contributor (“owner”) and designated collaborators. The choice to “defer” is also new since v1.2; it maintains a separate queue of deferred records, allowing users to move quickly through the easier decisions and set aside those requiring more attention or review by others.
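For readers who think in code, the review flow described above can be pictured roughly as follows. This is only an illustrative sketch, not WHG's actual implementation: the `ReviewQueue` class, its field names, and the record ids are all hypothetical, and only the three decision labels come from the description above.

```python
from collections import deque
from dataclasses import dataclass, field

# The three decisions offered in the Review step.
DECISIONS = {"closeMatch", "no match", "defer"}


@dataclass
class ReviewQueue:
    """Hypothetical model of a review task with a separate deferred queue."""
    pending: deque = field(default_factory=deque)   # records not yet seen
    deferred: deque = field(default_factory=deque)  # set aside for later or for others
    decided: dict = field(default_factory=dict)     # record id -> final decision

    def decide(self, decision: str) -> str:
        """Apply a decision to the next pending record and return its id."""
        if decision not in DECISIONS:
            raise ValueError(f"unknown decision: {decision!r}")
        if not self.pending:
            raise IndexError("no pending records to review")
        record_id = self.pending.popleft()
        if decision == "defer":
            self.deferred.append(record_id)
        else:
            self.decided[record_id] = decision
        return record_id

    def progress(self) -> dict:
        """Summarize review progress, as a tracking screen might display it."""
        total = len(self.pending) + len(self.deferred) + len(self.decided)
        return {
            "reviewed": len(self.decided),
            "deferred": len(self.deferred),
            "remaining": len(self.pending),
            "total": total,
        }


queue = ReviewQueue(pending=deque(["place-1", "place-2", "place-3"]))
queue.decide("closeMatch")  # an easy decision
queue.decide("defer")       # set aside for closer attention
print(queue.progress())
# {'reviewed': 1, 'deferred': 1, 'remaining': 1, 'total': 3}
```

Keeping deferred records in their own queue, rather than leaving them in the pending list, is what lets a reviewer finish a first pass and hand the harder cases to a collaborator.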
Views and downloads of public data
We now provide summary descriptions and mapped browsing for all datasets, collections, and individual place records that have been flagged as public. Public datasets can now be downloaded, according to CC-BY-4.0 license terms.
We have implemented the MapLibreGL technology for our Dataset and Collection maps, dramatically enhancing the speed of rendering large numbers of features.
Local Wikidata index
Since Version 1.2 in May, we have maintained a local index of about 3.6 million Wikidata place records. Reconciliation tasks against that resource now process about 150-180 records per minute, roughly three times faster than the earlier SPARQL queries over the web.
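As a rough planning aid, the throughput quoted above can be turned into a time estimate for a dataset of a given size. The sketch below assumes the rate holds steady across a whole task, which may not be true in practice; the function name and the 9,000-record example are hypothetical.

```python
def estimated_minutes(n_records: int, rate_low: float = 150, rate_high: float = 180):
    """Return (best_case, worst_case) minutes to reconcile n_records.

    The default rates are the 150-180 records per minute quoted above;
    a constant rate is an assumption made for illustration.
    """
    if n_records < 0:
        raise ValueError("n_records must be non-negative")
    return n_records / rate_high, n_records / rate_low


best, worst = estimated_minutes(9000)
print(f"{best:.0f}-{worst:.0f} minutes")  # 50-60 minutes
```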
More reliable upload validation
Accounting for every possible anomaly or error in upload files is tricky. We have significantly improved the validation algorithm, trapping more errors with more user-friendly responses.
Site documentation has been edited and extended, and a number of display problems were fixed. SSL protocol (https) has been implemented for secure transfer.