Launched in 2020, the DPLA national Wikimedia program established a digital asset pipeline to enable any participating cultural heritage institution in the United States to upload their eligible collections to Wikimedia Commons. The pipeline has since uploaded over 2.25 million files (from about 800,000 unique items) from over 200 contributing institutions.
DPLA is currently engaged in a WMF project grant to extend the digital asset pipeline, mainly by implementing structured data. Since the start of this project, DPLA has added approximately 23 million structured data statements to its uploads. With the recent implementation of references for Structured Data on Commons, is adding references to all of its statements, which will distinguish the authoritative metadata added from the source institutions from Wikimedia community statements. This work has added over 10 million references to statements, and will continue until all are referenced.
Building on this work, DPLA is currently working on a process to synchronize data in its aggregation with the live data in Wikimedia Commons. SDC statements with references to DPLA's catalog will be periodically checked for accuracy and changed, added, or removed if there are discrepancies, to keep the data in Commons always up-to-date as the uploads age over time. As part of this project, DPLA will soon begin adding Wikidata identifiers to its API as institution codes for all its items. If you would like to participate in the project, please see the discussion at the Commons DPLA WikiProject SDC modeling page.