Another heavily aspected focus was to start improving the photographs uploaded from the National Museums Finna.fi service to the Wikimedia Commons. The first main target was reliably identifying and connecting Wikimedia Commons images to corresponding images in Finna. As the image information written on the Wikimedia Commons image page could have been non-machine readable, incorrect, or expired, the first task was to calculate perceptual hashes for images and match images using these. After this, we updated up-to-date source information to Commons images wikipage and SDC. We also systematically started adding SDC information about photographers, licenses, and dates. We also reuploaded the older low-resolution photos with improved quality, as museums have substantially increased image quality in the last five years. We continued this by writing a Finna-to-Wikimedia Commons metadata parser, which was then used to upload images from the JOKA Journalistic photo archive to Wikimedia Commons.
In 2023, we indexed approximately 15M Wikimedia Commons images using Python’s image hash library. We also updated information to 39000 photos and uploaded 3900 photos.