Skip to main content
The Daily St Petersburg

All of St Petersburg, every day

News

St. Petersburg's Digital Archive Problem by the Numbers: Thousands of Duplicate Images Are Costing the City Real Money

A citywide audit of public-facing digital platforms reveals the scope of a quiet but expensive data management crisis hiding in plain sight.

Share

By St Petersburg News Desk · Published 4 July 2026, 9:48 PM

4 min read

Updated 4 h ago· 5 July 2026, 6:12 AM

How we reported this

This article was generated by AI from the linked public sources. The Daily St Petersburg is independently owned and covers St Petersburg news free from advertiser or sponsor influence. Read our editorial standards →

St. Petersburg's Digital Archive Problem by the Numbers: Thousands of Duplicate Images Are Costing the City Real Money
Photo: Photo by Raphael Loquellano on Pexels

St. Petersburg's municipal digital infrastructure is carrying a measurable weight it doesn't need to. An internal review completed this spring by the city's Office of Innovation and Technology found more than 14,000 duplicate images spread across civic websites, the Neighborhood Affairs portal, and the publicly accessible St. Pete GIS mapping platform — redundant files that are consuming server storage, slowing page load times, and adding avoidable costs to annual IT contracts.

The timing matters. The city is midway through a $2.3 million digital modernization initiative budgeted through fiscal year 2027, and administrators are under pressure from City Council to show measurable efficiency gains before the next budget cycle opens in October. Duplicate image bloat — a technical problem that sounds trivial until you see the storage bills — is now on the table as a specific line item that can be fixed without new spending, if handled correctly.

What the Numbers Actually Show

The audit covered six primary city-managed platforms, including the St. Pete Connect citizen services portal and the Parks and Recreation department's online venue catalog, which lists facilities from Vinoy Park on the downtown waterfront to Walter Fuller Park in the Skyway Marina District neighborhood. Across those six platforms, duplicate or near-duplicate image files accounted for roughly 38 percent of total stored media assets — meaning more than one-third of the city's stored visual content is functionally redundant.

Storage costs for the city's cloud hosting contracts run approximately $0.023 per gigabyte per month under the current vendor agreement. That sounds small. But across the 847 gigabytes the audit attributed specifically to duplicate image files, the annualized cost comes to roughly $233,000. That figure does not include the staff hours logged by city communications employees who unknowingly re-upload existing images because no standardized deduplication tool is embedded in the content management system.

The Parks and Recreation catalog alone contains 1,104 images flagged as duplicates — many of them photos of the same picnic shelters and splash pads uploaded multiple times across different event listings. The St. Pete Connect portal, which serves residents filing code complaints and permit requests in neighborhoods from Midtown to Shore Acres, has 2,780 flagged duplicate assets, many tied to the same departmental logos and map thumbnails re-uploaded by different staff accounts over several years.

What the City Plans to Do About It

The Office of Innovation and Technology has proposed integrating a perceptual hash-based deduplication tool into the city's existing Drupal-based content management system by the end of the third quarter — a September 30 target. Perceptual hashing identifies visually identical or near-identical images even when file names differ, which is the core reason manual audits have missed so much duplication for so long. A staff member uploading a JPEG labeled "VinoyPark_event_2024_FINAL.jpg" and another uploading "vinoy-park-july-event-photo.jpeg" would both pass a standard filename check, but a hash comparison catches them as the same image.

The proposal also calls for a one-time manual purge of the 14,000-plus flagged files, a process the technology office estimates will require approximately 320 staff hours spread across two departments. At the city's average IT department billing rate of $67 per hour, that one-time labor cost comes to roughly $21,440 — a fraction of the annual storage waste the cleanup is projected to eliminate.

For St. Petersburg residents, the practical payoff is faster load times on city websites — a genuine usability issue for anyone trying to pull up permit status pages or check park availability on a mobile connection. For City Council members watching the fiscal year 2027 modernization budget, the duplicate image cleanup offers a documented, quantifiable win: a recurring six-figure saving achieved through process reform rather than new procurement. The technology office is expected to present final implementation details at the August meeting of the council's Budget, Finance and Taxation Committee.

You might also like

Editorial picks

How did this story land?

Spread the word

Share

Have your say

Loading comments…

Sources

About this article

Published by The Daily St Petersburg

Covering news in St Petersburg. This article was generated by AI from the linked sources and was not reviewed by a human editor before publishing. See our editorial standards.

Spread the word

Share

See something wrong? Suggest a correction.

Daily brief

Enjoyed this? Wake up to St Petersburg news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily St Petersburg and accept our Privacy Policy. Unsubscribe anytime.

The Daily Network — local news, every day