Ensuring Longevity
Published: 11 December 2010
This page sets out to document some of the efforts to ensure that the "Geograph Archive" is available for a long time - hopefully for many future generations. - Use of an Open Licence for content - both for Source code, and the photos themselves - Freedom - The Geograph Manifesto
- Standard data formats - images are in .jpg format, data is available in many formats (csv, xml, mysql etc).
- Data is distributed - we have an API and database dumps available. more
- We attempt to check that pages linked from image descriptions are generally Archived. See
Checking External Links
- Other entities/projects hold substantial copies of the Geograph database/Archive, examples:
WebArchive - project by the British Library. At time of writing mirrors about 1.5Million photos.
WikiMedia Commons will mirror almost our entire archive, currently, 300,000 or so available, but do have a copy of first 2M which are being uploaded. via
UpMyStreet - have a copy of approximately 600,000 images.
Copies of the images and database mailed out on request - at least 4 copies with 2M images, distributed in this way.
Many others have copies of 250,000 images from the torrents.
- Open Source code - the code is available for reuse - promoting community development.
- Code is available on external Google Project hosting - which mirrors self hosted repository. (An older more obsolete version on SourceForge)
- Multiple Backups - The images are replicated to multiple servers. And copied to two off-site locations (in near real time).