<div dir="ltr">Backing up GitHub data has been brought up since the time we migrated to GitHub, and being tracked here: <a href="https://github.com/python/core-workflow/issues/20" target="_blank">https://github.com/pytho<wbr>n/core-workflow/issues/20</a><div><br></div><div>TL;DR We'll be using GitHub's new <a href="https://developer.github.com/v3/migrations/orgs/">Migrations API</a> to download archived GitHub data of CPython. Ernest is helping us get set up with daily backups of CPython repo to be stored within The PSF's infrastructure.</div><div class="gmail_extra"><br clear="all"><div><div class="gmail_signature" data-smartmail="gmail_signature"><div dir="ltr"><div>Mariatta</div></div></div></div>
<br><div class="gmail_quote">On Thu, Jun 7, 2018 at 11:24 AM, Chris Angelico <span dir="ltr"><<a href="mailto:rosuav@gmail.com" target="_blank">rosuav@gmail.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><span class="">On Fri, Jun 8, 2018 at 3:33 AM, Chris Barker - NOAA Federal via<br>
Python-Dev <<a href="mailto:python-dev@python.org">python-dev@python.org</a>> wrote:<br>
> Any service could change or fail. Period.<br>
><br>
> So we shouldn’t want valuable information about Python development<br>
> only in gitHub.<br>
><br>
> I don’t know how hard it is to backup / mirror an entire repo — but it<br>
> sure seems like a good idea.<br>
<br>
</span>There are two separate concerns here:<br>
<br>
1) How do we get a full copy of all of CPython and its change history?<br>
<br>
2) How do we get all the non-code content - issues, pull requests, comments?<br>
<br>
The first one is trivially easy. *Everyone* who has a clone of the<br>
repository [1] has a full copy of the code and all history, updated<br>
every time 'git pull' is run.<br>
<br>
The second one depends on GitHub's exporting facilities; but it also<br>
depends on a definition of what's important. Maybe the PSF doesn't<br>
care if people's comments at the bottoms of commits are lost (not to<br>
be confused with commit messages themselves, which are part of the<br>
repo proper), so it wouldn't matter if they're lost. Or maybe it's<br>
important to have the contents of such commits, but it's okay to<br>
credit them to an email address rather than linking to an actual<br>
username. Or whatever. Unlike with the code/history repo, an imperfect<br>
export is still of partial value.<br>
<br>
ChrisA<br>
<br>
[1] Barring shallow clones, but most people don't do those<br>
<div class="HOEnZb"><div class="h5">______________________________<wbr>_________________<br>
Python-Dev mailing list<br>
<a href="mailto:Python-Dev@python.org">Python-Dev@python.org</a><br>
<a href="https://mail.python.org/mailman/listinfo/python-dev" rel="noreferrer" target="_blank">https://mail.python.org/<wbr>mailman/listinfo/python-dev</a><br>
Unsubscribe: <a href="https://mail.python.org/mailman/options/python-dev/mariatta.wijaya%40gmail.com" rel="noreferrer" target="_blank">https://mail.python.org/<wbr>mailman/options/python-dev/<wbr>mariatta.wijaya%40gmail.com</a><br>
</div></div></blockquote></div><br></div></div><div hspace="streak-pt-mark" style="max-height:1px"><img alt="" style="width:0px;max-height:0px;overflow:hidden" src="https://mailfoogae.appspot.com/t?sender=abWFyaWF0dGEud2lqYXlhQGdtYWlsLmNvbQ%3D%3D&type=zerocontent&guid=8ee21e7d-9226-4e7a-911b-33a200b7763b"><font color="#ffffff" size="1">ᐧ</font></div>