We have a big, giant sloppy Subversion repository that contains 60+ projects. The trunk
, branches
, and tags
directories are in the root of the repository. Some branches are done branches/project/branchName
. Others are done branches/BranchName/project
. There's a lot of cruft.
There are almost 200,000 revisions, 22Gb, and 60+ projects.
I want to restructure the repository, so each project has its own repository, and institute standard branching strategy. Dumping the entire repository takes about 7 to 8 hours, and then filtering out what I want is a very long process since I have to run svndumpfilter
multiple times.
I am thinking of a new strategy. If I look at the revisions involved in a single project, we might be talking about 400 revisions. I know I can run svnadmin dump
on a range of revisions. What if I dump out just the revisions of the project I'm interested in? I can run svnadmin dump
for each revision. I think this might actually be faster. However, how will this affect the load into a new repository?
Is there a problem of simply dumping only the revisions I want?