Hi, Done the initial code dump! https://github.com/apache/incubator-hivemall
Let's move development (Pull requests) to the ASF repository. I'll update the project status page soon (and Dec report). Thanks, Makoto 2016-11-30 21:04 GMT+09:00 Makoto Yui <[email protected]>: > I'm considering to import https://github.com/myui/incubator-hivemall > to ASF repository tomorrow. > Let me know if it's NOT okey. > > Github tag/release issue is my concern though .. > https://lists.apache.org/thread.html/db78e1f8fc121d8e6b016d2f61d06ccafebf9fd30b4ec00883c78557@%3Clegal-discuss.apache.org%3E > > I would like to remain the past git tags to keep track of changes. > > Thanks, > Makoto > > 2016-11-30 20:35 GMT+09:00 Makoto Yui <[email protected]>: >> I'm considering to update the following way because git push does not >> work when performing shallow copy (maybe due to ASF git server >> version/configuration). >> >> You can find the tested repository on >> https://github.com/myui/incubator-hivemall >> >> $ git clone https://github.com/myui/hivemall.git incubator-hivemall >> $ git filter-branch --index-filter 'git rm -r --cached >> --ignore-unmatch lib/ target/*.jar' --tag-name-filter cat >> --prune-empty -- --all >> $ rm -rf .git/refs/original/ >> $ git reflog expire --expire=now --all >> $ git gc --aggressive --prune=now >> $ git remote set-url origin https://github.com/myui/incubator-hivemall.git >> $ git push -f -u origin master >> $ git push origin --tags --force >> >> $ git clone https://github.com/myui/incubator-hivemall.git >> $ cd incubator-hivemall >> $ git_find_big.sh | head -10 >> >> All sizes are in kB's. The pack column is the size of the object, >> compressed, inside the pack file. >> size pack SHA location >> 1391 1383 b8d432e6a3c0074951abd35caf0a777caf47afbf >> xgboost/lib/xgboost4j_0.60-0.10.jar >> 765 303 11c617713ee2ad3f847aee7627ee8639c5a79667 >> core/src/test/resources/hivemall/mf/ml1k.train >> 639 613 de4e32983604238bc72fe3f6cb6beea76fde0e8d >> src/site/resources/images/hivemall_overview_bg.png >> 382 117 8b66187fe067c3aa389ce8c98108f349ceae159c >> src/site/resources/fonts/fontawesome-webfont.svg >> 220 192 04d8605fd8daaafa72a2b6dfa2a2d48c75c57a10 >> src/site/resources/images/asf_bg.png >> 194 186 fb29a3d2ee04b7981463de89a77ccc7436f4ad9a >> docs/gitbook/resources/images/techstack.png >> 191 76 e00b1127f6fb4fdcc1606a20b05e16b5456acacc >> core/src/test/resources/hivemall/mf/ml1k.test >> 149 88 f221e50a2ef60738ba30932d834530cdfe55cb3e >> src/site/resources/fonts/fontawesome-webfont.ttf >> >> 2016-11-30 14:31 GMT+09:00 Makoto Yui <[email protected]>: >>> Hi Takeshi, >>> >>> I was almost to perform the initial code dump (stopped). >>> >>> Be aware almost all commit hash will be changed when rewriting Git logs by >>> [1]. >>> [1] git filter-branch --index-filter 'git rm -r --cached >>> --ignore-unmatch lib/ target/*.jar' --prune-empty -- --all >>> >>> So, I'm considering to make a shallow copy limiting 100-300 or so >>> (that does not include large binaries). >>> >>> Thanks, >>> Makoto >>> >>> 2016-11-30 2:44 GMT+09:00 Takeshi Yamamuro <[email protected]>: >>>> Hi, all >>>> >>>> I also have no strong opinion though, it seems it'd be better to keep as >>>> much activities (that is, commit logs) as possible there. >>>> I'm afraid few activity logs possibly make newbies misunderstand that >>>> hivemall is inactive. >>>> >>>> As for the rebasing, it's not tough to rebase #285 (this is my own pr). >>>> So, rewriting the logs sounds good to me. >>>> >>>> // maropu >>>> >>>> On Tue, Nov 29, 2016 at 11:24 PM, Makoto Yui <[email protected]> wrote: >>>> >>>>> Kai, >>>>> >>>>> 2016-11-29 22:35 GMT+09:00 Kai Sasaki <[email protected]>: >>>>> > Currently we have 6 PRs and some of them (especially #285, #336 and >>>>> > #385) >>>>> > are relatively large. >>>>> > It might cause somewhat troublesome rebasing. >>>>> >>>>> Yes, it's my concern. >>>>> >>>>> But, such large PRs should better to be contributed in the Apache >>>>> Incubation process. >>>>> I'm considering to invite some of them to the Hivemall committer. >>>>> >>>>> Another concern is moving github stars/watchers as seen in [1]. >>>>> [1] https://issues.apache.org/jira/browse/INFRA-12995 >>>>> >>>>> > Do you think some of them are not ready to be merged? I think merging >>>>> some >>>>> > of them before reflogging history >>>>> > can make migrating work easy. But if they are not ready, it's okay. We >>>>> can >>>>> > work on rebasing after this work. >>>>> >>>>> I'm currently reviewing #385 but it need to be revised in several parts. >>>>> Also, #336 requires large refactoring. >>>>> >>>>> So, better to do initial code dump first. >>>>> >>>>> Shallow copied repository can be pushed from git v1.9 and later >>>>> (I'm not sure about ASF git version though). >>>>> http://blogs.atlassian.com/2014/05/handle-big-repositories-git/ >>>>> >>>>> Thanks, >>>>> Makoto >>>>> >>>> >>>> >>>> >>>> -- >>>> --- >>>> Takeshi Yamamuro
