Great work!

// maropu

On Thu, Dec 1, 2016 at 2:29 PM, Makoto Yui <[email protected]> wrote:

> Hi,
>
> The initial code dump is done!
> https://github.com/apache/incubator-hivemall
>
> Let's move development (Pull requests) to the ASF repository.
>
> I'll update the project status page soon (and Dec report).
>
> Thanks,
> Makoto
>
>
> 2016-11-30 21:04 GMT+09:00 Makoto Yui <[email protected]>:
> > I'm considering importing https://github.com/myui/incubator-hivemall
> > into the ASF repository tomorrow.
> > Let me know if it's NOT okay.
> >
> > The GitHub tag/release issue is my concern, though:
> > https://lists.apache.org/thread.html/db78e1f8fc121d8e6b016d2f61d06ccafebf9fd30b4ec00883c78557@%3Clegal-discuss.apache.org%3E
> >
> > I would like to retain the past git tags to keep track of changes.
> >
> > Thanks,
> > Makoto
> >
> > 2016-11-30 20:35 GMT+09:00 Makoto Yui <[email protected]>:
> >> I'm considering updating the repository as follows, because git push
> >> does not work for a shallow copy (maybe due to the ASF git server
> >> version/configuration).
> >>
> >> You can find the tested repository at https://github.com/myui/incubator-hivemall
> >>
> >> $ git clone https://github.com/myui/hivemall.git incubator-hivemall
> >> $ cd incubator-hivemall
> >> $ git filter-branch --index-filter \
> >>     'git rm -r --cached --ignore-unmatch lib/ target/*.jar' \
> >>     --tag-name-filter cat --prune-empty -- --all
> >> $ rm -rf .git/refs/original/
> >> $ git reflog expire --expire=now --all
> >> $ git gc --aggressive --prune=now
> >> $ git remote set-url origin https://github.com/myui/incubator-hivemall.git
> >> $ git push -f -u origin master
> >> $ git push origin --tags --force
> >>
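One way to sanity-check a rewrite like the one above is to ask git whether any remaining commit still touches the removed paths. The following is only a minimal sketch: `history_touches` is a hypothetical helper name, not part of git or of the thread, and it assumes it is run against the rewritten clone.

```shell
#!/bin/sh
# Hypothetical helper (not from the thread): succeeds (exit 0) if any commit
# reachable from any ref still touches one of the given paths.
#   $1     path to the repository
#   $2...  pathspecs to look for, e.g. lib/ 'target/*.jar'
history_touches() {
    repo=$1
    shift
    # Print the hash of every commit that changes the pathspecs;
    # grep -q . succeeds as soon as at least one hash is printed.
    git -C "$repo" log --all --format=%H -- "$@" | grep -q .
}

# Example usage (assumes the current directory is the rewritten clone):
#   if history_touches . lib/ 'target/*.jar'; then
#       echo "some commits still reference the removed paths"
#   else
#       echo "history is clean"
#   fi
```

If the helper reports remaining commits, the filter pattern probably did not match every location of the files in question.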
> >> $ git clone https://github.com/myui/incubator-hivemall.git
> >> $ cd incubator-hivemall
> >> $ git_find_big.sh | head -10
> >>
> >> All sizes are in kB's. The pack column is the size of the object,
> >> compressed, inside the pack file.
> >> size  pack  SHA                                       location
> >> 1391  1383  b8d432e6a3c0074951abd35caf0a777caf47afbf  xgboost/lib/xgboost4j_0.60-0.10.jar
> >> 765   303   11c617713ee2ad3f847aee7627ee8639c5a79667  core/src/test/resources/hivemall/mf/ml1k.train
> >> 639   613   de4e32983604238bc72fe3f6cb6beea76fde0e8d  src/site/resources/images/hivemall_overview_bg.png
> >> 382   117   8b66187fe067c3aa389ce8c98108f349ceae159c  src/site/resources/fonts/fontawesome-webfont.svg
> >> 220   192   04d8605fd8daaafa72a2b6dfa2a2d48c75c57a10  src/site/resources/images/asf_bg.png
> >> 194   186   fb29a3d2ee04b7981463de89a77ccc7436f4ad9a  docs/gitbook/resources/images/techstack.png
> >> 191   76    e00b1127f6fb4fdcc1606a20b05e16b5456acacc  core/src/test/resources/hivemall/mf/ml1k.test
> >> 149   88    f221e50a2ef60738ba30932d834530cdfe55cb3e  src/site/resources/fonts/fontawesome-webfont.ttf
> >>
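The `git_find_big.sh` script used above is not shown in the thread. A similar listing can be sketched with plain git plumbing, under the assumption that uncompressed blob sizes are good enough; `list_big_blobs` is a hypothetical name, and unlike the output above it reports sizes in bytes and has no pack-size column.

```shell
#!/bin/sh
# Hypothetical helper (not the git_find_big.sh from the thread): print the N
# largest blobs reachable from any ref as "size-in-bytes SHA path".
#   $1  path to the repository (default: current directory)
#   $2  number of entries to print (default: 10)
list_big_blobs() {
    repo=${1:-.}
    count=${2:-10}
    # rev-list emits "SHA path" for every reachable object; cat-file adds the
    # type and uncompressed size and passes the path through as %(rest).
    git -C "$repo" rev-list --objects --all |
        git -C "$repo" cat-file \
            --batch-check='%(objecttype) %(objectsize) %(objectname) %(rest)' |
        sed -n 's/^blob //p' |   # keep blobs only and drop the type column
        sort -k1,1nr |           # largest first
        head -n "$count"
}

# Example usage:
#   list_big_blobs /path/to/incubator-hivemall 10
```

Because it walks every ref, this also surfaces large files that exist only in old commits, which is the case that matters before an import.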
> >> 2016-11-30 14:31 GMT+09:00 Makoto Yui <[email protected]>:
> >>> Hi Takeshi,
> >>>
> >>> I was about to perform the initial code dump (but stopped).
> >>>
> >>> Be aware that almost all commit hashes will change when rewriting the
> >>> Git logs with [1].
> >>> [1] git filter-branch --index-filter \
> >>>     'git rm -r --cached --ignore-unmatch lib/ target/*.jar' \
> >>>     --prune-empty -- --all
> >>>
> >>> So, I'm considering making a shallow copy limited to 100-300 commits
> >>> or so (which does not include large binaries).
> >>>
> >>> Thanks,
> >>> Makoto
> >>>
> >>> 2016-11-30 2:44 GMT+09:00 Takeshi Yamamuro <[email protected]>:
> >>>> Hi, all
> >>>>
> >>>> I have no strong opinion either, but it seems better to keep as
> >>>> much activity (that is, commit logs) as possible there.
> >>>> I'm afraid a sparse activity log could make newcomers think that
> >>>> Hivemall is inactive.
> >>>>
> >>>> As for the rebasing, it's not tough to rebase #285 (this is my own PR).
> >>>> So, rewriting the logs sounds good to me.
> >>>>
> >>>> // maropu
> >>>>
> >>>> On Tue, Nov 29, 2016 at 11:24 PM, Makoto Yui <[email protected]> wrote:
> >>>>
> >>>>> Kai,
> >>>>>
> >>>>> 2016-11-29 22:35 GMT+09:00 Kai Sasaki <[email protected]>:
> >>>>> > Currently we have 6 PRs and some of them (especially #285, #336
> >>>>> > and #385) are relatively large.
> >>>>> > It might make rebasing somewhat troublesome.
> >>>>>
> >>>>> Yes, it's my concern.
> >>>>>
> >>>>> But such large PRs would be better contributed through the Apache
> >>>>> incubation process.
> >>>>> I'm considering inviting some of their authors to become Hivemall
> >>>>> committers.
> >>>>>
> >>>>> Another concern is moving github stars/watchers as seen in [1].
> >>>>> [1] https://issues.apache.org/jira/browse/INFRA-12995
> >>>>>
> >>>>> > Do you think some of them are not ready to be merged? I think
> >>>>> > merging some of them before rewriting the history
> >>>>> > can make the migration work easier. But if they are not ready,
> >>>>> > it's okay. We can work on rebasing after this work.
> >>>>>
> >>>>> I'm currently reviewing #385, but it needs to be revised in several
> >>>>> parts.
> >>>>> Also, #336 requires large refactoring.
> >>>>>
> >>>>> So, it's better to do the initial code dump first.
> >>>>>
> >>>>> A shallow-copied repository can be pushed with git v1.9 and later
> >>>>> (I'm not sure about the ASF git version, though).
> >>>>> http://blogs.atlassian.com/2014/05/handle-big-repositories-git/
> >>>>>
> >>>>> Thanks,
> >>>>> Makoto
> >>>>>
> >>>>
> >>>>
> >>>>
> >>>> --
> >>>> ---
> >>>> Takeshi Yamamuro
>



-- 
---
Takeshi Yamamuro
