[agi] Project "SMALLEST GPT"

immortal . discoveries Sat, 23 Oct 2021 12:21:26 -0700

I was hesitant to call this project smallest, but the fact that there is 
already a project called smallGPT that does not do justice at all or even is 
guaranteed to be GPT because he made it on a weekend, encourages me to the 
depth of hell to call my project something much much smaller and simpler. It's 
not my fault they made it so jumbled up to work on GPT.


I was hesitant to ask for money yet, because I should be able to do some of the 
simpler things, but after some thinking, all the things I need done go into the 
same bucket. It's a lot of work, and it's offshorable work. The bucket is an 
easy bucket that has a low pay. I need maybe 500USD to do the following jobs, 
so I can focus on my main meat of understanding it and building on it (and my 
original project at the same time). Usually I always get someone that can do 
the job, I simply am not a [pro] in this "area":



1) Find the GPT code that produces GPT-1 results, some implementations may use 
less layers or be kid-i-fied and may not give GPT-1 results. I want the GPT 
code in Python, there cannot be ANY C++ or C etc code it is calling or using 
like in tensorflow and pytorch etc [by the end of this project].

2) Get all needed code, Pytorch, etc, or the C++ if there is no complete python 
implementation in existence as of yet, and put it all into 1 folder, and put 
all of the code into 1 file, and remove all imports that call code like torch 
and tensorflow because we want to have /that/ larger amount of (usually C++) 
code in our single file and remove the code that calls other files and the code 
that calls and builds using tensorflow as well (those ex. 300 lines in 
smallGPT). Calling other code only makes code size larger.

3) Remove all comments in the code. A description can be made in notepad if 
need. Separate the functions in the code with 2 lines, this should only happen 
ex. 5-10 times in the code like where GPU code is, BPE, embeds, tokenization, 
ADAM, GELU, etc.

4) Make code smaller by removing self classes, etc. Put things into 1 large 
loop, so there is less def functions to achieve less code. GPT by the end of 
this project should be about 400 - 600 lines of Python code, currently the 
smaller implementation is still ~6000 lines of code, definitely not 300 lines 
like they say it is.

5) Make the project a 1 click press, so it runs and eats the dataset in the 
folder, there should not be any cmd or work involved, it should just run on GPU 
and run in my Pycharm IDE (which I love).
------------------------------------------
Artificial General Intelligence List: AGI
Permalink: 
https://agi.topicbox.com/groups/agi/T74cd23135c964c0e-Me53efaf257ff471cbfd65b2c
Delivery options: https://agi.topicbox.com/groups/agi/subscription

[agi] Project "SMALLEST GPT"

Reply via email to