Thanks guys for your comments. I agree with you Florian that opening a terminal say in VSC allows you to run a shell script (an sh file) to submit your spark code, however, this really makes sense if your IDE is running on a Linux host submitting a job to a Kubernetes cluster or YARN cluster.
For Python, I will go with PyCharm which is specific to the Python world. With Spark, I have used IntelliJ with Spark plug in on MAC for development work. Then created a JAR file, gzipped the whole project and scped to an IBM sandbox, untarred it and ran it with a pre-prepared shell with environment plugin for dev, test, staging etc. IDE is also useful for looking at csv, tsv type files or creating json from one form to another. For json validation,especially if the file is too large, you may have restriction loading the file to web json validator because of the risk of proprietary data being exposed. There is a tool called jq <https://stedolan.github.io/jq/> (a lightweight and flexible command-line JSON processor), that comes pretty handy to validate json. Download and install it on OS and run it as zcat <json_file>.tgz | jq That will validate the whole tarred and gzipped json file. Otherwise most of these IDE tools come with add-on plugins, for various needs. My preference would be to use the best available IDE for the job. VSC I would consider as a general purpose tool. If all fails, one can always use OS stuff like vi, vim, sed, awk etc 🤔 Cheers view my Linkedin profile <https://www.linkedin.com/in/mich-talebzadeh-ph-d-5205b2/> *Disclaimer:* Use it at your own risk. Any and all responsibility for any loss, damage or destruction of data or any other property which may arise from relying on this email's technical content is explicitly disclaimed. The author will in no case be liable for any monetary damages arising from such loss, damage or destruction. On Fri, 1 Oct 2021 at 06:55, Florian CASTELAIN <[email protected]> wrote: > Hello. > > Any "evolved" code editor allows you to create tasks (or builds, or > whatever they are called in the IDE you chose). If you do not find anything > that packages by default all you need, you could just create your own tasks. > > > *For yarn, one needs to open a terminal and submit from there. * > > You can create task(s) that launch your yarn commands. > > > *With VSC, you get stuff for working with json files but I am not sure > with a plugin for Python * > > In your json task configuration, you can launch whatever you want: python, > shell. I bet you could launch your favorite video game (just make a task > called "let's have a break" 😉) > > Just to say, if you want everything exactly the way you want, I do not > think you will find an IDE that does it. You will have to customize it. > (correct me if wrong, of course). > > Have a good day. > > *[image: signature_299490615]* <https://www.neuroo.ai/> > > > > [image: Banner] <http://www.redlab.io/> > > > > *Florian CASTELAIN * > *Ingénieur Logiciel* > > 72 Rue de la République, 76140 Le Petit-Quevilly > m: +33 616 530 226 > e: [email protected] w: www.redlab.io > > ------------------------------ > *De :* Jeff Zhang <[email protected]> > *Envoyé :* jeudi 30 septembre 2021 13:57 > *À :* Mich Talebzadeh <[email protected]> > *Cc :* user @spark <[email protected]> > *Objet :* Re: Choice of IDE for Spark > > IIRC, you want an IDE for pyspark on yarn ? > > Mich Talebzadeh <[email protected]> 于2021年9月30日周四 下午7:00写道: > > Hi, > > This may look like a redundant question but it comes about because of the > advent of Cloud workstation usage like Amazon workspaces and others. > > With IntelliJ you are OK with Spark & Scala. With PyCharm you are fine > with PySpark and the virtual environment. Mind you as far as I know PyCharm > only executes spark-submit in local mode. For yarn, one needs to open a > terminal and submit from there. > > However, in Amazon workstation, you get Visual Studio Code > <https://code.visualstudio.com/> (VSC, an MS product) and openoffice > installed. With VSC, you get stuff for working with json files but I am not > sure with a plugin for Python etc, will it be as good as PyCharm? Has > anyone used VSC in anger for Spark and if so what is the experience? > > Thanks > > > > view my Linkedin profile > <https://www.linkedin.com/in/mich-talebzadeh-ph-d-5205b2/> > > > > *Disclaimer:* Use it at your own risk. Any and all responsibility for any > loss, damage or destruction of data or any other property which may arise > from relying on this email's technical content is explicitly disclaimed. > The author will in no case be liable for any monetary damages arising from > such loss, damage or destruction. > > > > > > -- > Best Regards > > Jeff Zhang >
