ordering of rows in dataframe

2023-12-05 Thread Som Lima
I want to maintain the order of the rows in a PySpark DataFrame. Is there any way to achieve this? The function below adds a row ID that numbers each row, but it currently rearranges the rows in the DataFrame. def createRowIdColumn(

Re: Lightbend Scala professional training & certification

2020-04-29 Thread Som Lima
I think I am going to focus on Spring Boot and Apache Camel. I'll do Apache Spark in the background. So see you. I am going to unsubscribe here. On Wed, 29 Apr 2020, 13:58 Som Lima, wrote: > The end value is important for me. > > I think certification in commercial framewor

Re: Lightbend Scala professional training & certification

2020-04-29 Thread Som Lima
any > loss, damage or destruction of data or any other property which may arise > from relying on this email's technical content is explicitly disclaimed. > The author will in no case be liable for any monetary damages arising from > such loss, damage or destruction. > > > > > O

Re: Lightbend Scala professional training & certification

2020-04-29 Thread Som Lima
Is there a Databricks or other professional certification for Apache Spark? On Wed, 29 Apr 2020, 11:29 Mich Talebzadeh, wrote: > Hi, > > Has anyone had experience of taking training courses with Lightbend > training on Scala > > I believe they are

Re: Converting a date to milliseconds with time zone in Scala Eclipse IDE

2020-04-29 Thread Som Lima
iable for any monetary damages arising from > such loss, damage or destruction. > > > > > On Tue, 28 Apr 2020 at 21:31, Som Lima wrote: > >> import java.time._ >> import java.util.Date >> import java.text.SimpleDateFormat >> import java.util.Local

Re: Filtering on multiple columns in spark

2020-04-29 Thread Som Lima
From your email, the obvious problem seems to be that 10 is an Int because it is not surrounded by quotes: 10 should be "10". Although I can't imagine a telephone number with only 10, because that is what you are trying to program. In *Scala*, you can check *if *two operands *are equal* ( == ) or *not*

Re: Converting a date to milliseconds with time zone in Scala with fixed date str

2020-04-28 Thread Som Lima
.wordpress.com > > > *Disclaimer:* Use it at your own risk. Any and all responsibility for any > loss, damage or destruction of data or any other property which may arise > from relying on this email's technical content is explicitly disclaimed. > The author will in no case be liab

Re: Converting a date to milliseconds with time zone in Scala

2020-04-28 Thread Som Lima
import java.time._ import java.util.Date import java.text.SimpleDateFormat import java.util.Locale import java.util.SimpleTimeZone object CalendarDemo extends App { println("Calendar Demo") val pattern = "E dd M HH:mm:ss.SSSZ"; val simpleDateFormat = (new

Re: Converting a date to milliseconds with time zone in Scala

2020-04-28 Thread Som Lima
import java.time._ import java.util.Date import java.text.SimpleDateFormat import java.util.Locale import java.util.SimpleTimeZone object CalendarDemo extends App { println("Calendar Demo") val pattern = "E dd M HH:mm:ss.SSSZ"; val simpleDateFormat = (new
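The CalendarDemo snippet above is cut off, so its full logic is not recoverable. As a rough illustration of the thread's topic only, here is a minimal Java sketch that converts a date string to epoch milliseconds in a given time zone. It uses java.time rather than the SimpleDateFormat approach in the snippet, and the class name DateToMillis, the method toEpochMillis, the sample date, and the pattern are all assumptions made for this example, not part of the original message.

```java
import java.time.LocalDateTime;
import java.time.ZoneId;
import java.time.format.DateTimeFormatter;

// Hypothetical helper class, not from the original thread.
class DateToMillis {

    // Parses a zone-less date-time string, interprets it in the given
    // time zone, and returns milliseconds since the Unix epoch.
    static long toEpochMillis(String text, String pattern, String zoneId) {
        DateTimeFormatter formatter = DateTimeFormatter.ofPattern(pattern);
        LocalDateTime local = LocalDateTime.parse(text, formatter);
        return local.atZone(ZoneId.of(zoneId)).toInstant().toEpochMilli();
    }

    public static void main(String[] args) {
        // Sample values chosen for illustration only.
        String text = "2020-04-28 21:31:00.000";
        String pattern = "yyyy-MM-dd HH:mm:ss.SSS";
        System.out.println("UTC:           " + toEpochMillis(text, pattern, "UTC"));
        System.out.println("Europe/London: " + toEpochMillis(text, pattern, "Europe/London"));
    }
}
```

The same wall-clock string yields different millisecond values in different zones, which is why the zone has to be supplied explicitly when the input string carries no offset.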

Re: Copyright Infringment

2020-04-25 Thread Som Lima
under, most are Apache > licensed but some components examples are GPL licensed. > > I hope this helps and you're able to use the examples in the book to get > your job done and thanks for reading the book. > > On Sat, Apr 25, 2020 at 8:48 AM Som Lima wrote: > >> The text is very c

Re: Copyright Infringment

2020-04-25 Thread Som Lima
> On Sat, Apr 25, 2020, 8:42 AM Som Lima wrote: > >> At the risk of being removed from the emailing I would like a >> clarification because I do not want to commit an unlawful act. >> Can you please clarify if I would be infringing copyright due to this >> text. >

Copyright Infringment

2020-04-25 Thread Som Lima
At the risk of being removed from the emailing I would like a clarification because I do not want to commit an unlawful act. Can you please clarify if I would be infringing copyright due to this text. *Book: High Performance Spark* *Authors: Holden Karau, Rachel Warren* *page xii:* *This book

Re: IDE suitable for Spark : Monitoring & Debugging Spark Jobs

2020-04-07 Thread Som Lima
The Definitive Guide, Chapter 18: Monitoring and Debugging "This chapter covers the key details you need to monitor and debug your Spark Applications. To do this, we will walk through the Spark UI with an example query designed to help you understand how to trace your own jobs through the

Re: Serialization or internal functions?

2020-04-07 Thread Som Lima
Go to localhost:4040 while the SparkSession is running. Select Stages from the menu. Select the job you are interested in. You can select additional metrics, including DAG visualisation. On Tue, 7 Apr 2020, 17:14 yeikel valdes, wrote: > Thanks for your input Soma ,

Re: Scala version compatibility

2020-04-06 Thread Som Lima
Those who follow best practices in software development would start with a clean environment, i.e. a fresh installation of the operating system, then install the development tools, keeping a record of version numbers, so that at the time of deployment unforeseen errors are avoided by duplicating development

Re: HDFS file hdfs://127.0.0.1:9000/hdfs/spark/examples/README.txt

2020-04-06 Thread Som Lima
> > textFile: org.apache.spark.rdd.RDD[String] = hdfs://127.0.0.1:9000/hdfs/spark/examples/README.txt MapPartitionsRDD[91] at > textFile at :27 > counts: org.apache.spark.rdd.RDD[(String, Int)] = ShuffledRDD[94] at > reduceByKey at :30 > > scala> :quit > > > jane tho

Re: Serialization or internal functions?

2020-04-05 Thread Som Lima
If you want to measure optimisation in terms of time taken, then here is an idea :) public class MyClass { public static void main(String args[]) throws InterruptedException { long start = System.currentTimeMillis(); // replace with your add column code // enough data
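The snippet above is truncated, so what follows is only a minimal sketch of the same idea: record a start time, run the work, and report the elapsed time. It swaps in System.nanoTime() for the System.currentTimeMillis() used in the snippet, since nanoTime is monotonic and meant for measuring intervals. The class name TimingSketch, the helper timeMillis, and the placeholder workload are assumptions made for illustration.

```java
// Hypothetical class, not from the original message.
class TimingSketch {

    // Runs the given work once and returns the elapsed wall-clock time in ms.
    static long timeMillis(Runnable work) {
        long start = System.nanoTime();
        work.run();
        return (System.nanoTime() - start) / 1_000_000;
    }

    public static void main(String[] args) {
        // Placeholder workload; replace with the add-column code being compared.
        long elapsed = timeMillis(() -> {
            long sum = 0;
            for (int i = 0; i < 10_000_000; i++) {
                sum += i;
            }
            System.out.println("sum = " + sum);
        });
        System.out.println("Elapsed ms: " + elapsed);
    }
}
```

A single run like this is only a rough indicator. JVM warm-up and, in Spark, lazy evaluation mean the timed section needs to trigger an action and be repeated several times before the numbers are comparable.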

Re: HDFS file

2020-03-31 Thread Som Lima
Hi Jane, try this example: https://github.com/apache/spark/blob/master/examples/src/main/scala/org/apache/spark/examples/streaming/HdfsWordCount.scala Som On Tue, 31 Mar 2020, 21:34 jane thorpe, wrote: > hi, > > Are there setup instructions on the website for >