nfdump install instructions seem inaccurate

2020-02-04 Thread skip cruse
I noticed when setting up nfdump for spot-ingest that there were some errors around the version of automake that’s requried. Apparently automake 1.14 is required and after installing that everything worked fine. I did some digging and it looks like this was already raised via SPOT-178:

install instructions have a typo

2020-02-14 Thread skip cruse
I was following the install instructions and noticed that the commands cannot be copy and pasted as there are missing line breaks in the commands. Additionally a -j is needed on the tar command to successfully extract a .bzip2 file and extract wireshark. We should update this page to have proper

Re: install instructions have a typo

2020-02-15 Thread skip cruse
a PR and we'll get it reviewed. > > Thank you, > Tadd Wood > > On Fri, Feb 14, 2020 at 9:39 AM skip cruse wrote: > > > I was following the install instructions and noticed that the commands > > cannot be copy and pasted as there are missing line breaks in the > comm

update copyright

2020-01-13 Thread skip cruse
I think we should update the copyright notices on the spot website http://spot.apache.org/get-started/ to 2020. -- if( bool halfWayThere == true); echo "WAH! LIVIN ON A PRAYER";

Re: [SPOT-INGEST] Ingest file organization

2020-01-14 Thread Skip Cruse
We should keep the name /spot-ingest/ for the original ingester, but move the new ingester to /spot-ingest-sparkstreaming/ or similar. Hopefully we can use the ticket to track down the files that were created, so we can move them to a new home easily. Get Outlook for

ingest_conf question

2020-01-22 Thread skip cruse
I was setting up Spot-Ingest, and I noticed that I needed to create a “ingest_conf.json” file. The documentation on http://spot.apache.org/doc/#deployment refers to a few things, but doesn’t go into any details. It does link to the github page at

Place where we can download sample test files

2020-04-13 Thread skip cruse
Does anyone have access to the original Data Samples pointed at here? ( DATA_SAMPLE.md ) . It looks the links broke sometime back to the s3 bucket. Also would be great if anyone could share links to other sample datasets

wireshark version

2020-04-21 Thread skip cruse
I noticed when making my changes on PR-159 ( https:/ /github.com/apache/incubator-spot/pull/159) that the Wireshark download link isn’t actually pointing to the right version of Wireshark that we claim to support (2.0.1), and the link also appears to be broken. I’m going to update this link, but I

Re: [SPOT-266] ml_ops.sh script should use Spark 2?

2020-03-23 Thread skip cruse
That seems like a fair solution. I noticed there’s also a spark-submit command that’s required when running this pipeline:worke r.py . We should make the update here as

Re: Hive CLI, Beeline CLI, and SPOT-221

2020-05-08 Thread skip cruse
+1. I don’t see any obvious downsides, and using beeline could potentially allow us to eventually use just one interface between Hive and Impala someday down the road if feel like consolidating some of the code. On Tue, Apr 28, 2020 at 3:03 PM Jeremy Nelson wrote: > I reviewed