Wasn't my number. That was the price quoted to me by the Predictions team,
given the data set we were attempting to load.

 

And using NLTK in combination with a library I built, I get very high
accuracy on sentence context.  Not meaning, just subject matter.

 

My high confidence comes from this being one of the last places I go to ask
questions.  If I end up asking here it is usually because it is 3am and
everyone else is sleeping. Or because I exhausted everything else and am
hoping some crazy person here knows about some guy in Korea who did a port
but the documentation is in a funny language.

 

I ended up writing my own word and sentence tokenizers and am starting on a
stemmer.  They are much faster than the NLTK versions, but since I have not
tested them with 10 billion words, I always worry that I have screwed
something up.
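
For readers wondering what a hand-rolled tokenizer of this kind might look
like, here is a minimal regex-based sketch. It is not the code described above
(which is not shown in this thread); the function names `sentence_tokenize`
and `word_tokenize` and both patterns are illustrative assumptions, and the
sentence splitter will mis-split abbreviations such as "Dr." that NLTK's Punkt
tokenizer handles.

```python
import re

# Hypothetical patterns, chosen for illustration only.
# A word is a run of letters/digits with an optional apostrophe suffix
# ("don't", "it's"); any other non-space character is its own token.
_WORD_RE = re.compile(r"[A-Za-z0-9]+(?:'[A-Za-z]+)?|[^\sA-Za-z0-9]")
# A sentence boundary is whitespace that follows '.', '!' or '?'.
_SENT_END_RE = re.compile(r"(?<=[.!?])\s+")


def sentence_tokenize(text):
    """Split text into sentences on whitespace after ., ! or ?."""
    return [s for s in _SENT_END_RE.split(text.strip()) if s]


def word_tokenize(sentence):
    """Split a sentence into word and punctuation tokens."""
    return _WORD_RE.findall(sentence)


if __name__ == "__main__":
    for sent in sentence_tokenize("It is 3am. Everyone is asleep! Right?"):
        print(word_tokenize(sent))
```

Precompiling the patterns once at import time is the main reason a sketch like
this can be fast; it trades away the accuracy that a trained model provides.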

 

 

From: [email protected]
[mailto:[email protected]] On Behalf Of alex
Sent: Monday, March 05, 2012 12:51 PM
To: [email protected]
Subject: Re: [google-appengine] Re: NLTK or Equivalent

 

Well, that only tells me you haven't played enough with your data or haven't
chosen the best models to base your predictions on. No NLP software in the
world will tell you "what is this sentence about" with high confidence
without being given a concrete context. That's a task people have been
trying to solve for many years. Once you have a context, though, you're not
that far from what the Prediction API does.

 

It often looks like you're pulling numbers out of the sky and rounding them
up. Take a close look at
https://developers.google.com/prediction/docs/pricing: 10k
predictions/month cost $0; it's $0.50 for each 1k above 10k, and $0.50 per
250 MB of training for one dataset. Cloud storage costs nothing. I don't see
$20k, not even close.
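
The arithmetic behind those tiers can be sketched as follows. This only
encodes the rates quoted in this message (10k free predictions a month, $0.50
per extra 1k, $0.50 per 250 MB of training data); the function name, the
parameters, and the assumption that usage is billed in whole blocks rounded up
are hypothetical, so check the pricing page for the real rules.

```python
import math

# Rates as quoted in this message; not verified against the pricing page.
FREE_PREDICTIONS = 10000          # free predictions per month
PRICE_PER_1K_PREDICTIONS = 0.50   # dollars per 1k predictions above the free tier
PRICE_PER_250MB_TRAINING = 0.50   # dollars per 250 MB of training data


def estimate_monthly_cost(predictions, training_mb):
    """Estimate a monthly bill in dollars.

    Assumption: partial blocks are rounded up to a whole block.
    """
    extra = max(0, predictions - FREE_PREDICTIONS)
    prediction_cost = math.ceil(extra / 1000) * PRICE_PER_1K_PREDICTIONS
    training_cost = math.ceil(training_mb / 250) * PRICE_PER_250MB_TRAINING
    return prediction_cost + training_cost


if __name__ == "__main__":
    print(estimate_monthly_cost(1000000, 1000))
```

Under these assumptions, a million predictions in a month plus 1,000 MB of
training data comes to $497.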

 

Well, that of course depends on what exactly you need. The Prediction API
obviously isn't a silver bullet.

 

The nice thing about your posts, though, is that you often get me laughing
when I read your highly confident numbers :)



On Monday, March 5, 2012 6:58:55 PM UTC+1, Brandon Wirtz wrote:

Kind of sort of, not really.

 

Prediction doesn't give you back any of the NLP, so you can't say "What is
this sentence about", or what the user was trying to search for.
But also it is SOOOO VERYYYYYY Expensive.    Loading the training data for
10k users to build something like a content recommendation system and then
loading all of the data to make recommendations from was going to be in the
neighborhood of $20k before I got to the point that I could even evaluate if
the system was going to work.

I have played with the prediction API for looking at traffic data to say
"Hey you are always 'UP' in October" but that was hardly rocket science.

 

What I specifically need in this case is the ability to boil content down to
the important bits.

 

 

From: [email protected]
[mailto:[email protected]] On Behalf Of alex
Sent: Monday, March 05, 2012 10:42 AM
To: [email protected]
Subject: [google-appengine] Re: NLTK or Equivalent

 

Dunno what you're working on, so this might not apply to your case, but have
you considered the https://developers.google.com/prediction/ API? You can do
some cool stuff related to NLP.


alex.


On Monday, March 5, 2012 6:21:32 PM UTC+1, Brandon Wirtz wrote:

I was looking to do some things using NLTK and found
code.google.com/p/nltk-gae/, which looks promising, but the code on the site
has quite a few errors (most of which I worked through) and still has a LOT
left to be implemented (and relies on Memcache never being flushed to work).

Does anyone know of a better NLTK for GAE implementation? Or do we have an
ETA on NLTK being supported on GAE?

 

-Brandon

-- 
You received this message because you are subscribed to the Google Groups
"Google App Engine" group.
To view this discussion on the web visit
https://groups.google.com/d/msg/google-appengine/-/WouZtSgGQdUJ.
To post to this group, send email to [email protected].
To unsubscribe from this group, send email to
[email protected].
For more options, visit this group at
http://groups.google.com/group/google-appengine?hl=en.

