New topic: 

Newbie question: how does this work?

<http://forums.realsoftware.com/viewtopic.php?t=47078>

         Page 1 of 1
   [ 1 post ]                 Previous topic | Next topic          Author  
Message        Markus Winter          Post subject: Newbie question: how does 
this work?Posted: Tue Feb 26, 2013 5:17 am                         
Joined: Sun Feb 19, 2006 4:00 pm
Posts: 1250
Location: Heidelberg, Germany                Hi all,

my last SQL experience was about 25 years ago, and my current attempt at 
getting my head around databases hits the usual amount of brickworks (well, 
maybe more than usual).

WI'm using the build-in database, and what I need to do is basically make a 
table of sentences, create a word list including a count, and correlate these 
two (in reality it is protein sequences and peptides from a digest but that's 
just semantics).

As I understand it:

• sentences have many words
• each work can be in many sentences
-> so there should be a many to many relationship for which I need a link table

Table SENTENCES
- Sentence_ID
- SentenceText
- NumberOfOccurences

Table WORDS
- Word_ID
- WordText
- NumberOfOccurences

Table WORDS_IN_SENTENCES
- Sentence_ID
- Word_ID

My problem now: words and sentences should only occur once in their respective 
tables. There is a property NumberOfOccurences which keeps track of how often a 
word occurs.

But how to do this in actuality?

To ensure that a word only occurs once I set the WordText to be unique.

But how do I now insert a word into the table?

I add a sentence to the table SENTENCES, split it, ... and now I'm unsure of 
how to proceed. How do I insert the word into the WORDS table and 
WORDS_IN_SENTENCES table?

I have a feeling that I need to check if a word is in WORDS, if so just add it 
to WORDS_IN_SENTENCES, if not add it to both, and at the very end (after 
processing all sentences) for each word count in WORDS_IN_SENTENCES how often 
it appears. Would that be the correct way of doing it? And how do I get the ID?

Considering my "word list" could be several hundred thousand's of words long 
speed might be an issue ...

Thanks for any advice.

Markus   
                             Top             Display posts from previous: All 
posts1 day7 days2 weeks1 month3 months6 months1 year Sort by AuthorPost 
timeSubject AscendingDescending          Page 1 of 1
   [ 1 post ]      
-- 
Over 1500 classes with 29000 functions in one REALbasic plug-in collection. 
The Monkeybread Software Realbasic Plugin v9.3. 
http://www.monkeybreadsoftware.de/realbasic/plugins.shtml

[email protected]

Reply via email to