Hi,
I want to get rid of words that are less than three characters but I keep
getting errors. I tried multiple ways but keep getting errors.
Here is my code:
f1_name = "/Users/marysauerland/Documents/file1.txt"
#the opinions
f2_name = "/Users/marysauerland/Documents/file2.txt"
#the constitution
def read_words(words_file):
return [word.upper() for line in open(words_file, 'r') for word in
line.split()]
read_words(f1_name)
#performs the function on the file
set1 = set(read_words(f1_name))
#makes each word into a set and removes duplicate words
read_words(f2_name)
set2 = set(read_words(f2_name))
count_same_words = 0
for word in set1:
if word in set2:
count_same_words += 1
#comparing the set1 (set of unique words in the opinions) with set2 (set of
unique words in the constitution) and adding 1 for each matching word found
which is just counting the words
print(count_same_words)
Best,
Mary
_______________________________________________
Tutor maillist - [email protected]
To unsubscribe or change subscription options:
https://mail.python.org/mailman/listinfo/tutor