[Talk-us] New MapRoulette Challenge: Website Mismatch: Give it a try

2015-05-03 Thread Bryce Nesbitt
This is a website keyword matcher.

For example a website associated with name=Cafe Fanny/phone=555-1212
would be expected to have at least the words Cafe, Fanny, or a matching
phone number.
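
As a rough illustration of that check (a minimal sketch only, not the matcher the challenge actually uses), the function names, the dict of tags and the separator handling below are all assumptions made for the example:

```python
import re

def keywords_from_tags(tags):
    """Collect lowercase name words and bare phone digits from OSM-style tags."""
    words = [w.lower() for w in re.findall(r"\w+", tags.get("name", ""))]
    phone_digits = re.sub(r"\D", "", tags.get("phone", ""))
    return words, phone_digits

def page_matches(tags, page_text):
    """Return True if the page mentions any name word or the phone number."""
    words, phone_digits = keywords_from_tags(tags)
    text = page_text.lower()
    # Any single word from the name counts as a match.
    if any(re.search(r"\b" + re.escape(w) + r"\b", text) for w in words):
        return True
    # Drop common phone separators so "555-1212" also matches "(555) 1212".
    compact = re.sub(r"[\s().\-]", "", page_text)
    return bool(phone_digits) and phone_digits in compact

tags = {"name": "Cafe Fanny", "phone": "555-1212"}
print(page_matches(tags, "Welcome to Fanny's kitchen"))
print(page_matches(tags, "Cheap domain names for sale"))
```

A page that fails this kind of test is not necessarily wrong; it is only a candidate for a human to look at.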

The match is fuzzy. Any website with no match is flagged for human review.
These are often:

   - Spam websites.
   - Hijacked domains.
   - Businesses that have changed focus.
   - Businesses that have closed.
   - Websites that have moved.

There are false positives also:

   - Redirects with error codes in the 300 range.
   - Flash based websites with no text whatsoever.
   - Chain stores and schools that frequently reorganize their pages.
   - Small regional chains where the store page is either non-existent, or
   a URL that's sure to change in the future. The root URL is better, but
   won't match the name on the node.

If you work on the challenge you'll come to appreciate crafting a
future-proof URL.
The best URL for OSM is the simplest one possible (not something like
http://example.org/a/b/q?q=50505 ).
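
One way to propose a simpler candidate is to strip the path and query and keep only the site root. This is a sketch using Python's standard urllib.parse; the function name is made up for the example, and the result still needs a human check, since for chains the root page may not mention the individual store:

```python
from urllib.parse import urlsplit, urlunsplit

def simplify_url(url):
    """Reduce a URL to scheme + host, dropping path, query and fragment."""
    parts = urlsplit(url)
    return urlunsplit((parts.scheme, parts.netloc, "/", "", ""))

print(simplify_url("http://example.org/a/b/q?q=50505"))
# prints http://example.org/
```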

The challenge shows how quickly restaurants in particular come and go.



-
The challenge works regionally so you don't have to save after every node.
Right now it's set for the Northwest USA:

http://maproulette.org/#t=webtegrity/www-osmid-1723639413
___
Talk-us mailing list
Talk-us@openstreetmap.org
https://lists.openstreetmap.org/listinfo/talk-us


Re: [Talk-us] New MapRoulette Challenge: Website Mismatch: Give it a try

2015-05-03 Thread Bryce Nesbitt
This is a website keyword matcher.

For example a website associated with name=Cafe Fanny/phone=555-1212
would be expected to have at least the words Cafe, Fanny, or a matching
phone number.

The match is fuzzy. Any website with no match is flagged for human review.
These are often:

   - Spam websites.
   - Hijacked domains.
   - Businesses that have changed focus.
   - Businesses that have closed.
   - Websites that have moved.

There are false positives also:

   - Redirects with error codes in the 300 range.
   - Flash based websites with no text whatsoever.
   - Chain stores and schools that frequently reorganize their pages.
   - Small regional chains where the store page is either non-existent, or
   a URL that's sure to change in the future.

If you work on the challenge you'll appreciate well-crafted, future-proof
URLs.
The challenge shows how quickly restaurants in particular come and go.


-
The challenge works regionally so you don't have to save after every node.
Right now it's set for the Northwest USA:

http://maproulette.org/#t=webtegrity/