Hi,

I can try to answer your questions about the matching output, though I'm not 
sure why the groups file in the first example is just the one line.  The second 
example seems reasonable, though.  The higher score for the pair 3,1 is more 
likely a match than 3,2.  The algorithm compares the fields marked as 
"include," as in "include this column in scoring."  In the second case, the 
fields used for both pairs agreed on "jim," but one pair has "A" compared with 
"A" while the other pair had "A" compared with "B."  The upward rating of 2 
agreeing fields pushed the score of the pairs 3,1 above the pair that had just 
one agreement and one disagreement.

Have you looked at the wiki page for the module?  It is at 
https://wiki.openmrs.org/display/docs/Patient+Matching+Module

There is information about the Swing GUI and some explanation of terms.  
Between the OpenMRS module and the Swing GUI, the term "must match" means it is 
a blocking column, and "should match" is included in scoring.

James

From: [email protected] [mailto:[email protected]] On Behalf Of 
Suranga Kasthurirathne
Sent: Monday, September 26, 2011 4:52 AM
To: [email protected]
Subject: [OPENMRS-IMPLEMENTERS] Newbie questions on patient Matching module


Hi all,

I've been looking at the patient matching module standalone, and trying to 
understand the matching concept.

Assuming I have table A as follows,
0|1|2|3 (Column IDs)
---------
1|max|m|A
2|jack|m|A
3|jim|m|A

and table B-
0|1|2|3 (Column IDs)
---------
1|jim|m|A
2|jim|m|B

If I block on Column 1, and include column 2, then I would expect it to 
identify that the third record in table A is similar to records 1 and 2 of 
table B.
After running the linkage process, I get the following results,

the _groups.txt file says  0|0.0|3|2|jim|jim|m|m|A|B

while the _new_match file says

0.0|3|1|jim|jim|m|m|A|A
0.0|3|2|jim|jim|m|m|A|B    (i'm assuming that the double (0.0) in both cols 
mean that both these records match evenly ?)


 *   any idea why the _groups.txt file fails to identify the relationship 
between 3|1 ?

Instead if I were to run the same test blocking on Column 1, and including 
columns 2 and 3,
I would expect it to identify that the third record in table A is similar to 
the first record of table B.

the _groups.txt file says 0|3.1699250014423126|3|1|jim|jim|m|m|A|A

while the _new_match file says

3.1699250014423126|3|1|jim|jim|m|m|A|A
-3.1699250014423126|3|2|jim|jim|m|m|A|B


 *   Can someone please help explain the data values given here ? I'm assuming 
that this means that the first record (3|1) has a high chance of matching  
while the second record (3|2) has really low chance of doing so ?


--
Best Regards,

Suranga

________________________________
Click here to 
unsubscribe<mailto:[email protected]?body=SIGNOFF%20openmrs-implement-l>
 from OpenMRS Implementers' mailing list

_________________________________________

To unsubscribe from OpenMRS Implementers' mailing list, send an e-mail to 
[email protected] with "SIGNOFF openmrs-implement-l" in the  body 
(not the subject) of your e-mail.

[mailto:[email protected]?body=SIGNOFF%20openmrs-implement-l]

Reply via email to