R, SQL, i2b2, and governance RE: Example (Re: Empirical Data Dictionary)

2014-10-06 Thread Dan Connolly
(Please excuse the awkward top-posting format; I'm stuck with Microsoft Outlook.) Perhaps we're converging... the new Data Builder code delivers an sqlite3 file, so you can continue to use SQL to analyze it; and if you like

RE: Example (Re: Empirical Data Dictionary)

2014-10-06 Thread Dan Connolly
What you've written is entirely responsive to my questions. As I say, I'm largely ignorant of this whole field (emperical data validation), so I appreciate the being educated by way of verbose specifics. I think I have a few substantive questions in response, but I want to read over what you wr

Re: Example (Re: Empirical Data Dictionary)

2014-10-06 Thread Alex Bokov
First of all, I feel like I'm treading into sensitive territory here, in that where I point out on problems it can be misinterpreted as criticizing other people's work. So let me just re-affirm: We are all geeks here. It's us against the bugs. When we engage in debate, both sides win by findin

RE: pattern of cohort characterization needs? RE: Empirical Data Dictionary

2014-10-06 Thread Dan Connolly
Nice. For bonus points, update the obesity data elements ticket (#33) to note this pattern in general and the obesity aspects in particular. For sensitive material, put it in the KUMC REDCap project I recently invited you to: * GPC

gpc-dev agenda 7 Oct

2014-10-06 Thread Dan Connolly
I don't expect we'll get through all of this, so come prepared with input on the order... Tues 11amCT: 1. Convene, take roll, review records and plan next meeting * ​Meeting ID and access code: 686-845-717; call +1 (267) 507-0008

Re: [gpc-informatics] #56: shareable synthetic test data sets: Epic clarity, i2b2, NAACCR, ...

2014-10-06 Thread GPC Informatics
#56: shareable synthetic test data sets: Epic clarity, i2b2, NAACCR, ... --+ Reporter: dconnolly | Owner: bokov Type: enhancement | Status: assigned Priority: major | Milestone: data-domains2 Component: dat

Re: [gpc-informatics] #56: shareable synthetic test data sets: Epic clarity, i2b2, NAACCR, ...

2014-10-06 Thread Alex Bokov
I now feel that we haven't delved deeply enough into Epic and NAACCR yet to trust simulations of them, let alone a general purpose simulator like originally proposed. Even I2B2 foils it right off the start because its EAV tables violate the assumptions we originally had about pulling out the re

Re: [gpc-informatics] #158: usable view of LOINC lab terms

2014-10-06 Thread GPC Informatics
#158: usable view of LOINC lab terms -+ Reporter: rwaitman | Owner: budh0007 Type: enhancement | Status: assigned Priority: major| Milestone: data-domains2 Component: data-stds| Resolution: Keywords:

Re: [gpc-informatics] #56: shareable synthetic test data sets: Epic clarity, i2b2, NAACCR, ...

2014-10-06 Thread GPC Informatics
#56: shareable synthetic test data sets: Epic clarity, i2b2, NAACCR, ... --+ Reporter: dconnolly | Owner: bokov Type: enhancement | Status: assigned Priority: major | Milestone: data-domains2 Component: dat

Re: pattern of cohort characterization needs? RE: Empirical Data Dictionary

2014-10-06 Thread Alex Bokov
I don't claim this covers all cases, only the ones I can think of so far. If anyone can think of a cohort characterization question that cannot be answered by the below procedure, I am interested in learning about it. On 10/03/2014 04:56 PM, Dan Connolly wrote: > > "most cohort characterization ne

RE: Updated milestone report from todays GPC-DEV call; 4Public

2014-10-06 Thread John Steinmetz
Here is the link to the deliverable on CDT. It is one document that contains 2.1, 2.5, and 2.7. 2.3 was previously submitted, and can be found on CDT at this link. John. From: Da

RE: Updated milestone report from todays GPC-DEV call; 4Public

2014-10-06 Thread Dan Connolly
Please copy gpc-dev when you submit it, John. -- Dan From: gpc-dev-boun...@listserv.kumc.edu [gpc-dev-boun...@listserv.kumc.edu] on behalf of John Steinmetz [jsteinm...@kumc.edu] Sent: Tuesday, September 30, 2014 1:24 PM To: 'Campbell, James R'; gpc-dev@listserv.

RE: Example (Re: Empirical Data Dictionary)

2014-10-06 Thread Dan Connolly
I started trying out the script last week. 1st bit of feedback: It was taking a lot longer than the estimated 20 minutes. Just building the 1st intermediate table took longer than that. I have since lost my sesson/context (had to reboot my desktop... sigh...) I guess my high order feedback is:

Example (Re: Empirical Data Dictionary)

2014-10-06 Thread Alex Bokov
Kind thanks to KUMC and the sites that volunteered to test. With help from Wisconsin and MCW so far, I have a lot of revisions to add to the original script that make the output smaller, the syntax less Oracle-specific, and eliminate or scrub certain fields. If you're not one of the test sites, yo

Re: Empirical Data Dictionary

2014-10-06 Thread Debbie Yoshihara
Alex made some modifications to the SQL because Netezza couldn't handle some of the Oracle SQL. I'll give you the counts in this document, along with the specific SQL code I used. OUTPUT_CON_MOD 1,632,942 rows OUTPUT_BASIC_DEMOG 126,339 rows OUTPUT_VISIT 13 rows OUTPUT_PROVIDER 38147 rows As