You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The dates (and some added strings around the dates) and phone numbers are what seem to differ between the sections, e.g.,:
>>>df[df['study_id'].isin(study_ids_with_descrepencies)][['section_impression']].iloc[0].item()
'\n1. 2021/2/6 QS/1 Data Systems 0951 HOURS AP AND LATERAL VIEWS OF THE UPRIGHT CHEST\nREDEMONSTRATE RIGHT INTERNAL JUGULAR DOUBLE LUMEN CATHETER WITH TIP\nIN THE REGION OF THE CAVOATRIAL JUNCTION.\n2. THERE HAS BEEN INTERVAL DEVELOPMENT OF BILATERAL LEFT GREATER\nTHAN RIGHT PLEURAL EFFUSIONS. THERE ARE ALSO INCREASED BIBASILAR\nOPACITIES WHICH ARE LIKELY COMPRESSIVE ATELECTASIS AS A RESULT OF THE\nEFFUSIONS. THESE FINDINGS ARE RELATIVELY ACUTE ONSET, AND SUGGEST\nPULMONARY EDEMA, ALTHOUGH INFECTION CANNOT BE EXCLUDED.\n'>>>df[df['study_id'].isin(study_ids_with_descrepencies)][['section_impression']].iloc[1].item()
'\n1. 3/18/2009 kollabio 0951 HOURS AP AND LATERAL VIEWS OF THE UPRIGHT CHEST\nREDEMONSTRATE RIGHT INTERNAL JUGULAR DOUBLE LUMEN CATHETER WITH TIP\nIN THE REGION OF THE CAVOATRIAL JUNCTION.\n2. THERE HAS BEEN INTERVAL DEVELOPMENT OF BILATERAL LEFT GREATER\nTHAN RIGHT PLEURAL EFFUSIONS. THERE ARE ALSO INCREASED BIBASILAR\nOPACITIES WHICH ARE LIKELY COMPRESSIVE ATELECTASIS AS A RESULT OF THE\nEFFUSIONS. THESE FINDINGS ARE RELATIVELY ACUTE ONSET, AND SUGGEST\nPULMONARY EDEMA, ALTHOUGH INFECTION CANNOT BE EXCLUDED.\n'
Another example:
>>>df[df['study_id'].isin(study_ids_with_descrepencies)][['section_impression']].iloc[6].item()
'\n\n1. Interval resolution of RIGHT pleural effusion without active \ndisease in the chest.\n\n"Physician to Physician Radiology Consult Line: (916) 919-2522"\n\n'>>>df[df['study_id'].isin(study_ids_with_descrepencies)][['section_impression']].iloc[7].item()
'\n\n1. Interval resolution of RIGHT pleural effusion without active \ndisease in the chest.\n\n"Physician to Physician Radiology Consult Line: (616) 985-3791"\n\n'
I was wondering if this was due to the de-identification process used; can we ignore these differences? I.e., have the dates and phone numbers been replaced with random versions?
Thanks,
A.
The text was updated successfully, but these errors were encountered:
Hi,
There are differences between the reports for images from the same study (from df_chexpert_plus_240401.csv):
This occurs with other sections as well.
The dates (and some added strings around the dates) and phone numbers are what seem to differ between the sections, e.g.,:
Another example:
I was wondering if this was due to the de-identification process used; can we ignore these differences? I.e., have the dates and phone numbers been replaced with random versions?
Thanks,
A.
The text was updated successfully, but these errors were encountered: