Chat.HL7.org Zulip Archive

For questions about this archive, please contact webmaster@hl7.org.

Stream: V2

Topic: anonymization/deidentfication of v2 messages?


view this post on Zulip Oliver Egger (Oct 31 2018 at 15:49):

hi, has anyone a pointer to a tool/software which can deidentify or anonymize hl7 v2 messages?

view this post on Zulip Lloyd McKenzie (Oct 31 2018 at 16:15):

Deidentification and anonymization are risk reduction processes that require context in order to evaluate risk of identification and in order to know what sort of information must be retained to support purpose of use. Add onto that that v2 systems are notorious for not populating segments consistently plus frequent reliance on Z-segments for important information, I think you'll be hard-pressed to find a generic tooling solution.

view this post on Zulip Lloyd McKenzie (Oct 31 2018 at 16:15):

@John Moehrke

view this post on Zulip John Moehrke (Oct 31 2018 at 18:32):

I agree with Lloyd. There might be some tools that are more aware of how to do this, but I am sure there is little in the way of off-the-shelf. Note, that all de-identification is a process that is specific to the data you have and the intended use of the resulting de-identified data. This process would look to the content of each element (e.g. each z-segment), and the need for the data to be maintained. see my article on this https://healthcaresecprivacy.blogspot.com/2015/02/is-it-really-possible-to-anonymize-data.html

view this post on Zulip Oliver Egger (Nov 01 2018 at 09:08):

@Lloyd McKenzie and @John Moehrke thanks a lot for the information provided! i completely agree with your statements but I hoped maybe someone had already tackled part of it with a software tooling stack.

view this post on Zulip John Moehrke (Nov 01 2018 at 12:48):

possibly https://privacy-analytics.com/

view this post on Zulip John Moehrke (Nov 01 2018 at 12:48):

I would not be too surprised that the large hl7 v2 interface engines had some capability

view this post on Zulip John Moehrke (Nov 01 2018 at 12:50):

the creation of the algorithm that is specific to your data, and your intended use of the resulting data set is the hard part, and something that is not possible to automate (well, except for the tried-and-true algorithm > /dev/null


Last updated: Mar 23 2020 at 00:02 UTC