Most clinical and biomedical data contain missing values. A patient's record may be split across multiple institutions, devices may fail, and sensors may not be worn at all times. These missing values are often ignored, which can lead to bias and error when the data are mined. Further, the data are not simply missing at random. Instead, whether a variable such as blood glucose is measured may depend on its prior values as well as those of other variables. These dependencies exist across time as well, but current methods have yet to incorporate both these temporal relationships and multiple types of missingness. To address this, we propose an imputation method (FLk-NN) that incorporates time-lagged correlations both within and across variables by combining two imputation methods, one based on an extension to k-NN and one based on the Fourier transform. This enables imputation of missing values even when all data at a time point are missing and when there are different types of missingness both within and across variables. In comparison to other approaches on three biological datasets (simulated and actual Type 1 diabetes datasets, and multi-modality neurological ICU monitoring), the proposed method has the highest imputation accuracy. This held with up to half of the data missing and when consecutive missing values made up a significant fraction of the overall time series length.
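The idea of combining a lagged k-NN estimate with a Fourier-based estimate can be illustrated with a small sketch. This is not the authors' FLk-NN implementation: the function names (`fourier_impute`, `lagged_knn_impute`, `combined_impute`), the single fixed lag, the low-pass cutoff `keep`, and the simple averaging rule are all assumptions chosen for brevity. The abstract does not specify how the two estimates are computed or combined.

```python
import numpy as np


def fourier_impute(x, keep=8, iters=50):
    """Fill NaNs in a 1-D series by iterated low-pass reconstruction.

    Assumption: only the `keep` lowest frequency bins are retained.
    Observed values are never modified.
    """
    x = np.asarray(x, dtype=float)
    missing = np.isnan(x)
    filled = np.where(missing, np.nanmean(x), x)  # start from the mean
    for _ in range(iters):
        spectrum = np.fft.rfft(filled)
        spectrum[keep:] = 0  # discard high frequencies
        smooth = np.fft.irfft(spectrum, n=len(x))
        filled[missing] = smooth[missing]  # update missing entries only
    return filled


def lagged_knn_impute(data, target, lag=1, k=3):
    """Fill NaNs in column `target` from the k most similar time points.

    Similarity is measured on the values of all variables `lag` steps
    earlier. Entries whose lagged row is incomplete are left as NaN.
    """
    data = np.asarray(data, dtype=float)
    n_steps = len(data)
    out = data[:, target].copy()
    # Candidate time points: target observed and lagged row complete.
    candidates = np.array([
        s for s in range(lag, n_steps)
        if not np.isnan(data[s, target])
        and not np.isnan(data[s - lag]).any()
    ], dtype=int)
    if candidates.size == 0:
        return out
    for t in range(lag, n_steps):
        if not np.isnan(data[t, target]):
            continue
        query = data[t - lag]
        if np.isnan(query).any():
            continue  # nothing to match on; leave for the Fourier step
        dists = np.linalg.norm(data[candidates - lag] - query, axis=1)
        nearest = np.argsort(dists)[:k]
        weights = 1.0 / (dists[nearest] + 1e-9)  # inverse-distance weights
        values = data[candidates[nearest], target]
        out[t] = np.sum(weights * values) / np.sum(weights)
    return out


def combined_impute(data, target, lag=1, k=3, keep=8):
    """Average the two estimates; fall back to Fourier where k-NN has none."""
    data = np.asarray(data, dtype=float)
    knn = lagged_knn_impute(data, target, lag=lag, k=k)
    fourier = fourier_impute(data[:, target], keep=keep)
    return np.where(np.isnan(knn), fourier, 0.5 * (knn + fourier))


if __name__ == "__main__":
    t = np.arange(200)
    series = np.column_stack([
        np.sin(2 * np.pi * t / 50),
        np.sin(2 * np.pi * (t - 5) / 50),  # second variable lags the first
    ])
    series[60:70, 0] = np.nan  # a run of consecutive missing values
    series[120, :] = np.nan    # a time point where every variable is missing
    print(combined_impute(series, target=0)[58:72])
```

In this sketch the Fourier estimate uses only the target series itself, so it still produces a value when the lagged row needed by the k-NN step is incomplete, for example right after a time point at which every variable is missing.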

Authors: Shah Atiqur Rahman; Yuxiao Huang; Jan Claassen; Nathaniel Heintzman; Samantha Kleinberg

Source: Journal of Biomedical Informatics, 2015, vol. 58

Keywords: Biomedical data; Imputation; Missing data; Time series