In many hospitals, patient data are collected into a central database for medical research. For instance, the DPC dataset, which stands for Disease, Procedure and Combination, covers medical records for more than 7 million patients in more than 1000 hospitals. Using the distributed DPC dataset, a number of epidemiological studies become feasible and can reveal useful knowledge about medical treatments. Because these records contain personal data, cryptography is used to preserve patient privacy. This line of research, called Privacy-Preserving Data Mining (PPDM), aims to run data mining algorithms while preserving the confidentiality of the datasets.
This paper studies the scalability of privacy-preserving data mining in epidemiological studies. As the data-mining algorithm, we focus on linear regression, since it is used in many applications and is simple to evaluate. We try to identify a linear model that estimates the length of a hospital stay from distributed datasets containing patient and disease information. The contributions of this paper are (1) to propose privacy-preserving protocols for linear regression with horizontally or vertically partitioned datasets, and (2) to clarify the limit on the problem size that can be handled. This information is useful for determining the dominant element in PPDM and for setting the direction of further improvement.
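To make the horizontally partitioned setting concrete, the following is a minimal sketch of why linear regression decomposes across parties: each hospital holds different patients (rows) with the same attributes, so the normal-equation terms X^T X and X^T y are simply sums of per-hospital terms. This is not the protocol proposed in the paper. The cryptographic layer is omitted entirely, and all function names (`local_stats`, `add_stats`, `solve`, `fit_horizontal`) and the toy data are assumptions made for illustration. In a privacy-preserving version, the summation step would be carried out under encryption or secret sharing rather than in the clear.

```python
from fractions import Fraction


def local_stats(rows, targets):
    """One party's X^T X and X^T y, with an intercept column prepended."""
    d = len(rows[0]) + 1
    xtx = [[Fraction(0)] * d for _ in range(d)]
    xty = [Fraction(0)] * d
    for row, y in zip(rows, targets):
        x = [Fraction(1)] + [Fraction(v) for v in row]
        for i in range(d):
            xty[i] += x[i] * y
            for j in range(d):
                xtx[i][j] += x[i] * x[j]
    return xtx, xty


def add_stats(a, b):
    """Element-wise sum of two parties' statistics.

    Assumption: this is the step a real protocol would protect,
    for example with additively homomorphic encryption.
    """
    (xtx_a, xty_a), (xtx_b, xty_b) = a, b
    xtx = [[p + q for p, q in zip(ra, rb)] for ra, rb in zip(xtx_a, xtx_b)]
    xty = [p + q for p, q in zip(xty_a, xty_b)]
    return xtx, xty


def solve(a, b):
    """Solve a * w = b by Gauss-Jordan elimination in exact arithmetic."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        # Raises StopIteration if the aggregated matrix is singular.
        pivot = next(r for r in range(col, n) if m[r][col] != 0)
        m[col], m[pivot] = m[pivot], m[col]
        m[col] = [v / m[col][col] for v in m[col]]
        for r in range(n):
            if r != col and m[r][col] != 0:
                factor = m[r][col]
                m[r] = [v - factor * p for v, p in zip(m[r], m[col])]
    return [m[r][n] for r in range(n)]


def fit_horizontal(parties):
    """Fit [intercept, coefficients...] from a list of (rows, targets) pairs."""
    stats = [local_stats(rows, targets) for rows, targets in parties]
    total = stats[0]
    for s in stats[1:]:
        total = add_stats(total, s)
    return solve(*total)


# Hypothetical toy data: two hospitals, one attribute, stay = 2 + 3 * x.
hospital_a = ([[1], [2]], [5, 8])
hospital_b = ([[3], [5]], [11, 17])
weights = fit_horizontal([hospital_a, hospital_b])
```

Only the aggregated statistics, whose size depends on the number of attributes and not on the number of patients, need to be exchanged in this setting. The vertically partitioned case, where parties hold different attributes for the same patients, does not decompose this simply because X^T X contains cross-party products.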