TY - GEN
T1 - Application of clustering methods to health insurance fraud detection
AU - Peng, Yi
AU - Kou, Gang
AU - Sabatka, Alan
AU - Chen, Zhengxin
AU - Khazanchi, Deepak
AU - Shi, Yong
PY - 2006
Y1 - 2006
N2 - Health insurance fraud detection is an important and challenging task. Traditionally, insurance companies use human inspections and heuristic rules to detect fraud. As the size of databases increases, the traditional approaches may miss a great portion of fraud for two main reasons. First, it is impossible to detect all health care fraud by manual inspection over large databases. Second, new types of health care fraud emerge constantly. SQL operations based on heuristic rules cannot identify those new emerging fraud schemes. Such a situation demands more sophisticated analytical methods and techniques that are capable of detecting fraud activities from large databases. The goal of this paper is to understand and detect suspicious health care frauds from large databases using clustering technique. Specifically, this paper applies two clustering methods, SAS EM and CLUTO, to a large real-life health insurance dataset and compares the performances of these two methods.
AB - Health insurance fraud detection is an important and challenging task. Traditionally, insurance companies use human inspections and heuristic rules to detect fraud. As the size of databases increases, the traditional approaches may miss a great portion of fraud for two main reasons. First, it is impossible to detect all health care fraud by manual inspection over large databases. Second, new types of health care fraud emerge constantly. SQL operations based on heuristic rules cannot identify those new emerging fraud schemes. Such a situation demands more sophisticated analytical methods and techniques that are capable of detecting fraud activities from large databases. The goal of this paper is to understand and detect suspicious health care frauds from large databases using clustering technique. Specifically, this paper applies two clustering methods, SAS EM and CLUTO, to a large real-life health insurance dataset and compares the performances of these two methods.
KW - Clustering
KW - Database
KW - Insurance fraud detection
UR - http://www.scopus.com/inward/record.url?scp=40649128309&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=40649128309&partnerID=8YFLogxK
U2 - 10.1109/ICSSSM.2006.320598
DO - 10.1109/ICSSSM.2006.320598
M3 - Conference contribution
AN - SCOPUS:40649128309
SN - 1424404517
SN - 9781424404513
T3 - Proceedings - ICSSSM'06: 2006 International Conference on Service Systems and Service Management
SP - 116
EP - 120
BT - Proceedings - ICSSSM'06
PB - IEEE Computer Society
T2 - ICSSSM'06: 2006 International Conference on Service Systems and Service Management
Y2 - 25 October 2006 through 27 October 2006
ER -