TY - JOUR
T1 - A feature-free and parameter-light multi-task clustering framework
AU - Huy, Thach Nguyen
AU - Shao, Hao
AU - Tong, Bin
AU - Suzuki, Einoshin
N1 - Funding Information:
A part of this research is supported by the Strategic International Cooperative Program funded by Japan Science and Technology Agency (JST) and by the grant-in-aid for scientific research on fundamental research (B) 21300053 from the Japanese Ministry of Education, Culture, Sports, Science and Technology. Thach Nguyen Huy is sponsored by the JICA Scholarship.
PY - 2013/7
Y1 - 2013/7
N2 - The two last decades have witnessed extensive research on multi-task learning algorithms in diverse domains such as bioinformatics, text mining, natural language processing as well as image and video content analysis. However, all existing multi-task learning methods require either domain-specific knowledge to extract features or a careful setting of many input parameters. There are many disadvantages associated with prior knowledge requirements for feature extraction or parameter-laden approaches. One of the most obvious problems is that we may find a wrong or non-existent pattern because of poorly extracted features or incorrectly set parameters. In this work, we propose a feature-free and parameter-light multi-task clustering framework to overcome these disadvantages. Our proposal is motivated by the recent successes of Kolmogorov-based methods on various applications. However, such methods are only defined for single-task problems because they lack a mechanism to share knowledge between different tasks. To address this problem, we create a novel dictionary-based compression dissimilarity measure that allows us to share knowledge across different tasks effectively. Experimental results with extensive comparisons demonstrate the generality and the effectiveness of our proposal.
AB - The two last decades have witnessed extensive research on multi-task learning algorithms in diverse domains such as bioinformatics, text mining, natural language processing as well as image and video content analysis. However, all existing multi-task learning methods require either domain-specific knowledge to extract features or a careful setting of many input parameters. There are many disadvantages associated with prior knowledge requirements for feature extraction or parameter-laden approaches. One of the most obvious problems is that we may find a wrong or non-existent pattern because of poorly extracted features or incorrectly set parameters. In this work, we propose a feature-free and parameter-light multi-task clustering framework to overcome these disadvantages. Our proposal is motivated by the recent successes of Kolmogorov-based methods on various applications. However, such methods are only defined for single-task problems because they lack a mechanism to share knowledge between different tasks. To address this problem, we create a novel dictionary-based compression dissimilarity measure that allows us to share knowledge across different tasks effectively. Experimental results with extensive comparisons demonstrate the generality and the effectiveness of our proposal.
UR - http://www.scopus.com/inward/record.url?scp=84878829245&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84878829245&partnerID=8YFLogxK
U2 - 10.1007/s10115-012-0550-5
DO - 10.1007/s10115-012-0550-5
M3 - Article
AN - SCOPUS:84878829245
SN - 0219-1377
VL - 36
SP - 251
EP - 276
JO - Knowledge and Information Systems
JF - Knowledge and Information Systems
IS - 1
ER -