I'm a Dutch post-doc in the Antwerp ADReM group, and am currently supported by a Post-Doctoral Fellowship of the Research Foundation - Flanders (FWO). Before starting in Antwerp, I did my Ph.D. studies in Utrecht, the Netherlands, where I finished my thesis, 'Making Pattern Mining Useful', under supervision of prof.dr. Arno Siebes in the Algorithmic Data Analysis group at the Universiteit Utrecht in December 2009.
My research is mainly concerned with pattern mining, how to find interesting patterns, and how put them to good use. For this, insights from Information Theory in general, and the Minimum Description Length (MDL) and Maximum Entropy (ME) principles specifically, have proven to be valuable tools.
Currently I'm investigating how to extend the possibilities for identifying useful patterns by compression, how to efficiently mine patterns that compress better, explore other useful interestingness measures, and develop well-founded approaches for meaningful comparison between, and validation of, data mining results.
You might be interested in my publications, implementations, or our dedicated site on pattern set mining, or the 2012 workshop on Instant Interactive Data Mining. Below, you'll find an overview of my activities, as well as a selection of my recent publications.
Activities
-
Organization & Invited Talks
- Workshop co-chair of the 2012 IEEE International Conference on Data Mining (ICDM) to be held in Brussels, Belgium.
- Organizer of the ECML PKDD 2012 Workshop on Instant Interactive Data Mining (IID'12), organized in conjunction with ECML PKDD 2012 in Bristol, UK.
- Organizer and speaker of the 2011 tutorial Mining Sets of Patterns: Next Generation Pattern Mining at ICDM 2011.
- Invited speaker at the IEEE ICDM 2011 Workshop on Data Mining Technologies for Computational Collective Intelligence (DMCCI'11), organized in conjunction with IEEE ICDM 2011 in Vancouver, Canada.
- Organizer and speaker of the 2010 tutorial on Mining Sets of Patterns at ECML PKDD 2010 in Barcelona, Spain.
- Organizer of the ACM SIGKDD 2010 Workshop on Useful Patterns (UP'10), organized in conjunction with ACM SIGKDD 2010 in Washington, DC, USA.
- Invited speaker at the ECML PKDD 2008 Workshop From Local Patterns to Global Models (LeGo'08), organized in conjunction with ECML PKDD 2008 in Antwerp, Belgium.
- Member of the organizing committee of the 19th Dutch-Belgian Conference on Artificial Intelligence (BNAIC) 2007 in Utrecht, the Netherlands.
-
Awards & Grants
- KDD'11 Best Student Paper Award for 'Tell Me What I Need to Know'
- ACM SIGKDD Doctoral Dissertation Award 2010 Runner-Up
- ECML PKDD'09 Best Student Paper Award for 'Identifying the Components'
- Research Project 'Instant, Interactive & Adaptive Data Mining' of the Research Foundation Flanders (FWO) ('12-'15)
- Post-Doctoral Fellowship of the Research Foundation Flanders (FWO) ('10-'13)
- UA-BOF-KP Small Project (2010)
- UA-BOF-IWS Postdoctoral Researcher ('09-'10)
-
Program Committee Memberships
- ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '10-'12)
- IEEE International Conference on Data Mining (ICDM '12)
- European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD '08-'12)
- SIAM Conference on Data Mining (SDM '10-'11)
- International Conference on Advances in Social Network Analysis and Mining (ASONAM '12)
- International Conference on Pattern Recognition Applications and Methods (ICPRAM '12)
- Workshop on Discovering, Summarizing and Using Multiple Clusterings (MultiClust '11-'12)
- Workshop From Local Patterns to Global Models (LeGo '08-'09)
-
Reviewer for
- Data Mining and Knowledge Discovery (DAMI)
- Transactions on Knowledge Discovery and Data Mining (TKDD)
- Transactions on Knowledge and Data Engineering (TKDE)
- Transactions on Intelligent Systems and Technology (TIST)
- Knowledge and Information Systems (KAIS)
- Statistical Analysis and Data Mining (SAM)
- Social Network Analysis and Mining (SNAM)
- Information Systems (IS)
- ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD) '07-'09
- ACM International Conference on Information and Knowledge Management (CIKM) '11
- IEEE International Conference on Data Mining (ICDM) '08
- International Conference on Data Warehousing and Knowledge Discovery (DaWaK) '06
- International Conference on Discovery Science (DS) '11
- European Conference on Artificial Life (ECAL) '06-'07
Teaching
-
Bachelor Courses
- Artificial Intelligence (3 ECTS) (2009-2012)
- Data Mining (3 ECTS) (2009-2011)
- Databases (7.2 ECTS, assisting) (2005-2006)
- Internet Programming (7.2 ECTS, assisting) (2006-2008)
-
Master Courses
- Advanced Data Mining (6 ECTS) (2009-2012)
- Project Databases (6 ECTS) (2009-2010)
- Database Security (3 ECTS) (2009-2010)
-
Master Thesis (co-)Supervision
- Tanja Van den Eede (2011)
- Andie Similon (2010)
- Sander Schuckmann (2009)
-
PhD Thesis (daily) (co-)Supervision
- Sandy Moens (ongoing)
- Dr. Koen Smets (16 May 2012)
- Dr. Michael Mampaey (21 Oct 2011)
2012 |
|
The Long and the Short of It: Summarizing Event Sequences with Serial Episodes. In: Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'12), ACM, 2012. |
|
TourViz: Interactive Visualization of Connection Pathways in Large Graphs. Demo at: ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'12), 2012. |
|
Slim: Directly Mining Descriptive Patterns. In: Proceedings of the SIAM International Conference on Data Mining (SDM'12), pp 236-247, SIAM, 2012. |
|
Summarizing Categorical Data by Clustering Attributes. In: Data Mining and Knowledge Discovery, Springer, 2012. (In Press) |
|
2011 |
|
Maximum Entropy Modelling for Assessing Results on Real-Valued Data. In: Proceedings of IEEE International Conference on Data Mining (ICDM'11), pp 350-359, IEEE, 2011. (oral presentation, 12.3% acceptance rate; overall 18%) |
|
Comparing Apples and Oranges - Measuring Differences between Data Mining Results. In: of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECMLPKDD'11), pp 398-413, Springer, 2011. (invited for extension for best-of special issue, 3% acceptance rate; overall 20%) |
|
MIME: A Framework for Interactive Visual Pattern Mining. Demo at, and included in: ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'11), pp 757-760, ACM, 2011. |
|
Model Order Selection for Boolean Matrix Factorization. In: Proceedings of ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), pp 51-59, ACM, 2011. (oral presentation, 7.8% acceptance rate; overall 17.5%) |
|
Tell Me What I Need To Know: Succinctly Summarizing Data with Itemsets. In: Proceedings of ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), pp 573-581, ACM, 2011. (Best Student Paper Award; oral presentation, 7.8% acceptance rate; overall 17.5%) |
|
Unraveling Tobacco BY-2 Protein Complexes with BN PAGE/LC-MS/MS and Clustering Methods. In: Journal of Proteomics, vol.74(8), pp 1201-1217, Elsevier, 2011. (IF 5.074) |
|
Krimp: Mining Itemsets that Compress. In: Data Mining and Knowledge Discovery, vol.23(1), pp 169-214, Springer, 2011. (IF 2.950) |
|
The Odd One Out: Identifying and Characterising Anomalies. In: Proceedings of the SIAM International Conference on Data Mining (SDM'11), SIAM, 2011. (25% acceptance rate.) |
|
2010 |
|
Useful Patterns (UP’10) ACM SIGKDD Workshop Report. In: ACM SIGKDD Explorations, vol.12(2), pp 56-58, ACM Press, 2010. |
|
Summarising Data by Clustering Items. In: Proceedings of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECMLPKDD'10), pp 321-336, Springer, 2010. (18% acceptance rate) |
|

