Grafia   Graph Information Processing and Analysis


Publications [dblp][category]

Journal Papers

  1. Co-occurrence Based Diffusion for Expert Search On the Web, By Z. Guan, G. Miao, R. McLoughlin, X. Yan, D. Cai TKDE'12, Transactions on Knowledge and Data Engineering, 2012 [pdf]
  2. Graph OLAP: A Multi-Dimensional Framework for Graph Data Analysis,
    By C. Chen, X. Yan, F. Zhu, J. Han, P. S Yu,
    KAIS'09, Knowledge and Information Systems: An International Journal, 2009 [pdf]
  3. Report on the First International Workshop on Mining Graphs and Complex Structures,
    By L. Holder and X. Yan,
    SIGMOD Record 37(1): 53-55, 2008 [pdf]
  4. Frequent Pattern Mining: Current Status and Future Directions,
    by J. Han, H. Cheng, D. Xin and X. Yan,
    DMKD'07 (Data Mining and Knowledge Discovery, 10th Anniversary Issue), 2007 [pdf]
  5. On compressing frequent patterns,
    by D. Xin, J. Han, X. Yan, H. Chen, 
    DKE'07 (Data Knowledge Engineering), 60(1): 5-29, 2007 [pdf]
  6. Integrative Array Analyzer: A Software Package for Analysis of Cross-platform and Cross-species Microarray Data,
    by F. Pan, K Kamath, K. Zhang, S. Pulapura, A. Achar, J. Nunez-Iglesias, Y. Huang, X. Yan, J. Han, H. Hu, M. Xu, J. Hu, and X. Jasmine Zhou,
    Bioinformatics'06
    , Vol.22 no.13: 1665-1667, 2006. [pdf]
  7. Feature-based Substructure Similarity Search, 
    by X. Yan, F. Zhu, P. S. Yu, and J. Han,
    ACM-TODS'06 (ACM Transactions on Database Systems), Dec. 2006. [pdf]
  8. Statistical Debugging: A Hypothesis Testing-based Approach,
    by  C. Liu, L. Fei, X. Yan, J. Han and S. Midkiff,
    IEEE-TSE'06 (IEEE Transaction on Software Engineering), 32(10):831-848, 2006. [pdf]
  9. Graph Indexing Based on Discriminative Frequent Structure Analysis, 
    by X. Yan, P. S. Yu, and J. Han,
    ACM-TODS'05 (ACM Transactions on Database Systems), Dec. 2005. [pdf]
  10. TSP: Mining Top-K Closed Sequential Patterns,  
    by P. Tzvetkov, X. Yan, and J. Han,
    KAIS'05 (Knowledge and Information Systems: An International Journal), 7:438-457, 2005. [pdf]
  11. From Sequential Pattern Mining to Structured Pattern Mining: A Pattern-Growth Approach, 
    by J. Han, J. Pei, and X. Yan,
    JCST'04 (Journal of Computer Science and Technology), 19(3): 257-279, 2004. [pdf]

Conference Papers

  1. Towards Effective Partition Management for Large Graphs,
    by S. Yang, X. Yan, B. Zong, A. Khan
    SIGMOD'12 (Proc. 2012 Int. Conf. on Management of Data), Jun 2012 [pdf]
  2. Understanding Task-driven Information Flow in Collaborative Networks,
    by G. Miao, S. Tao, W. Cheng, J. Moulic, L. Moser and X. Yan,
    WWW'12 (Proc. 2012 Int. World Wide Web Conference), April 2012 [pdf] (conditional acceptance)
  3. Efficient multicasting for delay tolerant networks using graph indexing,
    by M. Mongiovi, A. Singh, X. Yan, B. Zong, K. Psounis,
    INFOCOM'12 (Proc. 2012 Int. Conf. on Computer Communications), March 2012 [pdf]
  4. PathSim: Meta Path-Based Top-K Similarity Search in Heterogeneous Information Networks,
    by Y. Sun, J. Han, X. Yan, P. S. Yu, T Wu,
    VLDB'11 (Proc. 2011 Int. Conf. on Very Large Data Bases), Aug 2011 [pdf]
  5. Mining Top-K Large Structural Patterns in a Massive Network,Th
    by F. Zhu, Q. Qu, D. Lo, X. Yan, J. Han, and P. Yu,
    VLDB'11 (Proc. 2011 Int. Conf. on Very Large Data Bases), Aug 2011 [pdf]
  6. Neighborhood Based Fast Graph Search in Large Networks,
    by A. Khan, N. Li, Z. Guan, X. Yan, S. Chakraborty, and S. Tao,
    SIGMOD'11 (Proc. 2011 Int. Conf. on Management of Data), June 2011 [pdf]
  7. Assessing and Ranking Structural Correlations in Graphs,
    by Z. Guan, J. Wu, Q. Zhang, A. Singh, and X. Yan,
    SIGMOD'11 (Proc. 2011 Int. Conf. on Management of Data), June 2011 [pdf]
  8. On Flow Authority Discovery in Social Networks,
    by C. Aggarwal, A. Khan and X. Yan,
    SDM'11 (Proc. 2011 SIAM International Conference on Data Mining),  Apr. 2011 [pdf]
  9. Content-Aware Resolution Sequence Mining for Ticket Routing,
    by P. Sun, S. Tao, X. Yan, N. Anerousis, Y. Chen,
    BPM'10 (The 8th Int. Conf. on Business Process Management),  Sep. 2010 [pdf]
  10. Generative Models for Ticket Resolution in Expert Networks
    G. Miao, L. Moser, X. Yan, S. Tao, Y. Chen, and N. Anerousis
    SIGKDD'10 (Proc. of 2010 Int. Conf. on Knowledge Discovery and Data Mining), Jul. 2010 [pdf]
  11. Assessing Expertise Awareness in Resolution Networks
    Y. Chen, S. Tao, X. Yan, N. Anerousis, and Q. Shao
    ASONAM'10 (Proc. 2010 International Conference on Social Networks Analysis and Mining), Aug. 2010 [pdf]
  12. Synthesizing Near-Optimal Malware Specifications from Suspicious Behaviors,
    M. Fredrikson, M. Christodorescu, S. Jha, R. Sailer, and X. Yan,
    Oakland'10 (31st IEEE Symp. on Security & Privacy), May 2010 [pdf]
  13. Towards Proximity Pattern Mining in Large Graphs,
    A. Khan, X. Yan and K.-L. Wu,
    SIGMOD'10 (Proc. 2010 Int. Conf. on Management of Data), June 2010 [pdf]
  14. Mining Diversity on Networks,
    L. Liu, F. Zhu, C. Chen, X. Yan, J. Han, P. S. Yu, and S. Yang,
    DASFAA'10
    (Proc. 2010 Int. Conf. on Database Systems for Advanced Applications), 2010 [pdf]

  15. Cross-Selling Optimization for Customized Product Promotion,
    N. Li, Y. Yang, X. Yan,
    SDM'10 (Proc. 2010 SIAM International Conference on Data Mining), April 2010 [pdf]
  16. Top-K Aggregation Queries over Large Networks,
    X. Yan, B. He, F. Zhu, and J. Han,
    ICDE'10 (Proc. 2010 Int. Conf. on Data Engineering), Mar. 2010 [pdf]
  17. Mining Graph Patterns Efficiently via Randomized Summaries,
    C. Chen, C. Lin, M. Fredrikson, M. Christodorescu, X. Yan, and J. Han,
    VLDB'09 (Proc. 2009 Int. Conf. on Very Large Data Bases), Aug. 2009 [pdf]
  18. Identifying Bug Signatures Using Discriminative Graph Mining,
    by H. Cheng, D. Lo, Y. Zhou, X. Wang and X. Yan,
    ISSTA'09 (Proc. 2009 Int. Symp. On Software Testing and Analysis), Jul. 2009 [pdf]
  19. Near-Optimal Supervised Feature Selection among Frequent Subgraphs,
    by M. Thoma, H. Cheng, A. Gretton, J. Han, H.-P. Kriegel, A. Smola, L. Song, P. S. Yu, X. Yan, and K. Borgwardt,
    SDM'09 (Proc. 2009 SIAM Int. Conf. on Data Mining), Apr. 2009  [pdf]
  20. SmallBlue: Social Network Analysis for Expertise Search and Collective Intelligence,
    by C. Lin, N. Cao, S. Liu, S. Papadimitriou, J. Sun, X. Yan,
    ICDE'09 (Proc. of 2009 Int. Conf. on Data Engineering ), Mar. 2009 [pdf]
  21. Graph OLAP: Towards Online Analytical Processing on Graphs,
    by C. Chen, X. Yan, F. Zhu, J. Han, and P. S. Yu,
    ICDM'08 (Proc. 2008 Int. Conf. on Data Mining), Dec. 2008 [pdf]
  22. On Effective Presentation of Graph Patterns: A Structural Representative Approach,
    by C. Chen, X. Lin, X. Yan, and J. Han,
    CIKM'08 (Proc. 2008 ACM Conf. on Information and Knowledge Management), Oct. 2008 [pdf]
  23. Efficient Ticket Routing by Resolution Sequence Mining,
    by Q. Shao, Y. Chen, S. Tao, X. Yan, N. Anerousis,
    SIGKDD'08 (Proc. of 2008 Int. Conf. on Knowledge Discovery and Data Mining), Aug. 2008 [pdf]
  24. Direct Mining of Discriminative and Essential Graphical and Itemset Features via Model-based Search Tree,
    by W. Fan, K. Zhang, H. Cheng, J. Gao, X. Yan, J. Han, P. S. Yu, O. Verscheure,
    SIGKDD'08 (Proc. of 2008 Int. Conf. on Knowledge Discovery and Data Mining),  Aug. 2008 [pdf]
  25. Mining Significant Graph Patterns by Scalable Leap Search,
    by X. Yan, H. Cheng, J. Han, and P. S. Yu,
    SIGMOD'08 (Proc. 2008 ACM SIGMOD Int. Conf. on Management of Data), Jun. 2008 [pdf][ppt][dataset]
  26. Direct Discriminative Pattern Mining for Effective Classification,
    by H. Cheng, X. Yan, J. Han, and P. S. Yu,
    ICDE'08 (Proc. of 2008 Int. Conf. on Data Engineering), Apr. 2008. [pdf]
  27. gApprox: Mining Frequent Approximate Patterns from a Massive Network,
    by C. Chen, X. Yan, F. Zhu, and J. Han.
    ICDM'07a (Proc. of 2007 Int. Conf. on Data Mining), Oct. 2007. (short paper) [pdf]
  28. Efficient Discovery of Frequent Approximate Sequential Patterns,
    by F. Zhu, X. Yan, J. Han, and P. S. Yu.
    ICDM'07b (Proc. of 2007 Int. Conf. on Data Mining), Oct. 2007. (short paper) [pdf]
  29. Towards Graph Containment Search and Indexing,
    by C. Chen, X. Yan, P. S. Yu, J. Han, D.-Q. Zhang and X. Gu.
    VLDB'07a (Proc. of 2007 Int. Conf. on Very Large Data Bases), Sep. 2007. [pdf]
  30. EntityRank: Searching Entities Directly and Holistically,
    by T. Cheng, X. Yan and K. Chang.
    VLDB'07b (Proc. of 2007 Int. Conf. on Very Large Data Bases), Sep. 2007. [pdf]
  31. A Graph-Based Approach to Systematically Reconstruct Human Transcriptional Regulatory Modules,
    by X. Yan, M. Mehan, Y. Huang, M. S. Waterman, P. S. Yu, and X. Zhou.
    ISMB'07a (the 15th Annual Int. Conf. on Intelligent Systems for Molecular Biology), Jul. 2007. [pdf]
  32. Systematic Discovery of Functional Modules and Context-Specific Functional Annotation of Human Genome,
    by Y. Huang, H. Li, H. Hu, X. Yan, M. S. Waterman, H. Huang, and X. Zhou.
    ISMB'07b (the 15th Annual Int. Conf. on Intelligent Systems for Molecular Biology), Jul. 2007. [pdf]
  33. gPrune: A Constraint Pushing Framework for Graph Pattern Mining,
    by F. Zhu, X. Yan, J. Han, and P. S. Yu.
    PAKDD'07 (Proc. of 2007 Pacific-Asia Conference on Knowledge Discovery and Data Mining), May 2007. Best Student Paper. [pdf]
  34. Mining Colossal Frequent Patterns by Core Pattern Fusion,
    by F. Zhu, X. Yan, J. Han, P. S. Yu, and H. Cheng.
    ICDE'07a (Proc. of 2006 Int. Conf. on Data Engineering), Apr. 2007. Best Student Paper [pdf]
  35. Discriminative Frequent Pattern Analysis for Effective Classification,
    by H. Cheng, X. Yan, J. Han, and C. Hsu.
    ICDE'07b (Proc. of 2006 Int. Conf. on Data Engineering), Apr. 2007. [pdf]
  36. Extracting Redundancy-aware Top-k Patterns,
    by D. Xin, H. Cheng, X. Yan, J. Han, 
    SIGKDD'06 (Proc. of 2006 Int. Conf. on Knowledge Discovery and Data Mining). [pdf]
  37. Mining Control Flow Abnormality for Logic Error Isolation,

    by C. Liu, X. Yan, and J. Han,

    SDM'06 (Proc. of 2006 SIAM Int. Conf. on Data Mining), 2006. [pdf]

  38. Searching Substructures with Superimposed Distance, 
    by X. Yan, F. Zhu, J. Han, and P. S. Yu,
    ICDE'06 (Proc. of 2006 Int. Conf. on Data Engineering), 2006. [pdf] [ppt_slides]
  39. Community Mining from Multi-Relational Networks, 
    by D. Cai, Z. Shao, X. He, X. Yan, J. Han,
    PKDD'05 (Proc. of 2005 European Conf. on Principles and Practice of Knowledge Discovery in Databases), 2005. [pdf]
  40. SOBER: Statistical Model-based Bug Localization, 
    by C. Liu, X. Yan, L. Fei, J. Han, and S. Midkiff,
    FSE'05 (Proc. of 2005 13th ACM SIGSOFT Symp. on the Foundations of Software Engineering), 2005.   [pdf] [website]
  41. Mining Compressed Frequent-Pattern Sets, 
    by D. Xin, J. Han, X. Yan and H. Cheng,
    VLDB'05 (Proc. of 2005 Int. Conf. on Very Large Data Bases), 2005. [pdf]
  42. Summarizing Itemset Patterns: A Profile-Based Approach, 
    by X. Yan, H. Cheng, J. Han, and D. Xin,
    SIGKDD'05a
    (Proc. of 2005 Int. Conf. on Knowledge Discovery and Data Mining), 2005, Best Student Paper RunnerUp. [pdf]
  43. Mining Closed Relational Graphs with Connectivity Constraints, 
    by X. Yan, X. Jasmine Zhou, and J. Han,
    SIGKDD'05b (Proc. of 2005 Int. Conf. on Knowledge Discovery and Data Mining), 2005. [pdf]
  44. Mining Coherent Dense Subgraphs Across Massive Biological Networks for Functional Discovery, 
    by H. Hu, X. Yan, Y. Huang, J. Han, X. Jasmine Zhou,
    ISMB'05 (also Bioinformatics). [pdf] [website]
  45. Substructure Similarity Search in Graph Databases, 
    by X. Yan, P. S. Yu, and J. Han,

    SIGMOD'05 (Proc. of 2005 Int. Conf. on Management of Data), 2005. [pdf]
    Among top-ranked papers in SIGMOD'05, Invited to  ACM Transactions on Database Systems (TODS).
  46. Mining Behavior Graphs for `Backtrace' of Noncrashing Bugs, 
    by C. Liu, X. Yan, H. Yu, J. Han, and P. S. Yu,

    SDM'05a (Proc. of 2005 SIAM Int. Conf. on Data Mining), 2005. [pdf]
  47. SeqIndex: Indexing Sequences by Sequential Pattern Analysis, 
    by H. Cheng, X. Yan, and J. Han,

    SDM'05b (Proc. of 2005 SIAM Int. Conf. on Data Mining), 2005 (short paper). [pdf]
  48. Mining Closed Relational Graphs with Connectivity Constraints, 
    by X. Yan, X. Zhou, J. Han,
    ICDE'05 (Proc. of 2005 Int. Conf. on Data Engineering) (short paper). [pdf]
  49. Graph Indexing: A Frequent Structure-based Approach, 
    by X. Yan, P. S. Yu, and J. Han,
    SIGMOD'04 (Proc. of 2004 Int. Conf. on Management of Data), 2004. [pdf][dataset]
    Among top-ranked papers in SIGMOD'04, Invited to  ACM Transactions on Database Systems (TODS).
  50. IncSpan: Incremental Mining of Sequential Patterns in Large Database, 
    by H. Cheng, X. Yan, and J. Han,

    SIGKDD'04 (Proc. 2004 of the Int. Conf. on Knowledge Discovery and Data Mining), 2004. [pdf]
  51. CloseGraph: Mining Closed Frequent Graph Patterns, 
    by X. Yan and J. Han,

    SIGKDD'03 (Proc. of 2003 Int. Conf. Knowledge Discovery and Data Mining), 2003. [pdf]

    Google Scholar ranks CloseGraph as #1 for "graph pattern mining", with 140 citations. (as of Nov 25, 2007)
  52. CloSpan: Mining Closed Sequential Patterns in Large Datasets,
    by X. Yan, J. Han, and R. Afshar,

    SDM'03 (Proc. of 2003 SIAM Int. Conf. Data Mining), 2003.  [pdf]
  53. TSP: Mining Top-K Closed Sequential Patterns,
    by P. Tzvetkov, X. Yan, and J. Han,
    ICDM'03 (Proc. of 2003 Int. Conf. on Data Mining), 2003. [pdf]
  54. gSpan: Graph-Based Substructure Pattern Mining,
    by X. Yan and J. Han,
    ICDM'02 (Proc. of 2002 Int. Conf. on Data Mining) (short paper), 2002.  [pdf]
    Expanded Version, UIUC Technical Report, UIUCDCS-R-2002-2296. [pdf]
    Google Scholar ranks gSpan as #3 for "graph pattern mining", with 276 citations. (as of Nov 25, 2007)
  55. Accelerating Volume Rendering with L-Buffer,
    by X. Yan, W. Cai and J. Shi,
    CAD&Graphics'97
    , Wuhan, China, 1997.

Book Chapters

  1. Discovery of Frequent Substructures
    by X. Yan and J. Han,
    Mining Graph Data, D. Cook and L. Holder, John Wiley & Sons Inc, 2007.
  2. Discovering evolutionary classifier over high speed non-static stream,  
    by J. Yang, X. Yan, J. Han, and W. Wang,
    Advanced Methods for Knowledge Discovery from Complex Data, S. Bandyopadhyay, U. Maulik, L. Holder, D. Cook (Eds.), Springer, 2005.
  3. Mining Frequent Patterns in Data Streams at Multiple Time Granularities,
    by C. Giannella, J. Han, J. Pei, X. Yan, and P. S. Yu,
    Next Generation Data Mining, H. Kargupta, A. Joshi, K. Sivakumar, and Y. Yesha (eds.),  AAAI/MIT, 2004.
  4. Sequential Pattern Mining by Pattern-Growth: Principles and Extensions,
    by J. Han, J. Pei, and X. Yan,
    Recent Advances in Data Mining and Granular Computing (Mathematical Aspects of Knowledge Discovery), W. Chu and T. Lin (eds.), Springer Verlag, 2004.

Workshop Papers, Demos, and Technical Reports

  1. EasyTicket: A Ticket Routing Recommendation Engine for Enterprise Problem Resolution,
    by Q. Shao, Y. Chen, S. Tao, X. Yan, N. Anerousis,
    Proc. of 2008 Int. Conf. on Very Large Data Bases (VLDB'08),  Auckland, New Zealand,  2008
  2. Combining near-optimal feature selection with gSpan,
    by K. Borgwardt1, X. Yan, M. Thoma, H. Cheng, A. Gretton, L. Song, A. Smola, J. Han, P. Yu, H.-P. Kriegel,
    6th Int. Workshop on Mining and Learning with Graph (MLG'08), Helsinki, Finland, 2008
  3. Entity Search: Search Directly and Holistically,
    by T. Cheng, X. Yan, K. Chang,
    Proc. of 2007 Int. Conf. on Management of Data (SIGMOD'07), Beijing, China, 2007
  4. BioArrayMine: A Software Package for Integrative Analysis of Cross-platform and Cross-species Microarray Data,  
    by F. Pan, K. Kamath, H. Hu, Y. Huang, K. Zhang, M. Xu, X. Yan, J. Han, and X. Jasmine Zhou,
    Proc. of 2005 Int. Conf. on Intelligent Systems for Molecular Biology (ISMB'05), Detroit, MI, 2005 (system demo).
  5. GraphMiner: A Structural Pattern Mining System for Large Disk-based Graph Databases and Its Applications,  
    by W. Wang, C. Wang, Y. Zhu, B. Shi, J. Pei, X. Yan, and J. Han,
    Proc. of 2005 Int. Conf. on Management of Data (SIGMOD'05), 879-881, Baltimore, MD, 2005 (system demo).
  6. Mining Hidden Community in Heterogeneous Social Networks,  
    by D. Cai, Z. Shao, X. He, X. Yan, and J. Han,
    Technical Report UIUCDCS-R-2005-2538, Department of Computer Science, University of Illinois at Urbana-Champaign, 2005.
  7. Using Data Mining for Discovering Patterns in Autonomic Storage Systems,  
    by Z. Li, S. Srinivasan, Z. Chen, Y. Zhou, P. Tzvetkov, X. Yan, and J. Han,
    ACM Workshop on Algorithms and Architectures for Self-Managing Systems, Proc. of 2003 Federated Computing Research Conference (FCRC'03), 2003.
  8. A Framework for Continuous Quantile Computation over Sensor Networks,  
    by X. Yan, J. Yang, J. Han, and W. Wang,
    Technical Report UIUCDCS-R-2003-2382, Department of Computer Science, University of Illinois at Urbana-Champaign, 2003.
  9. gSpan: Graph-Based Substructure Pattern Mining,  
    by X. Yan and J. Han,
    Technical Report UIUCDCS-R-2002-2296, Department of Computer Science, University of Illinois at Urbana-Champaign, 2002.