|
Short
Biography
Dr Tao Li is currently an associate professor in the School of Computer Science, Florida International University. He
received his Ph.D. in computer science from the Department of Computer Science, University of Rochester in 2004. (My old homepage at Rochester).
Dr Tao Li's research explores two related topics on learning from
data---how to efficiently discover useful patterns and how to effectively
retrieve information. The interests lie broadly in data mining and machine
learning studying both the algorithmic and application issues. The
algorithmic aspects involve developing new scalable, efficient and
interactive algorithms that can handle very large databases. The underlying
techniques studied include clustering, classification, semi-supervised
learning, similarity and temporal pattern discovery. The application issues
focus on actual implementation and usage of the algorithms on a variety of
real applications with different characteristics including bioinformatics,
text mining, music information retrieval and event mining for computer
system management.
 |
A new book entitled
"Music Data Mining" has appeared from CRC Press. Here
is
the publisher link
|
|
Awards
-
Xerox University
Affairs Committee (UAC) Award, 2011-2014
- IBM Scalable Data
Analytics Innovation Award, 2010
-
NSF Career Award,
2006-2011
- Excellence in Student
Mentoring Award, School of Computing and Information Sciences, Florida
International University, 2009
- Excellence in
Research Award, Florida International University, 2009
- Excellence in
Faculty Scholarship, Florida International University, 2008
- IBM Faculty Award,
2005, 2007 & 2008
- IBM Shared
University Research (SUR) Award, 2005
-
Xerox University
Affairs Committee (UAC) Award, 2005-2008
-
Excellence in
Research Award, School of Computer Science, Florida International
University, 2005
- Comparative Document
Summarization via Discriminative Sentence Selection,
ACM
Transactions on Knowledge Discovery from Data (ACM TKDD), 2012, in
press.
-
Dynamic Query Forms for Database Queries,
IEEE Transactions on Knowledge and
Data Engineering (TKDE), 2013, in press.
- A Learning Approach to
SQL Query Results Ranking Using Skyline and Users' Current Navigational
Behavior.
IEEE Transactions on Knowledge and
Data Engineering (TKDE), 2012, in press.
-
Discovering Lag Intervals for Temporal Dependencies,
In SIGKDD 2012: 633-641, 2012.
-
MEET - A Generalized
Framework for Reciprocal Recommendation, In
CIKM 2012.
-
Generating Event
Storyline from Microblogs, In
CIKM 2012.
-
Self-adaptive Cloud Capacity Planning,
In IEEE SCC 2012:
73-80, 2012.
-
Generating Pictorial Storylines via
Minimum-Weight Connected Dominating Set Approximation in Multi-view
Graphs, In
AAAI 2012.
-
Optimizing System Monitoring Configurations for
Non-Actionable Alerts, In
IEEE
NOMS, 2012.
-
Integrating Document
Clustering and Multi-Document Summarization,
ACM
Transactions on Knowledge Discovery from Data (ACM TKDD), 5(3):
Article 14,
2011.
-
Applying Data Mining Techniques to
Address Disaster Information Management Challenges on Mobile Devices,
In
SIGKDD
2011, 283-291, 2011.
(Online
Demo)
-
Combining File Content and File
Relations for Cloud Based Malware Detection,
In SIGKDD
2011, 222-230, 2011.
(Online
Demo)
-
SCENE: A Scalable Two-stage
Personalized News Recommendation System,
In SIGIR 2011:125-134.
-
Extending Consensus
Clustering to Explore Multiple Clustering Views,
In
SDM 2011: 920-931.
-
LogSig: Generating System
Events from Raw Textual Logs, In
CIKM 2011.
-
Natural Event Summarization,
In
CIKM 2011: 765-774.
-
ASAP: A Self-Adaptive Prediction System for Instant Cloud
Resource Demand Provisioning, In
ICDM 2011.
-
Semi-supervised Hierarchical Clustering, In
ICDM 2011.
-
Learning to Rank for Query-focused Multi-document Summarization,
In
ICDM 2011.
- Community Discovery Using Nonnegative Matrix Factorization,
Data Mining and Knowledge
Discovery, 22(3): 493-521, 2011.
-
A Non-negative
Matrix Factorization Based Approach for Active Dual Supervision from
Document and Word Labels, In
EMNLP 2011,
2011.
-
Integrating Clustering and Multi-Document Summarization by Bi-mixture
Probabilistic Latent Semantic Analysis (PLSA) with Sentence Bases,
In
AAAI-11,
2011.
- LogTree:
A Framework for Generating System Events from Raw Textual Logs,In
ICDM 2010, 491-500, 2010.
-
Weighted Feature
Subset Non-Negative Matrix Factorization and its Applications to
Document Understanding,
In
ICDM 2010, 541-550, 2010.
-
Binary Matrix Factorization for Analyzing Gene
Expression Data,
Data
Mining and Knowledge Discovery. 20(1): 28-52, 2010.
-
Automatic Malware Categorization Using Cluster
Ensemble,
In
SIGKDD 2010: 95-104, 2010.
-
Using Data Mining Techniques to Address Critical
Information Exchange Needs in Disaster Affected Public-Private Networks,
In
SIGKDD 2010:
125-134, 2010.
-
Convex and Semi-Nonnegative Matrix
Factorizations,
IEEE Trans. Pattern Anal. Mach. Intell. 32(1): 45-55, 2010.
-
Bridging Domains with Words: Opinion Analysis
with Matrix Tri-factorizations, In
SIAM
DM 2010: 293-302.
-
A Non-negative Matrix Tri-factorization Approach to
Sentiment Classification with Lexical Prior Knowledge,
In
ACL 2009: 244-252.
-
Intelligent File Scoring System for Malware
Detection from the Gray List,
In
SIGKDD 2009: 1385-1394.
- Semi-Supervised Multi-Task Learning with Task Regularizations, In
ICDM 2009:562-568.
-
Knowledge
Transformation for Cross-domain Sentiment Classification,
In
SIGIR 2009: 716-717.
-
Generalized
Cluster Aggregation,
In
IJCAI 2009: 1279-1284.
- Dynamic Active Probing of Helpdesk
Databases, In VLDB
2008: 748-760.
-
Simultaneous Tensor Subspace Selection and Clustering: The Equivalence of
High Order SVD and K-Means Clustering, In
SIGKDD 2008: 327-335.
-
Multi-Document Summarization via Sentence-Level
Semantic Analysis and Symmetric Matrix Factorization,
in SIGIR 2008: 187-194.
- Knowledge
Transformation from Word Space to Document Space, in
SIGIR 2008: 307-314.
- On the
Equivalence Between Nonnegative Matrix Factorization and Probabilistic
Latent Semantic Indexing, in:
Computational Statistics and Data Analysis,
52(8): 3913-3927, 2008.
-
Semi-Supervised Clustering via Matrix Factorization, In
SIAM DM 2008: 1-12.
-
Weighted Consensus Clustering, In
SIAM DM 2008: 798-809.
-
Solving
Consensus and Semi-supervised Clustering Problems Using Nonnegative Matrix
Factorization, In
ICDM 2007: 577-582.
-
Adaptive Dimension Reduction Using Discriminant Analysis and K-means
Clustering, In
ICML 2007:
521-528.
-
Addressing Diverse User Preferences in SQL-Query-Result Navigation,
In SIGMOD 2007: 641-652.
-
A Learning
Framework using Green's Function and Kernel Regularization with Application
for Recommender System, In SIGKDD 2007: 260-269.
-
Orthogonal Nonnegative Matrix Tri-factorizations for Clustering,
In SIGKDD 2006: 126-135.
- The Relationships among Various Nonnegative Matrix
Factorization Methods for Clustering, In
ICDM 2006: 362-371.
-
Towards
Intelligent Music Retrieval.
IEEE
Transactions on Multimedia, 8(3): 564-574 (2006).
-
A Unified
View On Clustering Binary Data,
Machine Learning,62(3):
199-215 (2006).
-
An Integrated
Framework on Mining Logs Files for Computing System Management,In
SIGKDD 2005: 776-781.
-
A General Model
for Clustering Binary Data, In SIGKDD 2005: 188-197.
-
Document
Clustering via Adaptive Subspace Iteration, In
SIGIR 2004: 218-225.
-
Entropy-Based
Criterion in Categorical Clustering, In
ICML 2004: 536-543.
-
A Comparative
Study of Feature Selection and Multiclass Classification Methods for Tissue
Classification Based on Gene Expression,
Bioinformatics,
20(15):2429-37, 2004.
-
A
Comparative Study on Content-Based Music Genre Classification, In
SIGIR 2003: 282-289.
Meeting Time: Tuesday: 19:50pm -- 22:30pm
Meeting Room: GL139
Meeting Time: Tuesday and Thursday: 12:30pm -- 13:45pm
Meeting Room: GL139
Meeting Time: Online
Other Links:
|