- Commercial data mining systems have little in common
· Different data mining functionality or methodology
· May even work with completely different kinds of data sets
- Need multiple dimensional view in selection
- Data types: relational, transactional, text, time sequence, spatial?
- System issues
· running on only one or on several operating systems?
· a client/server architecture?
· Provide Web-based interfaces and allow XML data as input and/or output?
· Data sources
· ASCII text files, multiple relational data sources
· support ODBC connections (OLE DB, JDBC)?
· Data mining functions and methodologies
· One vs. multiple data mining functions
· One vs. variety of methods per function
· More data mining functions and methods per function provide the user with greater flexibility and analysis power
· Coupling with DB and/or data warehouse systems
· Four forms of coupling: no coupling, loose coupling, semitight coupling, and tight coupling
· Ideally, a data mining system should be tightly coupled with a database system
· Scalability
· Row (or database size) scalability
· Column (or dimension) scalability
· Curse of dimensionality: it is much more challenging to make a system column scalable that row scalable
· Visualization tools
· “A picture is worth a thousand words”
· Visualization categories: data visualization, mining result visualization, mining process visualization, and visual data mining
- Data mining query language and graphical user interface
· Easy-to-use and high-quality graphical user interface
· Essential for user-guided, highly interactive data mining
0 komentar:
Posting Komentar