Data Mining as a Practical Science

by Ralph Dawson.

Share
|
Homepage | Submit your article | Contact | TOS
More articles on business it  

You are here: Categories » Business » Business IT

Data mining is located at the crossing of different disciplines. Its roots are to be found in the data analysis techniques that were originally the main object of the study of statistics. The fundamental ideas at the basis of estimation theory, classification, clustering, sampling theory, are indeed still one of the major ingredients of data mining. But other methods and techniques have been added to the toolbox of the data analyst, extending the limits of the classical parametric statistics with more complex models, reaching their maturity with the actual state of knowledge on decision trees, neural networks, support vector machines, just to mention a few. In addition, the need to organize and manage large bodies of data has required the deployment of computer science techniques for database management, query optimization, optimal coding of algorithms, and other tasks devoted to the storing of information in the memory of computers and to the efficient execution of algorithms.

A common trademark of the modern approaches is the formalization of estimation and classification problems arising in data mining as mathematical optimization problems, and the use of consistent algorithmic techniques to determine optimal solutions for these problems. Such methodological framework has been strongly supported by applied mathematics and operations research (OR), a scientific discipline characterized by a deep integration of mathematical theory and practical problems. A significant evidence of the role of OR in data mining is the contribution that nonlinear and integer optimization methods have given to the solution of the error minimization functions that need to be optimized to train neural networks and support vector machines. Analogously, integer programming and combinatorial optimization have been largely used to solve problems arising in the identification of synthetic rule-based classification models and in the selection of optimal subsets of features in large datasets.

Despite its strong methodological characterization, data mining cannot be successfully applied without a deep understanding of the semantic of each specific problem, which often requires the customization of existing methods or the development of ad hoc techniques, partially based on already existing algorithms. To some extent, the real challenge that the data mining practitioner has to face is the selection, among many different methods and approaches, of the one that best serves the scope of the task considered, often assessing a compromise between the complexity of the chosen model and its generalization capability.

Leave a comment or ask a question
Total comments: 0

Business IT Disclaimer

  • The e-articles directory is not responsible for any and all copyright infringements by writers and authors. If you suspect the information contained by this page for any copyright infringements, please contact us to investigate the issue
Find Call center and BPO jobs in India - There are many kinds of jobs in BPO as there are in any other industry. The prevalence of BPO adheres to the existing need of different industries, like manufacturing, servicing industries, consume (more...)
IT Jobs opportunities for the young talents - In today's digitally emerged and technically enhanced business world there will be none who doesn't know about Information Technology. Even an illiterate says its computers related stuff as an answ (more...)
Why Structural 3D Modeling is Critical for Industrial Designs - As Industrial designs intend to create and execute design solutions towards problems of form, usability, engineering, brand development and sales, 3D modeling has a crucial role to play. 3D modelin (more...)
Advantages of Outsourcing - There are many functions that are being typically outsourced these days. Take for instance the case of promotional materials that are so very important for the success of any enterprise, big or s (more...)
IT SERVICES - "An IT Services provider is an entity that provides services to other IT service." IT Services, as defined by the Information Technology Association of America (ITAA), is "the study, design (more...)
Data Entry Offshore Services - Data Entry Services is a fast growing industry. The universe of business is dynamic, fast paced, and in constant flux. In such an atmosphere the accessibility of precise, thorough information is a (more...)
Modern Day Use of UID Labels - Innovation plays a significant role in driving business, which in turn drives our economy. Although an indelible mark is being made on nearly every aspect of modern life, few of us ever stop to ref (more...)
Why people (especially Oracle DBAs) changes job so frequently in IT field - Important reasons of why people (Specially Oracle DBA) change their job frequently in I.T field. Important factors and reasons behind jumping trend and frequent change job in I (more...)
Print your documents with any printer across the world - As technology is advancing new gadgets are getting discovered to make our life simpler. Computers and digital gadgets have made our life effortless. We use Internet to contact our friends and gro (more...)
Information Technology for Project Management Automation - Although project management systems have been evolving from mainframe -based, big-iron programs into microcomputer-based, GUI (graphical user interface) programs, they are still in their infancy (more...)

 
free content
    Copyright © 2006 - 2012 e-articles.info.
The texts, articles and tutorials in the directory are property of their respective owners and authors.