Knowee
Questions
Features
Study Tools

Data mining is a process of extracting valid, previously unknown, and ultimately comprehensible information from large datasets and using it for organizational decision making [10]. However, there a lot of problems exist in mining data in large datasets such as data redundancy, the value of attributes is not specific, data is not complete and outlier [13].Outlier is defined as an observation that deviates too much from other observations that it arouses suspicions that it was generated by a different mechanism from other observations [21]. The identification of outliers can provide useful, sufficient and meaningful knowledge and number of applications in areas such as climatology, ecology public health, transportation, and location based services. Recently, a few studies have been conducted on outlier detection for large dataset [4]. However, most existing study concentrate on the algorithm based on special background, compared with outlier identification approach is comparatively less. This paper mainly discusses about outlier detection approaches from data mining perspective. The inherent idea is to research and compare achieving mechanism of those approaches to determine which approach is better based on special dataset and different background. if nothing seems compatible or relevant just tell me

Question

Data mining is a process of extracting valid, previously unknown, and ultimately comprehensible information from large datasets and using it for organizational decision making [10]. However, there a lot of problems exist in mining data in large datasets such as data redundancy, the value of attributes is not specific, data is not complete and outlier [13].Outlier is defined as an observation that deviates too much from other observations that it arouses suspicions that it was generated by a different mechanism from other observations [21]. The identification of outliers can provide useful, sufficient and meaningful knowledge and number of applications in areas such as climatology, ecology public health, transportation, and location based services. Recently, a few studies have been conducted on outlier detection for large dataset [4]. However, most existing study concentrate on the algorithm based on special background, compared with outlier identification approach is comparatively less. This paper mainly discusses about outlier detection approaches from data mining perspective. The inherent idea is to research and compare achieving mechanism of those approaches to determine which approach is better based on special dataset and different background. if nothing seems compatible or relevant just tell me

...expand
🧐 Not the exact question you are looking for?Go ask a question

Solution

This passage discusses the concept of data mining, which is the process of extracting valuable, previously unknown, and comprehensible information from large datasets for organizational decision-making. However, the author notes that there are several challenges associated with mining data from large datasets, including data redundancy, non-specific attribute values, incomplete data, and outliers.

An outlier is defined as an observation that deviates significantly from other observations, raising suspicions that it was generated by a different mechanism. Identifying outliers can provide useful and meaningful knowledge and has applications in various fields such as climatology, ecology, public health, transportation, and location-based services.

The author mentions that while there have been some recent studies on outlier detection for large datasets, most existing studies focus on algorithms based on specific backgrounds. In contrast, there is comparatively less focus on the approach to outlier identification.

The main focus of this paper is to discuss outlier detection approaches from a data mining perspective. The underlying idea is to research and compare the mechanisms of these approaches to determine which approach is better based on a specific dataset and different backgrounds.

This problem has been solved

Similar Questions

Suppose that you are employed as a data mining consultant for an Internet search engine company. Describe how data mining can help the company by giving specific examples of how techniques, such as clustering, classification, association rule mining, and anomaly detection can be applied. Discuss whether or not each of the following activities is a data mining task. Dividing the customers of a company according to their gender. Dividing the customers of a company according to their profitability. Computing the total sales of a company. Sorting a student database based on student identification numbers. Predicting the outcomes of tossing a (fair) pair of dice. Predicting the future stock price of a company using historical records. Monitoring the heart rate of a patient for abnormalities. Monitoring seismic waves for earthquake activities. Extracting the frequencies of a sound wave. For each of the following data sets, explain whether or not data privacy is an important issue. Census data collected from 1900-1950. IP addresses and visit times of Web users who visit your Website. Images from Earth-orbiting satellites. Names and addresses of people from the telephone book. Names and email addresses collected from the Web.

Definition of Data Mining

A huge collection of the information or data accumulated form several different sources is known as ________.a.Data Miningb.Data mining and Data managementc.Data Warehoused.Data Management

Which of the following machine learning techniques helps in detecting the outliers in data?

A goal of data mining includes which of the following?. To explain some observed event or condition To confirm that data exists To analyze data for expected relationships To create a new data warehouse

1/3

Upgrade your grade with Knowee

Get personalized homework help. Review tough concepts in more detail, or go deeper into your topic by exploring other relevant questions.