menu
{ "item_title" : "Clustering for Classification", "item_author" : [" Reuben Evans "], "item_description" : "Advances in technology have provided industry with an array of de-vices for collecting data. The frequency and scale of data collection means that there are now many large datasets being generated. To find patterns in these datasets it would be useful to be able to apply modern methods of classification such as support vector machines. Unfortunately these methods are computationally expensive, quadra-tic in the number of data points in fact, and so cannot be applied directly. This book proposes a framework whereby a variety of clustering methods can be used to summarise datasets, that is, reduce them to a smaller but still representative dataset so that these advanced me-thods can be applied. It compares the results of using this framework against using random selection on a large number of classification and regression problems. Results show that the clustered datasets are on average fifty percent smaller than the original datasets without loss of classification accuracy which is significantly better than ran-dom selection. They also show that there is no free lunch, for each dataset it is important to choose a clustering method carefully.", "item_img_path" : "https://covers3.booksamillion.com/covers/bam/3/63/903/163/3639031636_b.jpg", "price_data" : { "retail_price" : "52.92", "online_price" : "52.92", "our_price" : "52.92", "club_price" : "52.92", "savings_pct" : "0", "savings_amt" : "0.00", "club_savings_pct" : "0", "club_savings_amt" : "0.00", "discount_pct" : "10", "store_price" : "" } }
Clustering for Classification|Reuben Evans

Clustering for Classification : Using Standard Clustering Methods

local_shippingShip to Me
In Stock.
FREE Shipping for Club Members help

Overview

Advances in technology have provided industry with an array of de-vices for collecting data. The frequency and scale of data collection means that there are now many large datasets being generated. To find patterns in these datasets it would be useful to be able to apply modern methods of classification such as support vector machines. Unfortunately these methods are computationally expensive, quadra-tic in the number of data points in fact, and so cannot be applied directly. This book proposes a framework whereby a variety of clustering methods can be used to summarise datasets, that is, reduce them to a smaller but still representative dataset so that these advanced me-thods can be applied. It compares the results of using this framework against using random selection on a large number of classification and regression problems. Results show that the clustered datasets are on average fifty percent smaller than the original datasets without loss of classification accuracy which is significantly better than ran-dom selection. They also show that there is no free lunch, for each dataset it is important to choose a clustering method carefully.

This item is Non-Returnable

Details

  • ISBN-13: 9783639031638
  • ISBN-10: 3639031636
  • Publisher: VDM Verlag Dr. Mueller E.K.
  • Publish Date: June 2008
  • Dimensions: 9 x 6 x 0.22 inches
  • Shipping Weight: 0.34 pounds
  • Page Count: 108

Related Categories

You May Also Like...

    1

BAM Customer Reviews