Feature extraction

Feature extraction

In pattern recognition and in image processing, Feature extraction is a special form of dimensionality reduction.

When the input data to an algorithm is too large to be processed and it is suspected to be notoriously redundant (much data, but not much information) then the input data will be transformed into a reduced representation set of features (also named features vector). Transforming the input data into the set of features is called "features extraction". If the features extracted are carefully chosen it is expected that the features set will extract the relevant information from the input data in order to perform the desired task using this reduced representation instead of the full size input.

General

Feature extraction involves simplifying the amount of resources required to describe a large set of data accurately. When performing analysis of complex data one of the major problems stems from the number of variables involved. Analysis with a large number of variables generally requires a large amount of memory and computation power or a classification algorithm which overfits the training sample and generalizes poorly to new samples. Feature extraction is a general term for methods of constructing combinations of the variables to get around these problems while still describing the data with sufficient accuracy.

Best results are achieved when an expert constructs a set of application-dependent features. Nevertheless, if no such expert knowledge is available general dimensionality reduction techniques may help. These include:
* Principal components analysis
* Semidefinite embedding
* Multifactor dimensionality reduction
* Nonlinear dimensionality reduction
* Isomap
* Kernel PCA
* Latent semantic analysis
* Partial least squares

Image processing

It can be used in the area of image processing which involves using algorithms to detect and isolate various desired portions or shapes (features) of a digitized image or video stream. It is particularly important in the area of Optical Character Recognition.

Low-level

* Edge detection
* Corner detection
* Blob detection
* Ridge detection
* Scale-invariant feature transform

Curvature

*Edge direction, changing intensity, autocorrelation.

Image motion

*Motion detection. Area based, differential approach. Optical flow.

hape Based

Thresholding

Blob extraction

Template matching

Hough transform

*Lines
*Circles/Ellipse
*Arbitrary shapes (Generalized Hough Transform)

Flexible methods

*Deformable, parameterized shapes
*Active contours (snakes)

References

* [http://jmlr.csail.mit.edu/papers/special/feature03.html JMLR Special Issue on Variable and Feature Selection]

ee also

*Cluster analysis
*Dimensionality reduction
*Feature detection
*Feature selection
*Data mining
*Connected component labeling
*Segmentation_(image_processing)


Wikimedia Foundation. 2010.

Игры ⚽ Нужно сделать НИР?

Look at other dictionaries:

  • feature extraction —    The optical character recognition (OCR) technique used by     omnifont OCR software …   IT glossary of terms, acronyms and abbreviations

  • Feature selection — Feature selection, also known as variable selection, feature reduction, attribute selection or variable subset selection, is the technique, commonly used in machine learning, of selecting a subset of relevant features for building robust learning …   Wikipedia

  • Feature (computer vision) — In computer vision and image processing the concept of feature is used to denote a piece of information which is relevant for solving the computational task related to a certain application. More specifically, features can refer to* the result of …   Wikipedia

  • Feature detection (computer vision) — In computer vision and image processing the concept of feature detection refers to methods that aim at computing abstractions of image information and making local decisions at every image point whether there is an image feature of a given type… …   Wikipedia

  • Feature vector — In pattern recognition and machine learning, a feature vector is an n dimensional vector of numerical features that represent some object. Many algorithms in machine learning require a numerical representation of objects, since such… …   Wikipedia

  • Scale-invariant feature transform — Feature detection Output of a typical corner detection algorithm …   Wikipedia

  • Feature recognition — The term feature does not imply the same meaning in different engineering disciplines. This has resulted in several ambiguous definitions for feature. A feature, in computer aided design (CAD) software, can be called a region of a part with some… …   Wikipedia

  • Extraction de caractéristique en vision par ordinateur — Pour les articles homonymes, voir extraction de caractéristique (homonymie). En vision par ordinateur, l extraction de caractéristiques visuelles (ou visual features extraction en anglais) consiste en des transformations mathématiques calculées… …   Wikipédia en Français

  • Scale-invariant feature transform — Exemple de résultat de la comparaison de deux images par la méthode SIFT (Fantasia ou Jeu de la poudre, devant la porte d’entrée de la ville de Méquinez, par Eug …   Wikipédia en Français

  • Shale oil extraction — Shell s experimental in situ shale oil facility, Piceance Basin, Colorado, United States Process type Chemical Industrial sector(s) Chemical industry, oil indus …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”