DataSets - accord-net/framework GitHub Wiki

The framework comes with a collection dataset classes that can be used to retrieve popular datasets or data collections from the web into your development machine.

Test Images

The TestImages class can be used to retrieve common image processing test images, such as Lena Söderberg's picture:

// Let's load an example image, such as Lena,
// from a standard dataset of example images:
var images = new TestImages(path: localPath);
Bitmap lena = images["lena.bmp"];

// Create a new Histogram of Oriented Gradients with the default parameter values:
var hog = new HistogramsOfOrientedGradients(numberOfBins: 9, blockSize: 3, cellSize: 6);

// Use it to extract descriptors from the Lena image:
List<double[]> descriptors = hog.ProcessImage(lena);

// Now those descriptors can be used to represent the image itself, such
// as for example, in the Bag-of-Visual-Words approach for classification.

Test Videos

The TestVideos class provides small sample clips to test pedestrian detection, face recognition and tracking.

// Let's test the tracker using a sample video from 
// the collection of test videos in the framework:
TestVideos ds = new TestVideos(basePath);
string fileName = ds["walking.mp4"];

Please check the documentation page for the TestVideos class for a more complete example demonstrating how to use a sample video to test an object tracker.

Free Spoken Digits Dataset

The FreeSpokenDigitsDataset class can be used to retrieve audio samples from the Free Spoken Digits Dataset project.

// Create a new Bag-of-Audio-Words (BoW) model
var bow = BagOfAudioWords.Create(numberOfWords: 32);
// Note: a simple BoW model can also be created using
// var bow = new BagOfAudioWords(numberOfWords: 10);

// Get some training images
FreeSpokenDigitsDataset fsdd = new FreeSpokenDigitsDataset(basePath);
string[] trainFileNames = fsdd.Training.LocalPaths;
int[] trainOutputs = fsdd.Training.Digits;

// Compute the model
bow.Learn(trainFileNames);

For a more complete example on how to perform audio classification using Bag-of-Audio-Words, please refer to the documentation for the BagOfAudioWords class.

Others

For a list of the datasets currently available in the framework, please refer to the Accord.DataSets documentation page. Those datasets should include, but are not limited to:

⚠️ **GitHub.com Fallback** ⚠️