Project Description
Data Extracting SDK can help you to extract information from the web resources in a simple way.

This framework allows you to extract different data from text or web sources, analyze text content, extract different information and write your own data mining & extracting applications and services.

Install from Nuget

Install-Package DataExtractingSDK

Features

  • DOM analysis
  • Emails, phones, images, links extracting
  • websites meta information extracting
  • rich data parser possibilities
  • websites screenshots extracting
  • rich HTML processing
  • and more

Simple Example

This sample extracts all emails from the given web page:

using System;
using System.Data.Extracting;

namespace VS2010Demo
{
    class Program
    {
        static void Main(string[] args)
        {
            DataExtractor ext = new DataExtractor(new Uri("http://msug.vn.ua/"), DataTypes.Email);
            var results = ext.GetExtractedResults();

            foreach (var item in results)
            {
                Console.WriteLine("{0}: {1}", item.GroupName, item.Value);
            }

            Console.Write("\nPress any key to exit...");
            Console.ReadKey();
        }
    }
}

Last edited Mar 30, 2012 at 2:01 PM by akrakovetsky, version 70