Needlebase: brief description

Needlebase: brief description
Светлана Комарова

Светлана Комарова

Автор статьи. Системный администратор, Oracle DBA. Информационные технологии, интернет, телеком. Подробнее.

Needlebase provides a point-and-click interface for extracting structured information from web pages. As a user, you select elements on an example page that contain the data you’re interested in, and the tool then uses the patterns you’ve defined to pull out information from other pages on a site with a similar structure. For example, you might want to extract product names and prices from a shopping site. With the tool, you could find a single product page, select the product name and price, and then the same elements would be pulled for every other page it crawled from the site. It relies on the fact that most web pages are generated by combining templates with information retrieved from a database, and so have a very consistent structure.

Once you’ve gathered the data, it offers some features that are a bit like Google Refine’s for de-duplicating and cleaning up the data. All in all, it’s a very powerful tool for turning web content into structured information, with a very approachable interface.

Вас заинтересует / Intresting for you:

ASP.NET MVC as a Service Frame...
ASP.NET MVC as a Service Frame... 1689 views Zero Cool Wed, 31 Oct 2018, 09:44:10
CSS: Flex Container Properties...
CSS: Flex Container Properties... 7369 views Боба Sat, 09 Nov 2019, 07:41:19
WordPress Ecommerce: Plugin Ch...
WordPress Ecommerce: Plugin Ch... 722 views dbstalker Mon, 31 Jan 2022, 16:56:47
JavaScript for game designer: ...
JavaScript for game designer: ... 1083 views Antoni Tue, 27 Nov 2018, 14:12:32
Comments (0)
There are no comments posted here yet
Leave your comments
Posting as Guest
×
Suggested Locations