开源项目社区 | 当前位置 : |
|
www.trustie.net/open_source_projects | 主页 > 开源项目社区 > ariel |
ariel
|
0 | 2 | 33 |
贡献者 | 讨论 | 代码提交 |
概述
Ariel intends to assist in extracting information from semi-structured
documents including (but not in any way limited to) web pages. Although you
may use libraries such as Hpricot or Rubyful Soup, or even plain Regular
Expressions to achieve the same goal, Ariel approaches the problem very
differently. Ariel relies on the user labeling examples of the data they
want to extract, and then finds patterns across several such labeled
examples in order to produce a set of general rules for extracting this
information from any similar document.
创建时间:2014-05-08 04:08