Abstract

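Webpage information extraction (WIE) is an important step in creating knowledge bases. Classical WIE methods rely on the Document Object Model (DOM) tree of a website. However, using the DOM tree poses significant challenges because it encodes context and appearance only in an abstract manner. To address this, we propose reformulating WIE as a context-aware webpage object detection task. Specifically, we develop a Context-aware Visual Attention-based (CoVA) detection pipeline that combines appearance features with syntactic structure from the DOM tree. To study this approach, we collect a new large-scale dataset of e-commerce websites in which we manually annotate every web element with one of four labels: product price, product title, product image, or background. On this dataset, we show that the proposed CoVA approach is a new, challenging baseline that improves upon prior state-of-the-art methods.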

Anurendra Kumar*, Keval Morabia*, Jingjin Wang, Kevin Chen-Chuan Chang, Alexander Schwing

CoVA: Context-aware Visual Attention for Webpage Information Extraction
