![]() ![]() : We have released the WDC Training Dataset and Gold Standard for Large-Scale Product Matching.: We have released a new version of the RDFa, Microdata, Microformat, and Embedded JSON-LD data corpus extracted from the November 2018 Common Crawl corpus.: Paper about The WDC training dataset and gold standard for large-scale product matching accepted at ECNLP Workshop at WWW2019 conference in San Francisco.: Journal Article about Using the Semantic Web as a Source of Training Data has been published by the Datenbank-Spektrum Journal.: We have released the Time-Dependent Ground Truth (TDGT), a dataset covering time-dependent data from various domains.: We have released the Web Tables for Long-Tail Entity Extraction (T4LTE) dataset, the first gold standard for the task of long-tail entity extraction from web tables.: Version 2.0 of the WDC Product Data Corpus and Gold Standard for Large-Scale Product Matching released.The WDC Product Data Corpus and Gold Standard V2.0 will be used as training and evaluation resources for the Product Matching task.Įmbedded JSON-LD data sets extracted from the November 2019 Common Crawl corpus. : The CfP for the Semantic Web ISWC2020 "Mining the Web of HTML-embedded Product Data" has been announced.: We will present the paper Using Annotations for Training and Maintaining Product Matchers using Version 2.0 of the WDC Product Data Corpus and Gold Standard for Large-Scale Product Matching at the WIMS2020 conference.: The paper Intermediate Training of BERT for Product Matching using Version 2.0 of the WDC Product Data Corpus and Gold Standard for Large-Scale Product Matching has been accepted at the DI2KG workshop held in conjunction with VLDB2020.: We have released the WDC RDFa, Microdata, Microformat, andĮmbedded JSON-LD data sets extracted from the September 2020 Common Crawl corpus. ![]()
0 Comments
Leave a Reply. |