Oyster Home Page

Oyster Master TOC

Welcome

Welcome to the OYSTER enhancement project. This repository contains the original OYSTER code with some fixes. It will be used for maintenance on the existing code base.

Entity Resolution is the process of determining whether two references to real-world objects in an information system refer to the same object or to different objects. OYSTER (Open sYSTem for Entity Resolution) is a full end-to-end entity resolution system that supports inverted index blocking, configurable deterministic and probabilistic matching rules, transitive closure of pairwise linking, and the creation and maintenance of an entity identity knowledgebase (IKB) with persistent entity identifiers. In addition to creating an initial IKB in its "Identity Capture" configuration, OYSTER supports incremental updates of the IKB with new reference information through its "Identity Update" configuration. Through its "Assertion Configurations", OYSTER allows false positive and false negative clusters in the IKB to be corrected by splitting and merging entity identity clusters as appropriate. OYSTER also supports traceability and data provenance: each reference in the IKB is labeled with information indicating the original source of the reference and the process and rule that brought the reference into a particular entity identity cluster.
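To make the terms above more concrete, here is a minimal Python sketch of three of the ideas: an inverted index used for blocking, a deterministic pairwise match rule, and the transitive closure of the resulting links. It is not OYSTER code and does not use OYSTER's XML configuration; the function `resolve`, the sample references, the blocking key, and the match rule are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict
from itertools import combinations


def resolve(references, block_key, is_match):
    """Block references, compare pairs within each block, then close the links transitively."""
    # Inverted index: blocking key -> ids of the references that share it.
    index = defaultdict(list)
    for ref_id, ref in references.items():
        index[block_key(ref)].append(ref_id)

    # Union-find forest that accumulates the transitive closure of pairwise links.
    parent = {ref_id: ref_id for ref_id in references}

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    # Only pairs that land in the same block are ever compared.
    for ids in index.values():
        for a, b in combinations(ids, 2):
            if is_match(references[a], references[b]):
                parent[find(a)] = find(b)

    clusters = defaultdict(set)
    for ref_id in references:
        clusters[find(ref_id)].add(ref_id)
    return sorted(sorted(cluster) for cluster in clusters.values())


# Hypothetical sample data, not taken from any OYSTER demonstration file.
refs = {
    "r1": {"last": "Smith", "phone": "555-0100", "email": "js@example.com"},
    "r2": {"last": "Smith", "phone": "555-0100", "email": "john@example.com"},
    "r3": {"last": "Smyth", "phone": "555-0199", "email": "john@example.com"},
    "r4": {"last": "Stone", "phone": "555-0142", "email": "stone@example.com"},
    "r5": {"last": "Jones", "phone": "555-0142", "email": "stone@example.com"},
}


def first_letter(ref):
    # Assumed blocking key: first letter of the last name.
    return ref["last"][0].upper()


def same_phone_or_email(a, b):
    # Assumed deterministic rule: exact agreement on phone or on email.
    return a["phone"] == b["phone"] or a["email"] == b["email"]


print(resolve(refs, first_letter, same_phone_or_email))
# prints [['r1', 'r2', 'r3'], ['r4'], ['r5']]
```

In this toy run, r1 and r3 never match directly, but both match r2, so the transitive closure places all three in one cluster. References r4 and r5 would satisfy the rule, yet they fall into different blocks and are never compared, which is the kind of false negative that an assertion-style correction is meant to repair.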

SourceForge

The following wiki pages can help you download, install, and run OYSTER on your laptop:

  • To learn more about OYSTER, see the OYSTER Overview wiki page.
  • The OYSTER Demonstration Run page contains step-by-step instructions for the OYSTER demonstration. Follow them to install OYSTER and run its configurations.
  • The latest version of the OYSTER jar file is available in the Downloads section via the link in the left menu.
  • The OYSTER Reference Guide is intended for use by OYSTER users who already have an understanding of the basics of OYSTER. It is designed to answer configuration questions regarding available options in the OYSTER XML files.
  • The OYSTER User Guide introduces the basic concepts of Entity Resolution (ER). It touches on the five activities of ER, the four architectures of ER, and the four techniques used to determine equivalence in ER.
