Duplication of Implicitome tuples records

Issue #79 resolved
Richard Bruskiewich created an issue

While validating the loading B concept data into the implicitome, it was noted that a subset of Implicaitome records currently have duplicate records, generally, side-by-side in the database. Perhaps these duplications need to be removed, taking care that the entry left behind has a properly initialized B concept record.

Comments (7)

  1. b

    @rbruskiewich need you to resolve this one or make the call on foxing it or not. Is it fixed. From the user side, I don't see evidence of this.

  2. Richard Bruskiewich reporter

    There are about 1.3 million duplicates (out of 200 million entries). I have the SQL to detect them. The challenge is, as always, dealing with the huge table size with some sensible code that works in the background(?). There seem to be many assumed variables: MySQL configuration (for local access, permissions, etc.), limitations to internal working tables, timeouts of transaction links, etc. Probably need to consolidate efforts on this particular issue with the B concept update issue (somehow).

  3. Richard Bruskiewich reporter

    The B concept loading script (Issue #56) is now designed to filter out duplication in the Implicit relations data.

  4. Log in to comment