Category Archives: eHRC

Debugging lessons

For the past 6 weeks or so I’ve been trying to track down an elusive bug in my SNOMED classifier. The difficulty has been that it only manifest with very large input sets (I only managed to reduce it down to about 350,000 concept definitions). This meant lots of large data-structures and long [...]

Very high speed ontology classification

You heard it here first, SNOMED CT classified in 440s (under 7.5 min) based on an optimised version of the Desden Algorithm written in Java.
This is more than three times faster than the best known published result.
A caveat:

correctness has been checked on small number of examples only

Benchmarking shows (as expected) quadratic time O(n2) and linear [...]