Ah! Excellent. I looked briefly, and at first glance it looks like the concepts that have the topography pre-coordinated contain that link, and the morphology-only ones don’t. That’s what you would expect.
Except that you have to link “Small cell carcinoma” and “Small cell carcinoma of the lung”. Because the former has the histology “property”, and the latter has the site “relationship”. How do you do that?
Look for yourself. There is a missing link. Unless I am missing something…
We were thinking of the following script:
1. Take the SNOMED parent of all malignant disorders, 443392 Malignant neoplastic disease (SNOMED code 363346000), and pull all its descendants from the CONCEPT_ANCESTOR table
2. Do the crosswalk to the NCI code through UMLS
3. Pull the ICD-O histology(ies) and the ICD-O site(s) from the NCI Metathesaurus
4. Assess coverage and quality
We can easily do 1-2. But it looks like the NCI download will take a couple of days. If you want, we can divide the labor and do it together. Let me know.
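For what it's worth, the first step (pulling all descendants) could look roughly like the sketch below. It assumes the standard OMOP CONCEPT_ANCESTOR columns `ancestor_concept_id` and `descendant_concept_id`; the in-memory SQLite table and the descendant ids 1001 to 1003 are made up purely for illustration, and the helper name `descendants` is hypothetical.

```python
import sqlite3

# Concept named in the post: Malignant neoplastic disease (SNOMED code 363346000)
MALIGNANT_NEOPLASTIC_DISEASE = 443392


def descendants(conn, ancestor_id):
    """Return all descendant concept ids of ancestor_id, excluding the ancestor itself."""
    rows = conn.execute(
        "SELECT descendant_concept_id FROM concept_ancestor "
        "WHERE ancestor_concept_id = ? AND descendant_concept_id <> ? "
        "ORDER BY descendant_concept_id",
        (ancestor_id, ancestor_id),
    ).fetchall()
    return [row[0] for row in rows]


# Toy stand-in for the real vocabulary table (all rows below are hypothetical).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE concept_ancestor ("
    "ancestor_concept_id INTEGER, descendant_concept_id INTEGER)"
)
conn.executemany(
    "INSERT INTO concept_ancestor VALUES (?, ?)",
    [
        (443392, 443392),  # every concept is its own ancestor
        (443392, 1001),
        (443392, 1002),
        (1001, 1002),
        (9999, 1003),      # unrelated branch, must not be returned
    ],
)

malignant = descendants(conn, MALIGNANT_NEOPLASTIC_DISEASE)
print(malignant)
```

Against a real vocabulary database only the connection changes; the query itself stays the same.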
No need for that, Iker. We prefer stealing over recreating by a factor of infinity to one.
Totally understood. What we wanted to do is see how bad it is. If it is bad, we will create separate fields. If it is feasible (because the number of combos that actually exist is in the 1000s, not millions), we can pre-coordinate, save on new fields, and stay backwards-compatible. This is an open-ended investigation where we collectively have to make a decision.
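The "how bad is it" check boils down to counting the distinct histology/site pairs that actually occur and comparing that to the full cross product. A minimal sketch, assuming records that carry an ICD-O histology and an ICD-O site; the four rows and the field names `histology` and `site` are illustrative only, not real data:

```python
from collections import Counter


def count_combos(records):
    """Count how often each (histology, site) pair occurs in the records."""
    return Counter((r["histology"], r["site"]) for r in records)


# Hypothetical rows, just to show the shape of the input.
records = [
    {"histology": "8041/3", "site": "C34.9"},
    {"histology": "8041/3", "site": "C34.9"},
    {"histology": "8140/3", "site": "C18.9"},
    {"histology": "8140/3", "site": "C34.9"},
]

combos = count_combos(records)

# Size of the full cross product of observed histologies and observed sites.
possible = len({r["histology"] for r in records}) * len({r["site"] for r in records})

print(len(combos), "observed of", possible, "possible")
```

If the observed count stays in the thousands while the cross product runs into the millions, that argues for pre-coordinating only the combinations that exist.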
Please keep doing this. We need all hands on deck from folks who know this stuff.
Great! That’s our number: 10500. Totally doable. Let’s check it against real data.
Yes. That’s what I am proposing to do with the exhaustive script. That’s also what @rimma was saying.
Agreed. And the idea is to augment or extend what’s in SNOMED with self-made ICD-O-derived concepts. Not to replace one with the other or to abandon one entirely.
We know. The debate is whether or not we should pre-coordinate combinations (permute all possible ones and use all useful ones).
It actually is, even though it’s a well-kept secret. The pre-coordinated concepts of a diagnosis (histology-site-grade) have links to the components. In fact, SNOMED does that with all diseases, not just neoplasms. But regarding quality and comprehensive coverage - we need to evaluate.
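To check those component links, something like the sketch below could be a starting point. It assumes the OMOP CONCEPT_RELATIONSHIP columns `concept_id_1`, `relationship_id`, and `concept_id_2`. The relationship ids "Has asso morph" and "Has finding site" are my assumption of how the morphology and site links are labeled, so verify them against your vocabulary version. The concept ids 1, 2, and 3 and the helper name `components` are made up.

```python
import sqlite3


def components(conn, concept_id, relationship_ids):
    """Return (relationship_id, target concept id) pairs for the given concept."""
    placeholders = ",".join("?" * len(relationship_ids))
    rows = conn.execute(
        "SELECT relationship_id, concept_id_2 FROM concept_relationship "
        f"WHERE concept_id_1 = ? AND relationship_id IN ({placeholders}) "
        "ORDER BY relationship_id, concept_id_2",
        (concept_id, *relationship_ids),
    ).fetchall()
    return [(rel, target) for rel, target in rows]


# Assumed relationship labels; check them in your own RELATIONSHIP table.
LINKS = ["Has asso morph", "Has finding site"]

# Toy table: 1 = pre-coordinated diagnosis, 2 = morphology, 3 = site (all hypothetical).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE concept_relationship ("
    "concept_id_1 INTEGER, relationship_id TEXT, concept_id_2 INTEGER)"
)
conn.executemany(
    "INSERT INTO concept_relationship VALUES (?, ?, ?)",
    [
        (1, "Has asso morph", 2),
        (1, "Has finding site", 3),
        (1, "Is a", 2),  # other relationships are ignored by the filter
    ],
)

linked = components(conn, 1, LINKS)
print(linked)
```

Running this over every descendant of the malignant parent and counting the concepts that come back with a morphology link, a site link, both, or neither would give a first read on coverage.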
Bad boy!!! Don’t do anything without your friends here! What did you end up using?