OpenREM issue #480: Create RDSR from Toshiba CT dose summary images and import

Issue #480 resolved

David Platten created an issue 2017-03-02

Create an RDSR from Toshiba CT dose summary images, update it using tags in the associated images, then import it into OpenREM.

Comments (82)

David Platten reporter
Initial commit for code to create RDSR from Toshiba dose summary image, and then update it with additional information contained in the associated images. Currently the code will create an RDSR and then add acquisition protocol names to each acquisition if they are missing from the initial RDSR but available in the CT image tags. References issue ~~#480~~

→ <<cset 174f83b33f47>>
- 2017-03-02T09:12:11+00:00
David Platten reporter
Initial commit for code to create RDSR from Toshiba dose summary image, and then update it with additional information contained in the associated images. Currently the code will create an RDSR and then add acquisition protocol names to each acquisition if they are missing from the initial RDSR but available in the CT image tags. References issue ~~#480~~

→ <<cset fc57a19ccca6>>
- 2017-03-02T09:23:29+00:00
David Platten reporter
@edmcdonagh. Apologies: at first I created this branch from the wrong starting branch (a 0.5 version), so I've had to remove it, and then add it again.
- 2017-03-02T09:25:15+00:00
Ed McDonagh
That would explain the odd message I had from QuantifiedCode referring to commits I couldn't find!
- 2017-03-02T09:32:40+00:00
David Platten reporter
Trying to update pitch, but the numeric value is in the wrong place at the moment. References issue ~~#480~~

→ <<cset 24787c25b826>>
- 2017-03-02T13:13:26+00:00

David Platten reporter

The pitch should have this construction (from a Siemens RDSR):

   ---------
   (0040, a010) Relationship Type                   CS: 'CONTAINS'
   (0040, a040) Value Type                          CS: 'NUM'
   (0040, a043)  Concept Name Code Sequence   1 item(s) ----
      (0008, 0100) Code Value                          SH: '113828'
      (0008, 0102) Coding Scheme Designator            SH: 'DCM'
      (0008, 0104) Code Meaning                        LO: 'Pitch Factor'
      ---------
   (0040, a300)  Measured Value Sequence   1 item(s) ----
      (0040, 08ea)  Measurement Units Code Sequence   1 item(s) ----
         (0008, 0100) Code Value                          SH: '{ratio}'
         (0008, 0102) Coding Scheme Designator            SH: 'UCUM'
         (0008, 0103) Coding Scheme Version               SH: '1.4'
         (0008, 0104) Code Meaning                        LO: 'ratio'
         ---------
      (0040, a30a) Numeric Value                       DS: '0.6'
      ---------
   ---------

My code currently produces this:

  ---------
  (0040, a010) Relationship Type                   CS: 'CONTAINS'
  (0040, a040) Value Type                          CS: 'NUM'
  (0040, a043)  Concept Name Code Sequence   1 item(s) ---- 
     (0008, 0100) Code Value                          SH: '113828'
     (0008, 0102) Coding Scheme Designator            SH: 'DCM'
     (0008, 0104) Code Meaning                        LO: 'Pitch Factor'
     ---------
  (0040, a300)  Measured Value Sequence   1 item(s) ---- 
     (0040, 08ea)  Measurement Units Code Sequence   1 item(s) ---- 
        (0008, 0100) Code Value                          SH: '{ratio}'
        (0008, 0102) Coding Scheme Designator            SH: 'UCUM'
        (0008, 0103) Coding Scheme Version               SH: '1.4'
        (0008, 0104) Code Meaning                        LO: 'ratio'
        ---------
     ---------
  (0040, a30a) Numeric Value                       DS: '0.844'
  ---------

The Numeric Value isn't in the right place. @edmcdonagh, can you help?

2017-03-02T13:16:53+00:00

Ed McDonagh

What about:

pitch_value = Dataset()
pitch_value.NumericValue = 0.6
measured_value_sequence = Sequence([measurement_units_container, pitch_value])

2017-03-02T13:45:18+00:00

David Platten reporter

That modifies things a bit, but it's still not right, as there are now two items in the Measured Value Sequence:

  ---------
  (0040, a010) Relationship Type                   CS: 'CONTAINS'
  (0040, a040) Value Type                          CS: 'NUM'
  (0040, a043)  Concept Name Code Sequence   1 item(s) ---- 
     (0008, 0100) Code Value                          SH: '113828'
     (0008, 0102) Coding Scheme Designator            SH: 'DCM'
     (0008, 0104) Code Meaning                        LO: 'Pitch Factor'
     ---------
  (0040, a300)  Measured Value Sequence   2 item(s) ---- 
     (0040, 08ea)  Measurement Units Code Sequence   1 item(s) ---- 
        (0008, 0100) Code Value                          SH: '{ratio}'
        (0008, 0102) Coding Scheme Designator            SH: 'UCUM'
        (0008, 0103) Coding Scheme Version               SH: '1.4'
        (0008, 0104) Code Meaning                        LO: 'ratio'
        ---------
     ---------
     (0040, a30a) Numeric Value                       DS: '0.844'
     ---------
  ---------

2017-03-02T14:29:50+00:00

Ed McDonagh

Your first attempt was nearly right. Just the NumericValue was one level too deep:

# Assumes Dataset and Sequence are already imported from pydicom, and that
# val and container2b are defined earlier in the surrounding routine.
# Create the first inner coding bit
coding = Dataset()
coding.CodeValue = '113828'
coding.CodingSchemeDesignator = "DCM"
coding.CodeMeaning = "Pitch Factor"
# Create the second inner coding bit
coding2 = Dataset()
coding2.CodeValue = '{ratio}'
coding2.CodingSchemeDesignator = "UCUM"
coding2.CodingSchemeVersion = "1.4"
coding2.CodeMeaning = "ratio"
measurement_units_container = Dataset()
measurement_units_container.MeasurementUnitsCodeSequence = Sequence([coding2])
# next line is the one that is changed: NumericValue sits in the same
# item as the units sequence
measurement_units_container.NumericValue = val
measured_value_sequence = Sequence([measurement_units_container])
# Create the outer container bit
pitch_container = Dataset()
pitch_container.RelationshipType = "CONTAINS"
pitch_container.ValueType = "NUM"
# Add the coding sequence into the container.
# Sequences are lists.
pitch_container.ConceptNameCodeSequence = Sequence([coding])
pitch_container.MeasuredValueSequence = measured_value_sequence
container2b.ContentSequence.append(pitch_container)

2017-03-02T14:48:12+00:00

Ed McDonagh

If I've made the same corrections in the code above as I did in my freehand shell version, you should get the same as I did:

>>> pitch_container
(0040, a010) Relationship Type                   CS: 'CONTAINS'
(0040, a040) Value Type                          CS: 'NUM'
(0040, a043)  Concept Name Code Sequence   1 item(s) ---- 
   (0008, 0100) Code Value                          SH: '113828'
   (0008, 0102) Coding Scheme Designator            SH: 'DCM'
   (0008, 0104) Code Meaning                        LO: 'Pitch Factor'
   ---------
(0040, a300)  Measured Value Sequence   1 item(s) ---- 
   (0040, 08ea)  Measurement Units Code Sequence   1 item(s) ---- 
      (0008, 0100) Code Value                          SH: '{ratio}'
      (0008, 0102) Coding Scheme Designator            SH: 'UCUM'
      (0008, 0103) Coding Scheme Version               SH: '1.4'
      (0008, 0104) Code Meaning                        LO: 'ratio'
      ---------
   (0040, a30a) Numeric Value                       DS: '0.844'
   ---------

2017-03-02T14:49:41+00:00
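The three layouts discussed above can be compared without any DICOM library. The following is only a sketch: plain dicts stand in for pydicom Datasets and lists for Sequences, and the helper names (`units_item`, `pitch_container`, `numeric_value_is_well_placed`) are made up for illustration. The keywords and code values are the ones shown in the dumps.

```python
# Plain dicts stand in for Datasets and lists for Sequences.
# This only illustrates nesting; it does not read or write DICOM.


def units_item():
    """One Measurement Units Code Sequence item for a ratio."""
    return {
        "CodeValue": "{ratio}",
        "CodingSchemeDesignator": "UCUM",
        "CodingSchemeVersion": "1.4",
        "CodeMeaning": "ratio",
    }


def pitch_container(value):
    """Build the layout shown in the Siemens RDSR dump."""
    measured_value = {
        "MeasurementUnitsCodeSequence": [units_item()],
        # Sibling of the units sequence, inside the same sequence item.
        "NumericValue": value,
    }
    return {
        "RelationshipType": "CONTAINS",
        "ValueType": "NUM",
        "ConceptNameCodeSequence": [{
            "CodeValue": "113828",
            "CodingSchemeDesignator": "DCM",
            "CodeMeaning": "Pitch Factor",
        }],
        "MeasuredValueSequence": [measured_value],
    }


def numeric_value_is_well_placed(container):
    """True when a single measured-value item holds both units and value."""
    items = container.get("MeasuredValueSequence", [])
    return (
        len(items) == 1
        and "NumericValue" in items[0]
        and "MeasurementUnitsCodeSequence" in items[0]
    )


good = pitch_container("0.844")

# First dump in the thread: value attached to the outer container.
outer_level = pitch_container("0.844")
del outer_level["MeasuredValueSequence"][0]["NumericValue"]
outer_level["NumericValue"] = "0.844"

# Second dump: value placed in its own, second sequence item.
two_items = pitch_container("0.844")
del two_items["MeasuredValueSequence"][0]["NumericValue"]
two_items["MeasuredValueSequence"].append({"NumericValue": "0.844"})

print(numeric_value_is_well_placed(good))         # True
print(numeric_value_is_well_placed(outer_level))  # False
print(numeric_value_is_well_placed(two_items))    # False
```

The point of the check is that Numeric Value and Measurement Units Code Sequence must share one item of the Measured Value Sequence, which is why passing two datasets to `Sequence([...])` produced "2 item(s)".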

David Platten reporter
Pitch value is now added if it is not present in the initial RDSR but is available from the image tag data. References issue ~~#480~~

→ <<cset a8fab866d21e>>
- 2017-03-03T10:01:19+00:00
David Platten reporter
@edmcdonagh, thanks for your help with the pitch.
- 2017-03-03T10:02:30+00:00
David Platten reporter
I'll add a "CT X-Ray Source Parameters" container to each "CT Acquisition Parameters" container next so that I can add kVp, exposure time per rotation etc. to the created RDSR.
- 2017-03-03T10:17:07+00:00
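A rough sketch of the "add the container, then add a value only if it is missing" idea follows. It is not the extractor's code: dicts and lists again stand in for pydicom Datasets and Sequences, the function names are hypothetical, and the coded entries are reduced to a Code Meaning (a real RDSR item also needs Code Value and Coding Scheme Designator).

```python
def find_child(parent, meaning):
    """Return the first child content item with this Code Meaning, or None."""
    for item in parent.setdefault("ContentSequence", []):
        names = item.get("ConceptNameCodeSequence", [])
        if names and names[0].get("CodeMeaning") == meaning:
            return item
    return None


def ensure_container(parent, meaning):
    """Find a child container by Code Meaning, creating it if absent."""
    child = find_child(parent, meaning)
    if child is None:
        child = {
            "RelationshipType": "CONTAINS",
            "ValueType": "CONTAINER",
            "ConceptNameCodeSequence": [{"CodeMeaning": meaning}],
            "ContentSequence": [],
        }
        parent["ContentSequence"].append(child)
    return child


def add_numeric_if_missing(parent, meaning, value, unit):
    """Add a NUM item unless one with this Code Meaning already exists."""
    if find_child(parent, meaning) is not None:
        return False
    parent["ContentSequence"].append({
        "RelationshipType": "CONTAINS",
        "ValueType": "NUM",
        "ConceptNameCodeSequence": [{"CodeMeaning": meaning}],
        "MeasuredValueSequence": [{
            "MeasurementUnitsCodeSequence": [{"CodeValue": unit}],
            "NumericValue": value,
        }],
    })
    return True


# Made-up example: an empty "CT Acquisition Parameters" container.
acquisition_parameters = {"ContentSequence": []}
source = ensure_container(acquisition_parameters, "CT X-Ray Source Parameters")
first = add_numeric_if_missing(source, "KVP", "120", "kV")
second = add_numeric_if_missing(source, "KVP", "100", "kV")
print(first, second, len(source["ContentSequence"]))  # True False 1
```

The second call is a no-op, so a value already present in the initial RDSR would not be overwritten by one taken from the image tags.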
Ed McDonagh
Good stuff
- 2017-03-03T10:21:09+00:00
David Platten reporter
kVp is now added if it is not present in the initial RDSR but is available from the image tag data. References issue ~~#480~~

→ <<cset 178b514a5f20>>
- 2017-03-08T18:17:56+00:00
David Platten reporter
Modified the file layout so that it can be called using Python.

Added import into OpenREM.

Adding the extra study information doesn't seem to work at the moment.

References issue ~~#480~~

→ <<cset 6271254faa42>>
- 2017-03-08T18:33:32+00:00
David Platten reporter
Updated how study-level information is found in the images and then used to update the initial RDSR. Need to see how this works with a clinical scan. References issue ~~#480~~.

→ <<cset cf10b607d783>>
- 2017-03-08T20:16:55+00:00
David Platten reporter
Need to store the folders for java.exe, dcmtk and pixelmed.jar somewhere in the system so that I don't have to keep changing them from home to work... Perhaps in local_settings.py?
- 2017-03-08T20:18:51+00:00
Ed McDonagh
Either in local_settings.py or in a singleton in the database. Probably easier to put it in local_settings.py for now and we can reconsider later.
- 2017-03-08T20:22:10+00:00
David Platten reporter
Put paths to various tools in local_settings.py.example. Import these into the Toshiba RDSR creation routine. Should make it easier when moving from one system to the next, and I think a good idea to keep the configuration in one place. References issue ~~#480~~.

→ <<cset faef3f382d06>>
- 2017-03-08T21:54:14+00:00
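As an illustration of keeping tool locations in one settings file, here is a minimal sketch. The setting names and paths are hypothetical examples, not necessarily those used in local_settings.py.example, and `missing_tools` is a made-up helper for checking a machine's configuration.

```python
import os

# Hypothetical names and example paths: adjust both to the local install.
JAVA_EXE = "C:/Apps/java/bin/java.exe"
DCMTK_PATH = "C:/Apps/dcmtk/bin"
PIXELMED_JAR = "C:/Apps/pixelmed/pixelmed.jar"


def missing_tools(paths):
    """Return the names whose configured path does not exist on this machine."""
    return sorted(name for name, path in paths.items() if not os.path.exists(path))


configured = {"java": JAVA_EXE, "dcmtk": DCMTK_PATH, "pixelmed": PIXELMED_JAR}
# Lists whichever tools are not found at the configured locations.
print(missing_tools(configured))
```

With the paths defined once, moving between systems means editing only this file rather than the extractor itself.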
David Platten reporter
kVp now added to RDSR objects created from Toshiba CT scan images and dose summary files. References issue ~~#480~~ and issue

→ <<cset 6c12eba8879f>>
- 2017-03-15T11:44:22+00:00
David Platten reporter
Requested procedure now obtained from two possible locations in the image data. Uniqueness of acquisitions improved by combining acquisition number with acquisition time. Included the Lua script that I am using to run this routine. Conquest is configured to run the script using an import converter like this:
```
ImportModality4 = CT
ImportConverter4 = process patient after 0 by openrem_import_ct.lua %p::%V0008,0070::%V0008,1090::%V0018,1020::%V0008,1010;
```
→ <<cset 9de77fc4ba9e>>
- 2017-03-15T17:11:47+00:00
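The idea of combining acquisition number with acquisition time can be sketched as a grouping key. This is an assumption about the general approach, not the extractor's code; the records are made-up dicts keyed by DICOM keywords, and the function names are hypothetical.

```python
from collections import defaultdict


def acquisition_key(image):
    """Combine acquisition number and time into one grouping key."""
    return (image.get("AcquisitionNumber"), image.get("AcquisitionTime"))


def group_by_acquisition(images):
    """Group image tag records that belong to the same acquisition."""
    groups = defaultdict(list)
    for image in images:
        groups[acquisition_key(image)].append(image)
    return dict(groups)


# Made-up example records.
images = [
    {"AcquisitionNumber": 1, "AcquisitionTime": "101500.000", "KVP": 120},
    {"AcquisitionNumber": 1, "AcquisitionTime": "101500.000", "KVP": 120},
    # Same acquisition number reused for a later acquisition.
    {"AcquisitionNumber": 1, "AcquisitionTime": "102230.000", "KVP": 100},
]
groups = group_by_acquisition(images)
print(len(groups))  # 2
```

Grouping on the number alone would merge all three records into one acquisition; the time component keeps them apart.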
David Platten reporter
I need to obtain the contents of "SoftwareVersions" (0018,1020) from the CT images and add it to the appropriate place in the RDSR.
- 2017-03-21T09:53:02+00:00
David Platten reporter
Updated routine to create RDSR from Toshiba dose summary image. This is now called from a script in the Scripts Python folder. The routine itself is now run using celery by including @shared_task before the routine. This fixes a problem that I was having where importing Toshiba CT data using this routine blocked the celery task queue. References issue ~~#480~~. I've also added an example dicom.ini file for Conquest, together with some Lua scripts that I am using to import data. I am now using Lua scripts in preference to Windows batch files. The current scripts contain some Windows-specific things, so aren't completely general at the moment. This references issue ~~#150~~

→ <<cset 2bbfc1f23d02>>
- 2017-04-04T10:33:27+00:00
David Platten reporter
Added the Toshiba CT RDSR creation routine to the extractor __init__.py file. Fixed an error in the extractor code. Made the python line in the script the same as in the other extractors. References issue ~~#480~~

→ <<cset 5256f5f64b81>>
- 2017-04-07T09:52:58+00:00
David Platten reporter
Small updates to Toshiba CT RDSR creation. Updated example dicom.ini files and associated Lua scripts. Also added example Windows PowerShell scripts that I am using to schedule query-retrieve from PACS. Also added example batch file that runs celery. Updated celery settings in settings.py to reflect what I am using successfully here. References issue ~~#480~~

→ <<cset 7cc8190761ce>>
- 2017-04-18T08:30:26+00:00
David Platten reporter
Reminder for myself:
- Add code to set whether to delete the folder of images or not, much like for the other extractors
- Remove the step that makes objects explicit VR little endian (isn't needed, slows things down)
- Remove DICOMDIR (isn't needed, slows things down)
- Check insertion of kVp data in to the initial RDSR
- 2017-06-01T09:56:56+00:00
David Platten reporter
Removed two stages of the extractor as they are not needed: creating DICOMDIR, which in turn required all DICOM objects to be explicit VR little endian. This also reduces the time required to create the RDSR. I've also altered the method of inserting the kVp, as I found that the previous method failed when encountering multiple axial acquisitions that use the same exposure factors. References issue ~~#480~~

→ <<cset 4bc2ee66b23a>>
- 2017-06-01T18:30:56+00:00
David Platten reporter
This extractor also works for GE LightSpeed Plus studies that include a dose summary image, although no acquisition protocol names are obtained.
- 2017-06-07T07:43:25+00:00
Ed McDonagh
Cool. Does it not get any protocol names? The pixelmed routine normally can get the overarching protocol name, like 5.13 abdomen blah blah, but not the series group names.
- 2017-06-07T08:10:32+00:00
David Platten reporter
Hi Ed. For the LightSpeed Plus in OpenREM I have:
- Accession number
- Study date and time
- Study description
- Requested procedure (same as study description in this case)
- Patient age
- Hospital
- Scanner make and model
- Study UID
- Total number of events
- Total DLP
For each scan component I have:
- Type
- CTDIvol
- DLP
- Scanning length
- 2017-06-07T08:54:18+00:00
David Platten reporter
Added extraction of exposure time per rotation. References issue ~~#480~~

→ <<cset edf33f10274c>>
- 2017-06-07T18:28:52+00:00
David Platten reporter
Removed some split lines and uncommented the rmtree command. References issue ~~#480~~

→ <<cset 08400c6ba296>>
- 2017-06-07T18:37:10+00:00
David Platten reporter
Added x-ray modulation type, nominal single collimation width and nominal total collimation width to the extractor. References issue ~~#480~~

→ <<cset 41d61a19de0e>>
- 2017-06-08T17:14:01+00:00
David Platten reporter
Added debug logging for the Toshiba RDSR creation extractor and removed quite a few print statements. References issue ~~#480~~

→ <<cset fe16e579f3d8>>
- 2017-06-09T15:40:39+00:00
David Platten reporter
Fixing error - didn't change name of store_file to extractor_file in settings.py. References issue ~~#480~~

→ <<cset 5e311897450c>>
- 2017-06-09T15:46:23+00:00
David Platten reporter
Updated Toshiba RDSR creation routine to combine multiple RDSRs together if there is more than one dose summary in the study. References issue ~~#480~~

→ <<cset 055ec10621d5>>
- 2017-09-04T16:34:07+00:00
David Platten reporter
I need to update the code that checks if there is more than one dose summary per study. At the moment it looks for multiple instances of series number 9000 (used by one of our Toshiba scanners). However, another of our Toshiba scanners uses series number 1000 for the dose summary objects, and a third scanner uses a series number in the normal range. This third system always has the text "SUMMARY" as part of the SeriesDescription, and this combined with an SOPClassUID of 1.2.840.10008.5.1.4.1.1.7 may make it identifiable as a dose summary.
- 2017-09-06T13:11:50+00:00
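The identification criteria described above might be combined along these lines. This is a heuristic sketch under stated assumptions: the function name is hypothetical, and requiring the Secondary Capture SOP class for every case is an assumption rather than something the thread confirms.

```python
# Secondary Capture Image Storage, as quoted in the comment above.
SECONDARY_CAPTURE_UID = "1.2.840.10008.5.1.4.1.1.7"
# Series numbers reported in the thread for two of the scanners.
DOSE_SUMMARY_SERIES_NUMBERS = {9000, 1000}


def looks_like_dose_summary(series_number, series_description, sop_class_uid):
    """Heuristic only: guess whether an object is a dose summary image."""
    if sop_class_uid != SECONDARY_CAPTURE_UID:
        return False
    if series_number in DOSE_SUMMARY_SERIES_NUMBERS:
        return True
    # Third scanner: normal series number, but "SUMMARY" in the description.
    return "SUMMARY" in (series_description or "").upper()


print(looks_like_dose_summary(9000, "", SECONDARY_CAPTURE_UID))                # True
print(looks_like_dose_summary(4, "DOSE SUMMARY", SECONDARY_CAPTURE_UID))       # True
print(looks_like_dose_summary(4, "Surface rendering", SECONDARY_CAPTURE_UID))  # False
```

A check on the description matters because other secondary capture objects, such as rendered snapshots, share the same SOP class.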
David Platten reporter
Added SoftwareVersions and DeviceSerialNumber to the created RDSR. Now looking for Secondary Capture Image Storage objects when seeing if there is more than one study contained within each study uid. References issue ~~#480~~

→ <<cset faad557c6588>>
- 2017-09-06T16:31:36+00:00
David Platten reporter
Added code to check that the secondary capture object is a dose summary, and not some other type of secondary capture. I've done this because some virtual colonoscopy studies include surface rendered snapshots as secondary capture. References issue ~~#480~~

→ <<cset b516d486edd5>>
- 2017-09-22T16:50:36+00:00
Ed McDonagh
First pass at a solution for @dplatten and ref ~~#480~~. Need to work out how to test it. Refs ~~#546~~

→ <<cset 49792cceb3b0>>
- 2017-10-02T21:40:47+00:00
Ed McDonagh
Hi @dplatten. Just wanted to check if this is still in future, or if it should be tidied up and documented for 0.8.0?
- 2017-10-13T17:52:05+00:00
David Platten reporter
I think it should be in 0.8.
- 2017-10-13T18:11:18+00:00
Ed McDonagh
- changed milestone to 0.8.0
- 2017-10-14T15:08:56+00:00
David Platten reporter
Tidied up the Toshiba extractor a little. No functional changes. References issue ~~#480~~

→ <<cset 729f232a8367>>
- 2017-10-17T07:48:30+00:00
David Platten reporter
Started to update documentation for the new Toshiba RDSR creation extractor. Need to add info on how to install dcmtk, java.exe and pixelmed.jar. References issue ~~#480~~

→ <<cset 59326ecf883c>>
- 2017-10-18T11:23:17+00:00
Ed McDonagh
http://docs.openrem.org/en/issue480toshibardsrcreation/
- 2017-10-18T11:59:25+00:00
David Platten reporter
Updated documentation a little to fix some layout issues. Need to add info on how to install dcmtk, java.exe and pixelmed.jar. References issue ~~#480~~

→ <<cset 0064415f6501>>
- 2017-10-18T12:33:10+00:00
David Platten reporter
Added documentation for java, dcmtk and pixelmed.jar. References issue ~~#480~~

→ <<cset d0338e95a978>>
- 2017-10-18T17:03:34+00:00
David Platten reporter
Correcting typo in pixelmed.jar link. References issue ~~#480~~

→ <<cset d950e2409060>>
- 2017-10-18T17:06:30+00:00
David Platten reporter
Added documentation for using a Windows PowerShell script to schedule a query-retrieve. References issue ~~#480~~

→ <<cset 0aa91fc9a02d>>
- 2017-10-18T17:28:40+00:00
David Platten reporter
Revising PowerShell wording. References issue ~~#480~~

→ <<cset e93621f0350f>>
- 2017-10-18T17:36:31+00:00
David Platten reporter
Amending Conquest dicom.ini file to use LittleEndianExplicit for file storage. References issue ~~#480~~

→ <<cset 6a47a1381d2c>>
- 2017-10-22T08:16:12+00:00
David Platten reporter
- changed status to resolved
This is now working as I would like it to.
- 2017-10-22T08:17:57+00:00
David Platten reporter
Addressing some codacy issues. References issue ~~#480~~

→ <<cset 61f0b9802aef>>
- 2017-10-22T08:40:54+00:00
Ed McDonagh
I need to think how these files are delivered to the user - I don't think I usually keep the stuff folder in the bundle that gets uploaded to pypi...
- 2017-10-23T08:01:25+00:00
Ed McDonagh
I haven't reviewed your docs for this by the way - maybe you've covered it - apologies if you have!
- 2017-10-23T08:02:06+00:00
David Platten reporter
I've not covered anything about what's in the stuff folder.
- 2017-10-23T14:28:37+00:00
Ed McDonagh
- changed status to open
Reopen until the pull request is merged. I need to review the docs and make sure it makes sense to me.
- 2017-10-24T16:12:12+00:00
Ed McDonagh
Fixing some of the codacy complaints. Refs ~~#480~~

→ <<cset b4d0e09c1cc2>>
- 2017-10-27T21:31:14+00:00
Ed McDonagh
Fixing some of the codacy complaints. Refs ~~#480~~

→ <<cset 6a75dac4eb8a>>
- 2017-10-27T21:43:13+00:00
Ed McDonagh
Adding some more to the docs. Refs ~~#480~~

→ <<cset 4e3712a390d7>>
- 2017-10-28T13:18:06+00:00
Ed McDonagh
Correcting links and italics. Refs ~~#480~~

→ <<cset 3346e231713a>>
- 2017-10-30T22:08:18+00:00
Ed McDonagh
Added entry to release docs. Refs ~~#480~~

→ <<cset 1f9ae7687d0d>>
- 2017-10-30T22:23:13+00:00
Ed McDonagh
Adding in hrefs. Refs ~~#480~~

→ <<cset feacad4fd1dc>>
- 2017-10-30T22:32:08+00:00
Ed McDonagh
Added some linux instructions to the release version initially. Refs ~~#480~~

→ <<cset d0f1bf7471b6>>
- 2017-10-31T09:24:28+00:00
Ed McDonagh
Minor corrections and copying across to the install doc. Refs ~~#480~~

→ <<cset 455647824f8f>>
- 2017-10-31T09:40:50+00:00
Ed McDonagh
Adding comment to suppress Codacy/Bandit from flagging this as an issue. Refs ~~#480~~

→ <<cset 699c6673fc5d>>
- 2017-10-31T17:51:37+00:00
Ed McDonagh
Adding usual copyright statement. Refs ~~#480~~

→ <<cset a17caf5db315>>
- 2017-10-31T17:51:37+00:00
Ed McDonagh
No real change - trying to get PR to update. Refs ~~#480~~

→ <<cset 79a50c4be3b4>>
- 2017-10-31T18:09:15+00:00
Ed McDonagh
Adding nosec to other calls. Refs ~~#480~~

→ <<cset c8f35f436879>>
- 2017-10-31T18:12:03+00:00
Ed McDonagh
Not sure how the # nosec lost a c... Refs ~~#480~~

→ <<cset 8c744ad1f683>>
- 2017-10-31T21:15:13+00:00
Ed McDonagh
- changed status to resolved
Merged in issue480ToshibaRDSRCreation (pull request #123)

Issue480ToshibaRDSRCreation

Fixes ~~#480~~. Refs ~~#552~~ Approved-by: Ed McDonagh ed@mcdonagh.org.uk

→ <<cset d644cc567a6f>>
- 2017-10-31T22:01:07+00:00
Ed McDonagh
I keep getting an openrem_extractor.log file created in the openrem folder, and I can't see why. Any ideas?
- 2017-11-01T22:09:52+00:00
David Platten reporter
Started to document Conquest configuration using lua, including forwarding Toshiba CT data to the RDSR creation importer. References issue ~~#480~~

→ <<cset b8df0e4a4016>>
- 2017-11-06T17:14:15+00:00
David Platten reporter
Some minor updates to the Conquest configuration document. References issue ~~#480~~

→ <<cset 0246acb38c9c>>
- 2017-11-06T17:33:22+00:00
David Platten reporter
Some minor updates to the Conquest configuration document. References issue ~~#480~~

→ <<cset ec6233414697>>
- 2017-11-06T17:36:17+00:00
Ed McDonagh
Thanks for doing this @dplatten. One you might want to edit:

"The above script depends on openrem_string_split are"
- 2017-11-06T17:46:27+00:00
David Platten reporter
Minor correction to Conquest documentation. References issue ~~#480~~

→ <<cset dd6faeae9cd2>>
- 2017-11-06T17:56:34+00:00
David Platten reporter
Updated Conquest configuration document and added new document containing a full example dicom.ini file. References issue ~~#480~~

→ <<cset 404b3e117d7c>>
- 2017-11-08T10:48:14+00:00
David Platten reporter
Added link to example Conquest dicom.ini file doc to the netdicom doc. References issue ~~#480~~

→ <<cset 472041b11403>>
- 2017-11-08T10:51:11+00:00
David Platten reporter
Correcting link. References issue ~~#480~~

→ <<cset 49c777e21d48>>
- 2017-11-08T10:53:25+00:00

Assignee: David Platten

Type: enhancement

Priority: minor

Status: resolved

Component: Import: CT

Milestone: 0.8.0

Votes: 0

Watchers: 1
