Hi Nalin,
First off, thank you for using the OpenCDISC Validator. We're very happy that you've found it useful, and we hope to continue improving it with each release.
Now, let me see if I can address some of your points:
Regards,
Tim
Tim,
For generating define.xml from XPT, will you kindly also consider allowing users to reuse the dataset label stored within the dataset files, in addition to the current method of, I think, drawing the information from config-sdtm-3.1.2.xml? The former could be the default behavior. (Side note: perhaps the same goes for variable labels, variable order, etc.)
Yes, it seems really useful to have a central control file for entering the metadata necessary to create a fully qualified define.xml, e.g., the relationships between variables, value-level metadata, and codelists; ordering; external dictionaries; etc.
Regards,
Anthony
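To illustrate the suggestion above about reusing the label stored in the dataset file, here is a minimal Python sketch that pulls dataset labels out of an XPT file. It is not how the OpenCDISC Validator works internally. It assumes the SAS XPORT version 5 layout (80-byte records, with the dataset name and the 40-character label in the two records that follow the DSCRPTR header record), and the function name read_xpt_dataset_labels is hypothetical.

```python
from typing import Dict

RECORD_SIZE = 80  # XPORT v5 files are organized in 80-byte records (assumed layout)
DSCRPTR_PREFIX = b"HEADER RECORD*******DSCRPTR HEADER RECORD!!!!!!!"


def read_xpt_dataset_labels(data: bytes) -> Dict[str, str]:
    """Return {dataset name: dataset label} for each member found in XPT bytes.

    Hypothetical helper: it scans record-aligned offsets for the DSCRPTR
    header and reads the two member header data records that follow it.
    """
    labels: Dict[str, str] = {}
    last_start = len(data) - 3 * RECORD_SIZE
    for offset in range(0, last_start + 1, RECORD_SIZE):
        if data[offset:offset + len(DSCRPTR_PREFIX)] != DSCRPTR_PREFIX:
            continue
        first = data[offset + RECORD_SIZE:offset + 2 * RECORD_SIZE]
        second = data[offset + 2 * RECORD_SIZE:offset + 3 * RECORD_SIZE]
        # First data record: bytes 8-15 hold the dataset name.
        name = first[8:16].decode("ascii", "replace").strip()
        # Second data record: bytes 32-71 hold the dataset label.
        label = second[32:72].decode("ascii", "replace").strip()
        labels[name] = label
    return labels
```

For a file on disk, the bytes could be passed in with something like read_xpt_dataset_labels(open("dm.xpt", "rb").read()), where dm.xpt is just a placeholder file name.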
Hello.
I wanted to start by saying that OpenCDISC is a great tool and does most of the heavy lifting for both SDTM validation and creation of define.xml... Great effort.
Having said that, I have the following questions/comments on the define.xml file:
1. For variables whose controlled terminology (e.g., country) comes from the IG, don't we need to have the name in the 'Controlled Terminology' column, hyperlinking to the 'Controlled Terminology' section in the define.xml, as with sponsor-defined CT? I see checks on the data against the standard CT (e.g., Country), but there is nothing in the 'Controlled Terminology' column of the variable-level metadata.
2. Don't we need to have the keys column populated in the dataset-level metadata? How can this be done?
3. I believe the 'Description' column of the dataset-level metadata section pulls the dataset labels from the SDTM IG. I was wondering if it would make sense to pull this information from the dataset metadata instead and have a check to confirm that the values provided match the IG. I am reviewing some SDTM datasets provided by a vendor, and they don't use the same dataset labels as the IG, so there is a disconnect between the define.xml and the actual data. A few more advantages of using the actual dataset metadata:
a. Currently, custom/sponsor-defined domains don't have any label in the define.xml.
b. SUPPXX datasets don't have any labels in the define.xml.
Thanks in advance.
Nalin
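The check proposed in point 3 above (take labels from the datasets themselves, then confirm they match the IG) could look roughly like the Python sketch below. The names EXPECTED_LABELS and check_dataset_labels are hypothetical, and the expected-label table is only a small excerpt for illustration; a real check would load the full list from the IG or a configuration file.

```python
from typing import Dict, List, Tuple

# Hypothetical excerpt of standard dataset labels, for illustration only.
EXPECTED_LABELS: Dict[str, str] = {
    "DM": "Demographics",
    "AE": "Adverse Events",
    "VS": "Vital Signs",
}


def check_dataset_labels(
    actual: Dict[str, str],
    expected: Dict[str, str] = EXPECTED_LABELS,
) -> List[Tuple[str, str, str]]:
    """Return (dataset, actual label, expected label) for every mismatch.

    Datasets without an expected label (custom domains, SUPP-- datasets)
    are skipped, so their own labels can be carried into define.xml as-is.
    """
    findings: List[Tuple[str, str, str]] = []
    for name, label in sorted(actual.items()):
        if name not in expected:
            continue  # no standard label to compare against
        if label.strip() != expected[name]:
            findings.append((name, label, expected[name]))
    return findings
```

With this approach the define.xml would always show what is actually in the data, and any departure from the IG would surface as a validation finding rather than being silently overwritten.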