Hello,
I have only found raw formats which are dumps of the cards, or just strings if the card uses some encoding. However I haven't found a format that could serve as long-term storage with archivistic purposes due to the lack of metadata and mechanisms of verification. For this reason I would like to propose what I have in mind.
Format PCD
The extension stands for "Punched Card Deck". It would be basically a zip with the extension changed. In the root of that file we would find a picture of the deck in JPEG format, a XML with the metadata related to the deck and a number of PCF files. The XML should contain a list of the PCFs as well as a CRC code to verify them, among other things. The order of the cards should also be specified and if there's a sequence pattern, also it should be noted in there.
Format PCF
The extension stands for "Punched Card File". It would also be another zip with the extension changed. It should contain a scan of the card in JPEG, an XML for the metadata and a binary file standing for the contents of the card. The binary should be in SIMH format (160 bytes) or in packed format (120 bytes). The XML should contain any information to describe the card, including the format, platform, sequence number if present, etc. There should also be a CRC for verifying the binary file.
As you can see, I have the structure and the parts altready thought. However, I would need help with the XMLs. I don't know how to define XML schemas and also would be great if someone could give me some hints about fields that could help describe it.
Thank you very much!
I have only found raw formats which are dumps of the cards, or just strings if the card uses some encoding. However I haven't found a format that could serve as long-term storage with archivistic purposes due to the lack of metadata and mechanisms of verification. For this reason I would like to propose what I have in mind.
Format PCD
The extension stands for "Punched Card Deck". It would be basically a zip with the extension changed. In the root of that file we would find a picture of the deck in JPEG format, a XML with the metadata related to the deck and a number of PCF files. The XML should contain a list of the PCFs as well as a CRC code to verify them, among other things. The order of the cards should also be specified and if there's a sequence pattern, also it should be noted in there.
Format PCF
The extension stands for "Punched Card File". It would also be another zip with the extension changed. It should contain a scan of the card in JPEG, an XML for the metadata and a binary file standing for the contents of the card. The binary should be in SIMH format (160 bytes) or in packed format (120 bytes). The XML should contain any information to describe the card, including the format, platform, sequence number if present, etc. There should also be a CRC for verifying the binary file.
As you can see, I have the structure and the parts altready thought. However, I would need help with the XMLs. I don't know how to define XML schemas and also would be great if someone could give me some hints about fields that could help describe it.
Thank you very much!