Due to the nature of their respective standards, media items that are on optical discs require distinct preservation workflows from other types of digital content. The following information presents workflows and tool examples that facilitate the preferred preservation deliverables of UW Libraries for these formats.
Format | Preferred Deliverable | Recommended Tools |
---|---|---|
CD-DA (Audio CD) | 44.1 kHz/16 Bit Single file (WAV or FLAC) with CUE Sheet | Exact Audio Copy, XLD |
DVD Video | ISO Disc Image File | Isobuster, ddrescue |
Audio CDs (known as CD-DA for Compact Disc Digital Audio) contain a raw stream of audio data with an index and thus can't be 'imaged' in the same way as other digital media. Also - due to compromises made between error correction and storage space, when migrating audio CDs it is important to use specialty software that is capable of multiple passes. (As an example - according to this paper by AVP, consumer players average about a 5% error rate, so a single pass would risk only accurately capturing 95% of the audio content per disc).
The following workflow will provide recommendations for suitable software, as well as brief notes on other CD-DA specific considerations.
Audio CDs should be migrated to a single file containing all audio information, and accompanied by a CUE Sheet file. This CUE Sheet is a plain text document that provides metadata such as in/out times for audio tracks, and naming information if present. (See sample CUE Sheet for an example).
Audio files should be migrated to a lossless format that preserves the original characteristics of the CD Source (16 bit, 44.1 kHz). UW Libraries prefers either uncompressed WAV files, or files losslessly compressed in the FLAC format.
The program Exact Audio Copy (EAC) is highly recommended for CD-DA migration. EAC is a Windows program that is free for non-commercial use. While it is native in the Windows system, it can be reliably run on both Linux and macOS via the WINE Compatibility Layer. ** Currently WINE does not work well with macOS 10.15, and EAC is unable to be used.
Upon first running EAC, it will need to be configured for use with your CD Drive. When you launch it, it will offer options to guide you through the process. A good guide for this process can be found through the Hydrogen Audio Wiki.
Upon inserting a CD, you will have the option of editing the metadata that will be stored in the CUE Sheet. At minimum, the Title
field should be assigned to the most appropriate identifier (often the item number for archival materials).
Once metadata entry is complete, to create the desired deliverables of a single audio file with associated CUE sheet, select Copy Image & Create CUE Sheet
from the Action
menu. Select Uncompressed
if generating WAV files, or Compressed
if you have set up EAC to capture FLAC.
While Exact Audio Copy can be used to split Wave files with CUE sheets into individual tracks (by selecting Split Wave by CUE Sheet
from the tools menu, it is recommended to use a different tool that enables compression for derivative generation.
One such tool that works well for this purpose is CUE Tools. Within CUE Tools, after selecting the desired input and output locations, set Action
to Encode
, Mode
to Tracks
and Audio Output
to Lossy
(while selecting mp3 from the dropdown menu). Pressing Go
will begin the splitting and transcoding of your FLAC or WAV master file into track based derivative files.
Different CD drives start their playback with slightly different offsets of samples. This means, that if it is desired to verify the integrity of a transfer by using checksums to compare against other transfers of the same disc (such as through a database like Accuraterip) this offset must be established and the transfer software must be calibrated accordingly.
If using Exact Audio Copy, the drive model information will appear in the upper left corner of the window:
This model number can then be used to search in the Accuraterip list of drive offsets.
Some CDs were created using frequency emphasis that must be accounted for and reversed on playback for correct sound. (This is analogous to the RIAA curve used on LPs and their need to be played back with a phono preamp). CD Players that are standards compliant will be able to detect emphasis, however this will not automatically occur during any migration. This is relatively rare, and occurs mostly on older CDs from the 1980s. Nonetheless, it should be considered when planning any migrations that might contain affected materials. Should an example of a CD containing pre-emphasis be desired, Can't Slow Down by Lionel Ritchie is available in the UW Libraries Collection.
To accurately preserve both the contents of the optical disk, as well as the intended presentation experience of the CD, if a CD contains pre-emphasis two versions of the content should be created. A 'flat' version of the CD should be migrated, with the appropriate flags indicating pre-emphasis contained in the associated CUE sheet. Any CDs reproduced from this combination will play correctly with de-emphasis applied. In addition to this, a digital listening copy should be created that has had a de-emphasis filter applied. For methods to do this, see Appendix A.
To capture the content of a video DVD, an ISO disc image file should be created (as opposed to loading the disc and then copying the VOB video files off). This allows for better handling of any errors in the disc, as well as facilitates both easier playback and derivative generation. If an ISO image of a DVD is played back using software such as VLC, not only the video but things like the menu will be viewable in a close approximation of their original format. Additionally ISO images can be used as inputs for tools such as Handbrake to create access MP4s as desired.
Note - some commercial DVDs contain anti-piracy technology that under the DMCA is illegal to circumvent. There are limited exceptions to this rule - before attempting to image any commercial DVDs please confirm the legal issues surrounding your project.
A single ISO disc image file should be created for each DVD. This ISO should be named according to a convention that allows it to be associated with the original item (for example Item_Number.iso).
- Readom
- DDRescue
- Sample command:
ddrescue -d -b 2048 -r4 -v /dev/sr0 mydisk.iso mydisk.log
- Sample command:
- Brasero
Isobuster:
First you must select which drive you will be capturing from. This can be done via the dropdown in the main window.
Next make adjustments (if desired) to the amount of times Isobuster will attempt to retry sections of the DVD that have read errors. This is under the Communication
tab in the Options
menu. Three or four retries is a reasonable amount - raising the number of retries too high can result in VERY long processing times.
Finally, to create the disk image, navigate according to the following image. File
-> Extract DVD <image>
-> User data (*.iso)
. Assign the file a uniqe name, and proceed!
ISO disc image files can be used as inputs for the popular DVD Ripping program Handbrake. To create an access MP4, open you ISO file in Handbrake in the same manner as a DVD.
There are several methods to apply a de-emphasis filter. If the CD is still available, ripping it via the Apple iTunes software will automatically create a de-emphasized version. If de-emphasis needs to be applied to an already existing file, the following are two options.
FFmpeg
The media manipulation tool FFmpeg contains an EQ filter that can reverse emphasis with the following command: ffmpeg -i input_file.wav -af aemphasis=type=cd deemphasized_file.wav
Command from ffmprovisr.
A de-emphasis curve can be added to the popular tool Audaciy, which can then be applied to a target file.
The following text should be added to the Audacity configuration file EQCurves.xml
<curve name="RedBookPre">
<point f="700.000135021151" d="0.000000000000"/>
<point f="1000.101652520304" d="-0.500512741089"/>
<point f="1950.142416511543" d="-1.398496627808"/>
<point f="3000.068564380608" d="-2.502575836182"/>
<point f="5000.000948851137" d="-5.004147338867"/>
<point f="10000.005928346302" d="-7.520062176514"/>
<point f="20000.322508988818" d="-9.504367828369"/>
</curve>
The file can then be edited in audacity and the new curve applied via the 'Equalization' effect.
Curve was created using the information in this paper by a user in the Audacity Forums.