Top Menu

How is the raw data archive organized?

The archive is a mirror of the data repository where all raw data enters the system. To view a file-tree structure of the archive, follow this link: https://rawdata.oceanobservatories.org/files/.tree.html

For uncabled mooring data (any platform not attached to the electro-optical cable on the west coast) that is a server organized by deployment/recovery number for each uncabled platform (e.g. D00001/R00001), with subfolders for each node (the control computer to which the instruments are attached), containing a subfolder for each attached instrument which contains all data for that instrument from that deployment.

e.g. https://rawdata.oceanobservatories.org/files > CE01ISSM > D00001 > dcl16 > flort

Uncabled mobile assets (aka. Gliders and AUVs) are organized similarly, but the subfolders below the deployment/recovery number are organized based on the glider’s internal science computer file structure. The telemetered data folders are titled D0000*, and contain subfolders called archive, from-glider, logs, and merged-from-glider. The recovered data folders (R0000*) are titled cache, merged, dvl, flight, and science. In both cases, the “merged” folders contain most of the science instrument and engineering data of interest, although the glider ADCP data is contained in the “dvl” subfolder.

Cabled data are currently being pulled from two archives. Most cabled data is organized by node, which refers to the alphanumeric ID of the junction box attached directly to the undersea cable that aggregates, time-stamps, and routes the data from all instruments connected to that node. Certain high-bandwidth instruments (like hydrophones, HD video, and sonar systems) are organized by date (yyyy/mm). Eventually, all cabled data will be pulled from the same port agent log archive, and will be organized by date.

— Last revised on August 15, 2016 —