Data Collection and Transmission to the OOI Cyberinfrastructure
Data are gathered by both cabled and un-cabled (wireless) instruments located across multiple research stations in the Pacific and Atlantic oceans. Once acquired, the raw data (consisting mostly of tables of raw instrument values – counts, volts, etc.) are transmitted to one of three operations centers: Pacific City, directly connected via fiber optic cable to all cabled instruments in the Cabled Array; Oregon State University (OSU), an Operational Management Center (OMC) responsible for all un-cabled instrument data on the Pacific coast; and Woods Hole Oceanographic Institute (WHOI), the OMC for Atlantic coast-based un-cabled instrument data. The data from the operations centers is transferred to the OOI Cyberinfrastructure for processing, storage and dissemination.
Data Management, Storage, and Processing
Two primary cyberinfrastructure (CI) centers operated by the Rutgers Discovery Informatics Institute (RDI2) are dedicated to OOI data management: the West Coast CI in Portland, Oregon, and the East Coast CI, at Rutgers University. While data from the Cabled Array components are initially received at the Shore Station in Washington, it is the East Coast CI that houses the primary computing servers, data storage and backup, and front-facing CI portal access point, all of which are then mirrored to the West Coast CI over a high-bandwidth Internet2 network link provisioned by MAGPI (Mid-Atlantic GigaPOP in Philadelphia) on the east coast and PNWGP (Pacific-Northwest GigaPOP) on the west coast. The data stores at the OMCs at OSU and WHOI are continuously synchronized with the data repositories located at the East and West Coast CI sites.
Data Safety & Integrity
Data safety and protection is ensured in two ways: data security and data integrity.
- Data security is addressed through the use of a robust and resilient network architecture that employs redundant, highly available next-generation firewalls along with secure virtual private networks (VPN).
- Data integrity is managed through a robust and resilient Information Life-cycle Management (ILM) architecture that integrates redundant enterprise storage area network (SAN) (disk-based) and a robotic library (tape-based).
- SAN, an enterprise-level storage network of multiple hard drives managed by an intelligent device manager, reduces the data footprint by reducing data duplication while maintaining data integrity and access performance through storage redundancy.
- Tape storage, a “last tier” storage that is not dependent on power or cooling, supports longer-term backup and archiving, disaster recovery, and data transport.
Public Data Access
The OOINet Ecosystem employs the uFrame Service-Oriented Architecture (SOA) software framework that processes the raw data and presents it in visually meaningful and comprehensible ways in response to user queries. Users can access this software over the Internet through the CI portal access point hosted at Rutgers.