WIP
This page is still work in progress.
The CTA Tape Lifecycle¶
Introduction¶
In CTA, each tape information is stored in the CTA Catalogue and are managed by the mean of cta-admin
command.
Different tape states and supported state transitions¶
What can be done on each final state¶
State | Queue user read requests | Queue user write requests | Queue repack requests | Queue repack read sub-requests (*) | Queue repack write sub-request(*) | Mountable | Reclaimable |
---|---|---|---|---|---|---|---|
ACTIVE | YES | YES | NO | NO | YES | YES | YES |
DISABLED | YES | YES | NO | NO | YES | NO | YES |
REPACKING | NO | NO | YES | YES | NO | YES | NO |
REPACKING_DISABLED | NO | NO | YES | YES | NO | NO | NO |
BROKEN | NO | NO | NO | NO | NO | NO | NO |
EXPORTED | NO | NO | NO | NO | NO | NO | NO |
(*) Repack sub-request queueing is handled internally by the maintenance process.
Tape states explained¶
- ACTIVE:
- A tape that is
ACTIVE
is a tape that is in good condition to allow a user to read data from it or to write data to it. A newly added tape to CTA will beACTIVE
by default. - DISABLED:
- A tape that is
DISABLED
cannot be mounted, but can still have retrieve requests queued to it. - Can be set for the following reasons:
- A tape server disabled it after encountering some issue that lead to failure (example: failed dismount).
- Monitoring probes decided to disable it (too many errors over the past XX hours,...).
- An operator think the tape is in bad shape and must be kept away from users while it is investigated.
- A tape should not stay in this state for more than 1 week, person on rota to follow up on tapes disabled for longer than one week.
- REPACKING:
- A tape should be moved to
REPACKING
state before the operator submits a repack request. Otherwise, the repack request won't be accepted. - Likewise, it's not possible to move out of
REPACKING
while a repack request is ongoing (except for theREPACKING_DISABLED
state). - When a change to
REPACKING
is requested, the tape will first move to the temporary stateREPACKING_PENDING
. Then, it waits for the maintenance process to clean all user requests on the tape queue, before finally moving it toREPACKING
state. - REPACKING_DISABLED:
- The state
REPACKING_DISABLED
is similar toDISABLED
, but for repacking tapes (we can't move out of repacking states while a repack is ongoing). - A tape should not stay in this state for more than 1 week, person on rota to follow up on tapes disabled for longer than one week.
- New repack requests can be queued on a
REPACKING_DISABLED
tape. However, no tape will be mounted while it's on this state. - BROKEN:
- A tape can stay in this state for long and it is very likely that it is its very last state.
- When a change to
BROKEN
is requested, the tape will first move to the temporary stateBROKEN_PENDING
. Then, it waits for the maintenance process to clean all user requests on the tape queue, before finally moving it toBROKEN
state. This guarantees that all requests are properly disposed of. - Can be set for the following reasons:
- A problematic (earlier
DISABLED
) tape that requires non-trivial efforts for its data to be recovered: - low level slow tape extract is needed
- sent for data recovery to the vendor = the tape is not present in the library
- all recovery attempts exhausted, the tape is permanently broken but experiment action is needed (delete the lost files from the catalogs)
- In rare cases an
ACTIVE
tape that fall on the floor because of a gripper incident can move fromACTIVE
straight toBROKEN
as it must be physically put back in a slot.
- A problematic (earlier
- Normally, very few tapes are in the BROKEN state.
- No operations are allowed for a
BROKEN
tape. - EXPORTED:
- A tape can stay in this state for long. It means that the tape was removed from the tape library.
- While we at CERN we do not remove tape cartridges from tape libraries, other sites do. Therefore, the
EXPORTED
state should help to distinguish that. - This state would behave very similarly to
BROKEN
state. The difference is on the error messages reported to the user.