Seed data refresh - OregonDigital/hyrax-migrator GitHub Wiki

  1. Setup: create OD collections and admin sets:
bundle exec rake oregon_digital:create_admin_sets
bundle exec rake oregon_digital:create_collections
  1. Transfer tar.gz from OD1 to OD2:
# seed data bags in OD1

$ ls -la /data1/batch/exports/*_seed_data.tar.gz
Baseball_jpegs_seed_data.tar.gz
Campus_Maps_seed_data.tar.gz
Century_Farm_and_Ranch_applications_seed_data.tar.gz
Harriets_Photograph_Collection_seed_data.tar.gz
Ken_Gray_Insect_Image_Collection_seed_data.tar.gz
Lichens_of_the_Pacific_Northwest_seed_data.tar.gz
Oregon_Multicultural_Archives_Asian_and_Pacific_Cultural_Center_Album_1_seed_data.tar.gz
Oregon_Multicultural_Archives_Assorted_seed_data.tar.gz
Oregon_State_University_Sports_Media_Guides_seed_data.tar.gz
Oregon_State_University_Yearbooks_seed_data.tar.gz
OSU_Student_Protest_and_Underground_Publications_seed_data.tar.gz
Pacific_Northwest_Stream_Survey_seed_data.tar.gz
SCARC_Audio_files_seed_data.tar.gz
The_Daily_Barometer_seed_data.tar.gz
# transfer Baseball_jpegs_seed_data.tar.gz from OD1 to OD2

kubectl exec -it <workers-pod> bash
cd tmp/shared
scp deploy-rails@<OD1-worker>:/data1/batch/exports/Baseball_jpegs_seed_data.tar.gz .
  1. Decompress tar file
kubectl exec -it <workers-pod> bash
cd tmp/shared
tar xvzf Baseball_jpegs_seed_data.tar.gz
ls -la Baseball_jpegs_seed_data_bags
  1. Zip pids in bag folder
cd tmp/shared
vim zip_bags.sh

#!/bin/bash
cd $1
for assetdir in *
do
  if [ -d $assetdir ]
  then
    cd $assetdir
    zip -r ../$assetdir.zip .
    cd ..
    rm -rf $assetdir
  fi
done
chmod +x zip_bags.sh
./zip_bags.sh Baseball_jpegs_seed_data_bags
  1. Run migrator
kubectl exec -it <workers-pod> bash
bundle exec rails c
i = Hyrax::Migrator::Services::BagIngestService.new(['Baseball_jpegs_seed_data_bags'], Hyrax::Migrator.config)
i.ingest
⚠️ **GitHub.com Fallback** ⚠️