Working with documents - Paperight/website GitHub Wiki
Processing documents
Once a set of documents has been received, follow this process:
Place documents in the 'working-folder'
All files need to be placed in the Dropbox working folder. Within each publisher's folder, create a source folder, and place the PDF, EPUB, or HTML files received from the publisher in that folder. For more on folder management, see the naming conventions section below.
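The step above can be sketched as a small script. This is only an illustration under stated assumptions: the folder is assumed to be literally named `source`, and `place_in_source`, `working_folder`, and the publisher name are hypothetical names, not part of any Paperight tooling.

```python
import shutil
from pathlib import Path

# File types the page says publishers send (PDF, EPUB, or HTML).
ACCEPTED_SUFFIXES = {".pdf", ".epub", ".html"}


def place_in_source(working_folder: Path, publisher: str, received: list[Path]) -> list[Path]:
    """Copy received files into <working_folder>/<publisher>/source.

    Assumption: the source folder is named "source"; adjust to the
    folder-naming conventions actually in use.
    """
    source = working_folder / publisher / "source"
    source.mkdir(parents=True, exist_ok=True)  # create publisher and source folders if missing
    copied = []
    for path in received:
        # Skip anything that is not a PDF, EPUB, or HTML file.
        if path.suffix.lower() in ACCEPTED_SUFFIXES:
            copied.append(Path(shutil.copy2(path, source / path.name)))
    return copied
```

Copying rather than moving leaves the publisher's original delivery untouched, which may or may not suit your workflow.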
Extract cover images
Extract the cover files, save them as JPEGs, and rename them (see naming conventions below).
Convert documents
Create Paperight PDFs for each document. You can find a detailed explanation of how to do this in the wiki article on 'Document creation'.
Create a CSV
Create a .csv file. For more on this, read the wiki article on 'CSV & Metadata management'.
Upload documents
Manual upload
Upload the Paperight PDFs to the FTP site:
- Open FileZilla and log in to the Paperight SFTP with these details.
- Click on the PDF folder (make sure you get the correct one).
- Drag and drop the files into the appropriate folder.
- Click on jacket_images and drag and drop the cover images into that folder.
- You can now either upload a .csv file or add the products by individual upload on the server.
Automated upload
Via HTML
Upload the scrubbed HTML to the FTP site:
- Open FileZilla and log in to the Paperight SFTP with these details.
- Click on the HTML folder (make sure you get the correct one).
- Drag and drop the files into the appropriate folder.
- Click on jacket_images and drag and drop the cover images into that folder.
- Open the Media FTP in FileZilla in a new tab.
- Drag and drop the cover images into /public_html/content/jacket_images.
- Convert documents using the automated HTML-PDF converter.
Via PDF
- Open FileZilla and log in to the Paperight SFTP with these details.
- Click on jacket_images and drag and drop the cover images into that folder.
- Convert documents using the automated PDF-PDF converter.
Document archiving
After an upload of books to the site has been completed, the files relating to those books that are in the working folder must be archived. Archive them in two places: in the content-archive folder in the Content Share section of Dropbox, and on the external hard drive.
Make sure that:
- All of the cover images are saved in the uploaded-cover-images folder, and in Dropbox/Paperight Admin Share/promotion/materials/covers.
- All Paperight PDFs (exported from InDesign) are saved in the uploaded-paperight-pdfs folder.
- All HTML is saved in the source-html folder.
- All cropped PDFs are saved in the source-pdf folder.
- The CSV you used to do the upload is saved in the _uploaded-csv-files folder.
- The metadata spreadsheets are saved in the metadata folder.
- The source folder and InDesign document (in the case of a manual doc creation) are saved together in the product-files folder.
Repeat this process first on the external hard drive, and then in Dropbox.
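The checklist above could be verified with a short script before the working folder is cleared. This is a minimal sketch, assuming all of the listed folders sit directly under a single archive root; that layout, and the names `ARCHIVE_FOLDERS` and `missing_or_empty`, are assumptions for illustration rather than a description of the real archive structure.

```python
from pathlib import Path

# Folder names taken from the checklist above. Where they live inside the
# archive is an assumption; here they are all direct children of one root.
ARCHIVE_FOLDERS = [
    "uploaded-cover-images",
    "uploaded-paperight-pdfs",
    "source-html",
    "source-pdf",
    "_uploaded-csv-files",
    "metadata",
    "product-files",
]


def missing_or_empty(archive_root: Path) -> list[str]:
    """Return the checklist folders that do not exist or contain nothing."""
    problems = []
    for name in ARCHIVE_FOLDERS:
        folder = archive_root / name
        if not folder.is_dir() or not any(folder.iterdir()):
            problems.append(name)
    return problems
```

Run it once against the external hard drive and once against the Dropbox archive. An empty list only shows that each folder holds something, not that every file from the upload was archived. Some folders may legitimately stay empty, for example source-html after a PDF-only upload.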
File/folder naming conventions
File-naming conventions matter whenever you update, edit, or add a document in the Paperight shared folder (or create any Paperight document). There are two conventions: one is used for working files and the other for document and book files. In the content team we prefer the latter, which is based on hyphenation, e.g. bleak-house_charles-dickens_20120126.pdf.
First, read up about folder structure in the wiki article on 'Folder-naming conventions'.
Then, read up on file naming in the wiki article on 'File-naming conventions'.
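As an illustration of the hyphenated convention, the sketch below builds a name shaped like the bleak-house_charles-dickens_20120126.pdf example. It assumes the pattern is title, author, and a YYYYMMDD date joined by underscores, which is inferred from that one example; the 'File-naming conventions' article remains the authority. The helper names `slugify` and `document_filename` are hypothetical.

```python
import re
import unicodedata
from datetime import date


def slugify(text: str) -> str:
    """Lowercase the text, drop accents and apostrophes, and hyphenate the words."""
    ascii_text = unicodedata.normalize("NFKD", text).encode("ascii", "ignore").decode("ascii")
    ascii_text = ascii_text.replace("'", "")
    # Collapse every run of non-alphanumeric characters into a single hyphen.
    return re.sub(r"[^a-z0-9]+", "-", ascii_text.lower()).strip("-")


def document_filename(title: str, author: str, on: date, ext: str = "pdf") -> str:
    """Assumed pattern: <title>_<author>_<YYYYMMDD>.<ext>."""
    return f"{slugify(title)}_{slugify(author)}_{on:%Y%m%d}.{ext}"


print(document_filename("Bleak House", "Charles Dickens", date(2012, 1, 26)))
# prints: bleak-house_charles-dickens_20120126.pdf
```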