Wikisource Loves Manuscripts/Technical Guidelines
Wikisource Loves Manuscripts is a call-to-action and a project to support the digitization of manuscripts on Wikisource.
This page contains technical guidelines compiled by the Wikisource Loves Manuscripts team. The following document is an initial publication of a manuscript digitization guide based on a case study in Indonesia. It should be broadly relevant to the condition of your manuscript collection and the equipment you currently have. However, this guide is not static: if it differs from your conditions and needs, adjust it accordingly. We look forward to your feedback for the further improvement of this guide.
General Workflow
In general, the process and workflow of the Wikisource Loves Manuscripts program in its efforts to preserve manuscripts in Indonesia are as follows.
Assessment and Agreement
The first step that must be taken in efforts to preserve manuscripts is assessment. This activity includes identification and initial research on the potential for a manuscript collection in a certain area. It aims to map the initial conditions of a collection, the necessary action plans, and the urgency of digitizing the manuscripts of a collection.
In addition, it also aims to obtain permission from the owner of the manuscript collection. Permission must be obtained with a sense of mutual trust between the researcher or assessor and the owner of the manuscript. It is important to emphasize that the Wikisource Loves Manuscripts program will not transfer ownership of the physical collection. The entire series of digitization activities is carried out at the place where the manuscript collection is kept by the owner. Permission from the owner of the manuscript collection is indicated by a statement of willingness to digitize and open access to the manuscript.
Digitizing Mission
After the owner has granted permission for manuscript digitization and open access, the team can start converting the manuscripts into digital format. Each page of the manuscript is photographed in accordance with the procedures stated in this guide.
Metadata Filling
Capturing metadata, both about the physical manuscript and about its text, is an integral part of the digitization process. The manuscript metadata of a collection is captured by an academic expert who understands the character of the manuscript collection and its cultural background. This ensures the accuracy of the manuscript data presented and makes further research easier for readers.
Digital Manuscripts Distribution to the Owners
After all manuscripts have been transferred to digital format and the metadata is complete, the digitized copies are distributed to the owner of the manuscript collection. The program also works with local manuscript preservation communities, so that the digitization team and manuscript owners can jointly care for the collection through its digital files and make the best use of them. At its core, the Wikisource Loves Manuscripts program is an effort to help manuscript owners care for and preserve their manuscripts, which is why all digitization results are also made available to the owners and to the community around the collection.
Images and Metadata Quality Control
An equally important process in digitizing manuscripts is quality control. This work is carried out by two parties: the digitization team in the field and the Wikisource Loves Manuscripts data manager team in Jakarta. It ensures that no errors were made while capturing the manuscripts and adding the metadata.
Images Conversion
After the validity of all digitized manuscripts and metadata has been confirmed, the .JPG/.RAW image files are converted into a form suitable for the Wikimedia Commons platform. This conversion process is carried out by the Wikimedian-in-Residence or a community member.
Images and Metadata Uploading
The next step is uploading the resulting digitized manuscript images and metadata to Wikimedia Commons and Wikidata. After the file upload process is complete, digital manuscripts and their metadata can be accessed openly by all groups via the Wikisource Loves Manuscripts web page.
Proofread-a-thon and Community Engagement
Good digitization results must be matched by good use of them by all stakeholders. One of the ways the Wikisource Loves Manuscripts program campaigns for the preservation and use of manuscripts is therefore the Proofread-a-thon. It engages the Wikimedia community and manuscript activists in developing a digital manuscript-based research infrastructure by transcribing the digitized manuscripts.
Assessment Process
One of the early stages in implementing the digitization of manuscripts in Indonesia is an assessment activity aimed at collecting initial data and understanding the condition of manuscripts in a collection. Several things that need to be considered in WiLMa assessment activities are as follows.
Collection Location Mapping
In an effort to save manuscripts, mapping of collection locations is very important. Assessors who wish to review a collection should carry out simple research and collect as much information as possible about the location and the collection from available sources. This information is very useful for establishing communication with the owner of the manuscript collection targeted for digitization.
Application for Access Permits and Digitization Activities
After obtaining information about the collection and its surroundings, the assessor is advised to build a relationship with the owner of the manuscript collection or with people who can connect the assessor with the owner. Once good relations and communication are established, the assessor submits an application to the owner for permission to carry out the activities needed to save the manuscripts of the collection.
Applications for permission must take a humane approach, without the slightest coercion or intimidation towards the owner of the manuscript. The assessor must not promise the owner unrealistic material rewards or anything else outside the assessor's authority, to avoid misunderstandings in the future.
Some important things that need to be conveyed in the process of submitting an application to rescue manuscripts by Wikisource Loves Manuscripts in Indonesia are as follows:
- The Wikisource Loves Manuscripts in Indonesia activity is an effort to document manuscripts in Indonesia by digitizing manuscripts into digital photo formats with the aim of opening the widest possible access to information regarding manuscripts in a collection.
- Wikisource Loves Manuscripts in Indonesia does not take or transfer ownership of digitized manuscripts to any party. Ownership of the manuscript remains unchanged.
- All collection owners who wish to participate in the rescue of the manuscript collection organized by Wikisource Loves Manuscripts in Indonesia are entitled to a copy of the digitized collection.
First Handling on the Collections
After the owner of the manuscript collection allows access and digitization activities, the assessor is required to carry out an initial inventory of the collection and to give first aid to manuscripts that are in an alarming condition. First aid includes carrying out simple cleaning, separating manuscripts damaged by termites or other insects, providing additional materials that can repel paper-destroying insects, and grouping the manuscripts to be digitized.
Assessment and Agreement Form
During the assessment, the assessor is required to fill out a form containing a brief description of the collection, the number of items in the collection, and the background of the owner of the manuscripts. The form, filled in according to the results of the assessor's visit, is used as a reference in the manuscript rescue and digitization activities carried out on the collection by Wikisource Loves Manuscripts in Indonesia. The assessment form can be accessed via the following link:
- Printed form: See it here
- Online form: can be made with any online application facilities
After completing the assessment form, the assessor asks the owner of the manuscripts to sign the document of willingness to digitize. This document states that the owner is willing to have their collection of manuscripts fully digitized and that the results will be openly accessible to the public (open access) through the Wiki platform. The draft of the agreement letter for digitization can be seen here.
The entire series of assessment activities for a collection of manuscripts aims to open up the involvement of all parties in the preservation of manuscripts through an approach that prioritizes human aspects. In this way, awareness of and responsibility for preserving manuscripts can be shared collectively and continuously, and the information contained in these manuscripts can be sustained and benefit society.
Tools and Equipment for Digitizing Manuscripts
Manuscript digitization demands the best image quality, so suitable technical equipment is needed. According to the Wikisource Loves Manuscripts digitization standards, each digitizing team carries one set of instruments that includes the following items.
Camera and Lens
The camera captures and stores pictures in digital format. The recommended camera type is a DSLR (Digital Single-Lens Reflex) camera, which has advantages in framing, focus, and exposure. This type of camera also fully supports control from a computer, making it easier for photographers to check the quality of each image produced.
The Wikisource Loves Manuscripts team was equipped with a camera and lens with the following specifications:
Camera: Canon EOS 6D Mark II
- 26.2MP Full-Frame CMOS Sensor
- DIGIC 7 Image Processor
- 45-Point All-Cross Type AF System
- Full HD Video at 60 fps; Digital IS
- 3″ 1.04m-Dot Vari-Angle Touchscreen LCD
- Dual Pixel CMOS AF and Movie Servo AF
- Native ISO 40000, Expanded to ISO 102400
- 5 fps Shooting; Time-Lapse & HDR Movie
- Built-In GPS, Bluetooth & Wi-Fi with NFC
- Dust and Water Resistant; SD Card Slot
<image here>
Lens: Canon EF 50mm f/1.4 USM
- Aperture Maximum: f/1.4 Minimum: f/22
- Two High Refraction Lens Elements
- Ultrasonic Autofocus Motor
- Manual Focus Override
- Focal length 50 mm
- Minimum Focus Distance 1.5' (45 cm)
<image here>
Lighting unit
In photography, light plays an important role in producing the best images. This also applies to the digitization of manuscripts, which demands high image quality so that users can read and access the manuscripts comfortably. Good lighting is therefore essential. Wikisource Loves Manuscripts in Indonesia uses flash photography in its manuscript digitization activities. One tool that can be used is the Godox Mini Pioneer Studio Flash package with the following specifications.
- Flash Output: 160 WS
- Guide Number (ISO 100): 43
- Color Temperature: 5600±200K
- Power Voltage: AC220V 50HZ / AC100-120V 60HZ
- Flash Power Control: Stepless
- Modeling Lamp: 75W
- Recharging Time: 0.5-2s
- Flash Triggering Method: Sync cord, Slave, Test button, Flash trigger
- Triggering Voltage: 5V / DC
- Flash Duration: 1/2000-1/800s
- Fuse: 5A
<image here>
Copy stands and Tripods
Copy stands stabilize the camera during digitization. They also have horizontal arms that can be adjusted to the position of the object. A copy stand with a 70 cm column, combined with a 50 mm macro lens on a DSLR camera, covers an area from 5 x 7.5 cm up to 28 x 42 cm.
<image here>
Computing Devices
Digitizing manuscripts requires a capable computing device to control the camera and process the resulting images. The recommended computer is therefore one that meets the requirements of image processing software, with the following minimum specifications.
- OS: 64-bit Windows 10 or equivalent
- Processor (AMD): Athlon X4 880K @4GHz or equivalent
- Processor (Intel): Core i3-6100 @3.7GHz or equivalent
- Memory/RAM: 4 GB
- Graphics card (AMD): Radeon HD 7850 or equivalent
- Graphics card (NVIDIA): GeForce GTX 660 or equivalent
- Hard-drive space: 50 GB
The computer equipment used by the Wikisource Loves Manuscripts digitization team is as follows.
Laptop: HP Pavilion Gaming 15
- OS: 64-bit Windows 11
- Processor (Intel): Core i5-11300H up to 4.4 GHz
- Random Access Memory: 8 GB
- NVIDIA® GeForce RTX™ 3050 Laptop GPU (4 GB GDDR6 dedicated)
- Internal hard-drive space: 1000 GB
<image here>
A laptop with gaming specifications was chosen because it can process large files, thanks to components such as a built-in graphics card, a recent-generation processor, an optimized cooling system, and capable storage.
Color checker
Because digitizing manuscripts is a photographic activity, the images produced must have high color accuracy. One tool that can be used to measure the color accuracy of digitized images is a color checker. Wikisource Loves Manuscripts in Indonesia uses the QPCard 101 color checker, cut to a custom size so that it does not take up too much of the image field.
<image here>
Storages
To store digitized manuscript data, three storage devices are needed: one SD card for the DSLR camera and two external hard drives with a capacity of 2 TB (terabytes) each. The Wikisource Loves Manuscripts team is equipped with storage devices with the following specifications.
Camera memory card
- Sandisk Extreme Pro SDXC UHS-I U3 V30
- 64 Gigabytes storage
- Read Up To 170MB/s/ Write Up To 90MB/s
- Video Speed Class V30
- 4K Ultra HD (3840 x 2160p); Full HD (1920 x 1080p)
- Compatible with SDXC and SDXC-I (UHS-I) enabled host devices
- Shockproof, X-ray proof, and Waterproof
<image here>
External disk drive
- Capacity: 4 TB
- Connectivity: USB 3.0, USB 2.0
- Format: exFAT (Windows and Mac)
Software
To operate the camera and process the images digitized from manuscripts, at least two main pieces of software are needed: a remote shooting application and an image processing application that match the camera brand used. If you are using a Canon camera, you need Canon Digital Photo Professional 4 and EOS Utility installed on your computer; if you are using a Nikon camera, you need Camera Control Pro 2.
<image here>
Apart from the applications supplied with Canon or Nikon cameras, a batch image processing application such as ACDSee Photo Studio must also be installed. Installing the latest version is highly recommended.
<image here>
This batch image processing application is used to manage large numbers of files efficiently. In addition, these applications show detailed metadata for the images produced while digitizing manuscripts.
Cleaning equipment
In addition to the tools mentioned above, the team is also equipped with the following tools for cleaning manuscripts.
- Microfiber cloth
- Soft brush
- Gloves and Masks
- Candlenut paste
- Lemongrass oil (Cymbopogon citratus oil)
- Mini-USB dust removal
<images here>
Supporting tools
Digitization activities often reach areas with difficult terrain. The team may face conditions that are far from ideal while a rescue mission for the manuscripts in a collection still has to be carried out immediately. The equipment carried must therefore anticipate anything that could hinder the digitization work. In addition to the core tools, the Wikisource Loves Manuscripts team is equipped with the following tools.
- Tweezers
- Black cloth
- Clear nylon thread
- Sponges
- Duplex board-paper
- Extension cables
- LED box
- Measuring tape
<images here>
Preparing The Manuscripts
In general, three types of material are commonly used for writing manuscripts in Indonesia. First, manuscripts made of paper, such as tree bark paper, European paper, and lined paper. Second, manuscripts made of leaves, such as palm leaves. Third, texts written on wooden materials such as bamboo, rattan, and the like.
Before digitizing, the team must carry out an inventory and sort the manuscripts in a collection based on their condition, especially the condition of the text area. This separates manuscripts that have been damaged from manuscripts that are still in good condition. It also helps to prevent further damage to the collection.
Before starting digitization, the team must ask the owner for permission to move the manuscript from the repository to the digitization room. Make sure manuscripts are carried with extreme care. Not all existing manuscripts can be digitized. Apart from obtaining permission from the owner, the team must also assess whether a manuscript is suitable for digitization based on its content and physical condition. Team members should select manuscripts whose content is important. The criteria that can be considered are:
- It has local value;
- It has a significant academic impact;
- It is unique compared to similar texts;
- Other considerations that can be accounted for.
Apart from the contents, the team also needs to consider the type of manuscript to be digitized. Make sure the manuscript is written by hand, not printed (lithograph), and comes from the period before the printing era.
The condition of the manuscript also needs to be taken into consideration. In consideration of its merits, Wikisource Loves Manuscripts will only digitize manuscripts that are in good condition and are at risk of damage. As for manuscripts that have been damaged, they do not need to be digitized.
Paper Manuscript Handling
After the manuscript has been brought to the digitization site, clean dust and dirt from all parts of the manuscript using a soft brush. Everyone in the room, or close to where manuscripts are being cleaned, must wear a mask. Those in charge of cleaning manuscripts must also wear gloves. This minimizes the effects of the dust on breathing and helps prevent skin irritation.
<image here>
Palm-leaf Manuscript Handling
Traditionally, text is written on palm leaves by scratching the surface of the leaf with a small, sharp-tipped knife. The strokes that form the text are then smeared with black roasted candlenut so that it fills the scratches.
Manuscripts made of palm leaves are handled differently from manuscripts made of paper. Palm-leaf (lontar) manuscripts are often found moldy as a result of being stored in damp places, with ink that is no longer visible on the surface of the leaves. Over time, and with improper storage, the black ink from the roasted candlenut fades.
Therefore, the steps for handling manuscripts made from palm leaves are as follows:
- Separate the lontars which are damaged from those which are in good condition and suitable for digitization; remove the palm leaf string if it still has the wooden cover and binding strap;
- After being separated from each other, the palm plates are cleaned of dust or mold on the surface using a soft brush or microfiber cloth;
- After cleaning from dust, apply the paste of roasted candlenut to blacken the text on the surface of the palm leaves again;
- Prepare enough citronella oil. Then wet the microfiber cloth with the citronella oil and gently apply it to the surface of the palm leaves that have been blackened with the roasted candlenut paste;
- After smearing with citronella oil, dry the palm leaf plates without exposure to sunlight;
- After the palm leaf plates have dried and the faded text is clearly visible, digitization can be done.
<image here>
Identifying Physical Information of the Manuscripts
After the manuscript is cleaned, the next step is to carry out a codicological analysis of the physical manuscript. Physical analysis of manuscripts is carried out by academic experts and their assistants by filling out the available metadata forms, either printed or digital. Guidance on filling in physical metadata will be explained in a separate chapter.
Counting Manuscript Pages
Before digitization, the team members count the number of pages of the manuscript, including its cover, and later make sure that this number matches the number of digital images produced. The team must also determine the order in which the manuscripts will be photographed and mark the start or end of each text to be digitized (if a manuscript contains more than one text).
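The page-count check described above can be sketched in a few lines of Python. This is only an illustration, assuming the One Shoot/One Page principle (one JPG per page) and a hypothetical helper name `images_match_pages`; it is not part of the team's tooling.

```python
def images_match_pages(page_count, image_names):
    """Return True when the number of JPG images equals the counted pages.

    Assumes one JPG per manuscript page (One Shoot/One Page); RAW files
    such as .CR2 are ignored so that each shot is counted once.
    """
    jpgs = [name for name in image_names if name.lower().endswith(".jpg")]
    return len(jpgs) == page_count
```

For example, a manuscript counted at two pages with the hypothetical files `a.JPG`, `a.CR2`, `b.JPG`, and `b.CR2` passes the check, while the same files fail it for a three-page count.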
<image here>
For this reason, Wikisource Loves Manuscripts in Indonesia has provided manuscript identification numbers to determine the sequence of manuscripts that must be photographed. Paste the manuscript identification number sticker on the cover of the manuscript.
Studio Setup for Digitizing the Manuscripts
After preparing all the devices needed in digitization activities, the next step is to understand how to install these devices. The steps that must be passed are the installation of copy stands, lighting equipment, cameras, camera software, and camera settings.
How to Start Digitizing the Manuscripts
After the tools for digitizing manuscripts, both software and hardware, are properly installed, digitization activities can be carried out. Before doing so, this section explains the principles of digitization that comply with the provisions of Wikisource Loves Manuscripts.
Basic Principles of Digitization Techniques
One Shoot-One Page and One Shoot-Two Pages
The basic principle of the digitization technique adopted by Wikisource Loves Manuscripts is One Shoot/One Page. This means that the photographer shoots only one page per camera shot, so that the number of digital images produced matches the number of manuscript pages. This principle applies to manuscripts in the form of books and paper.
If the manuscript is too small for the camera lens to capture a single page properly, the One Shoot/Two Pages principle is used instead. A consequence of this principle is that the recto-verso folder system does not apply. Neither principle applies if the object is a non-book manuscript.
<image here>
Project Numbering Card
To ensure that the digitized manuscripts match the predetermined numbering, the project numbering card is placed on the cover page. If the manuscript is made of palm leaves, cards are placed on each page side by side with the QP Card.
<image here>
Using QP Card
The QP Card is used to calibrate the colors of the digitized manuscripts. Every time a photograph is taken, the photographer must place the card at the bottom of the manuscript page. If the object is made of palm leaves, the card is placed at the bottom right of the manuscript, side by side with the project numbering card.
In addition to having three colors and field size indicators, the QP Card also has a small arrow. This mark indicates the top of the manuscript, not the orientation of the text.
Taking Shots
- Put the project number above the manuscript you want to photograph. This makes identification and metadata entry easier. <image here>
- Fill in the physical metadata of the manuscript on the 'metadata' form.
- Place the manuscript and QP Card under the camera. Make sure the QP Card position is aligned with the thickness of the manuscript. If the manuscript is too thick, the QP Card must be placed on cardboard that is stacked as high as the thickness of the manuscript and covered with a black cloth. <image here>
- Click the 'live view shoot' menu on the camera remote. Make sure the manuscript fills the image field and avoid leaving too much empty space. Manuscripts must be straight and symmetrical.<image here>
- You can raise or lower the camera on the copy stand by rotating the pulley to adjust the coverage of the object.
- Once the manuscript is in perfect position, close the 'Remote Live View Window'.
- Hover the mouse cursor over the shutter button in the remote shooting window. Listen for the 'beep', which signals that the camera focus is working properly. Then click the 'shutter' button to start shooting.
- Open 'Digital Photo Professional 4' to view the resulting photo. Make sure nothing obstructs the manuscript in the photo.
- One shot produces two image files, one .CR2 and one .JPG.
- Continue shooting until finished.
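Since every shot should yield both a .CR2 and a .JPG file, a quick pairing check can catch shots where one of the two is missing. The following is a minimal sketch; the function name `unpaired_shots` and the sample file names are hypothetical.

```python
from pathlib import PurePath


def unpaired_shots(file_names):
    """Return the sorted stems of shots that lack a .CR2 or a .JPG file."""
    seen = {}
    for name in file_names:
        path = PurePath(name)
        ext = path.suffix.lower()
        if ext in {".cr2", ".jpg"}:
            # Collect the extensions found for each shot, keyed by file stem.
            seen.setdefault(path.stem, set()).add(ext)
    return sorted(stem for stem, exts in seen.items()
                  if exts != {".cr2", ".jpg"})
```

Calling it on `["IMG_0001.CR2", "IMG_0001.JPG", "IMG_0002.JPG"]` reports `IMG_0002`, the shot whose RAW file is missing.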
The sequence of photo shots of the manuscript parts
Photographers must digitize in the following order:
- Photograph the 'Identity' parts first, in the order below. All of these photos are stored in the 'Identity' folder.
  - Front Cover
  - Back Cover
  - Front Pastedown
  - Back Pastedown
  - Watermark
- After all the 'Identity' parts have been photographed, the next step is to shoot the 'Recto' part. <image here>
- After the 'Recto' part has been photographed in its entirety, shoot the 'Verso' part and store it in the 'Verso' folder.
Important Things in Digitalization
In digitizing, the photographer must pay attention to the following points:
- Before shooting as a whole, make a sample of the photos to check the level of color balance (calibration).
- Check the framing of the image.
- Image results must be proportional, not concave/convex.
- The distance between the QP Card and the manuscript is approximately 1 cm.
- Check the resulting image at 100% zoom level.
- Check the focus.
- In "View", select "grid lines" to check that the image is straight and not skewed.
- Check the image quality after every 20 shots.
Color Calibration
As previously stated, the photographer must take a sample photo to measure the color balance. Once the colors are balanced, shooting can continue.
<image here> Here's how to measure the color balance of the digitized result:
- After making a sample, open the digitized results through the 'Digital Photo Professional' application. Double click on the JPG image and hover over the gray color on the QP Card.
- Pay attention to the red, green, and blue (RGB) color channels as shown above. In this example, 118 is the value of the red channel, 127 the green channel, and 144 the blue channel.
If the values are out of balance, apply the following rules:
- The RGB values must lie within 7 points of each other. That is, the difference between the highest and lowest values of the three channels must not exceed 7 points (≤ 7).
- If the difference is more than 7 points, the photographer can adjust the color temperature in the camera settings.
- Hint: If the B (Blue) value is too high, set the color temperature (degrees Kelvin) to a higher number and try until it fits. If R (Red) and G (Green) are higher than B (Blue), set the temperature to a lower number (example: 5500).
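The gray-balance rule and the hint above can be expressed as a small check. This is a sketch only: it reads the limit as "no more than 7 points", the function name `gray_balance` and the wording of the advice strings are assumptions, and it gives no advice for channel combinations the guide does not cover.

```python
def gray_balance(r, g, b, tolerance=7):
    """Return (spread, advice) for RGB values read from the gray patch."""
    spread = max(r, g, b) - min(r, g, b)
    if spread <= tolerance:
        return spread, "balanced"
    if b > r and b > g:
        # Blue is the highest channel: use a higher Kelvin setting.
        return spread, "raise color temperature"
    if r > b and g > b:
        # Red and green are both above blue: use a lower Kelvin setting.
        return spread, "lower color temperature"
    return spread, "no rule in the guide; adjust by trial"
```

With the sample values from the text (118, 127, 144), the spread is 26 points and blue is the highest channel, so the sketch suggests raising the color temperature.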
If the gray balance has been obtained, the next step is to check the white balance. Point the mouse at the white color found on the QP Card. Then pay attention to the RGB color balance adjusted according to the following conditions:
- The values of all three channels must be in the range between 230 and 239.
- To change the values of the three color calibration channels, you can adjust the intensity of the flash light.
- The intensity level of both flash lamps must be the same.
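The white-patch condition can be checked in the same way. In this sketch, the function name `check_white_patch` is hypothetical, and the direction of the flash adjustment (more power for values that are too low, less for values that are too high) is an assumption based on how flash exposure generally behaves, not a rule stated in the guide.

```python
def check_white_patch(r, g, b, low=230, high=239):
    """Classify RGB values read from the white patch of the QP Card."""
    values = (r, g, b)
    if all(low <= v <= high for v in values):
        return "ok"
    if min(values) < low and max(values) > high:
        # Channels disagree strongly: the gray balance is probably off.
        return "recheck gray balance"
    if min(values) < low:
        return "increase flash power"
    return "decrease flash power"
```

Whatever adjustment is made, it should be applied equally to both flash lamps, as required above.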
Photographing Watermarks and Paper Countermarks
If the paper of the manuscript carries identifying marks in the form of watermarks or countermarks, the steps are as follows:
- Prepare the LED lighting board and connect it to the provided USB Hub. Press the power on button so that the LED lighting board lights up. <image here>
- If there are many watermarks or countermarks in one manuscript, look for a sheet of the manuscript that has a watermark or countermark with the most intact image as a sample to take a picture of.
- Turn off the flash lights.
- Once the LED lighting board is lit, place it onto the sample manuscript sheet that has a watermark or countermark. <image here>
- Set the camera's ISO from 100 to 2500. You can adjust the conditions until the watermark results are clearly visible.
- Click the camera shutter button in the 'Remote Shooting' dialog box.
- Make sure the image file goes into the 'IDENTITY' folder. <image here>
- Shooting of more than one watermark in one manuscript is allowed.
Special Techniques
Damaged Manuscripts
If the manuscript has holes or tears, place a black cloth behind the page to be photographed. This prevents whatever lies behind the hole or tear, including the text of the neighboring page, from appearing in the photograph and interfering with the reading of the page being photographed. <image here>
Non-Paper, Folded, or Rolled Manuscripts
Folded manuscripts, scrolls, and manuscripts made of bamboo or wood are photographed by adjusting the coverage of the camera lens. The direction and sequence of shooting follow the orientation of the text. As a consequence, the folder structure no longer uses the recto and verso system.
<images here>
Palm Leaf Manuscripts
Digitization of palm leaf manuscripts uses the one shoot/one page principle for the first and last leaves. From the second leaf onward, the one shoot/two pages principle is applied. The recto and verso system is not needed.
<image here>
Folder and File Management
In digitizing manuscripts, folder management is crucial. Good folder and file management minimizes problems such as data loss and data corruption. The photography team must follow these steps:
- Create 3 (three) folders: 1 (one) 'Master Data' folder, 1 (one) 'Data Renaming' folder, and 1 (one) 'Final Backup' folder.
- Save the 'Master Data' folder to a separate hard drive.
- Always apply file changes in the 'Data Renaming' folder. If an error occurs in file management, you can retrieve the original files from the 'Master Data' folder.
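The three-folder layout can be created with a short script. This is a minimal sketch: the function name `create_project_folders` and the idea of passing two root paths (a working drive and a separate drive for 'Master Data') are assumptions made for illustration.

```python
import tempfile
from pathlib import Path


def create_project_folders(work_root, master_root):
    """Create the three folders and return their paths.

    'Master Data' goes under master_root (the separate hard drive);
    'Data Renaming' and 'Final Backup' go under work_root.
    """
    folders = [
        Path(master_root) / "Master Data",
        Path(work_root) / "Data Renaming",
        Path(work_root) / "Final Backup",
    ]
    for folder in folders:
        folder.mkdir(parents=True, exist_ok=True)
    return folders


if __name__ == "__main__":
    # Demonstration in temporary directories instead of real drives.
    with tempfile.TemporaryDirectory() as work, \
            tempfile.TemporaryDirectory() as master:
        for path in create_project_folders(work, master):
            print(path)
```

In real use, `master_root` would point to the separate external hard drive, so that the untouched originals never share a disk with the files being renamed.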
The provisions for naming folders and files are as follows:
- Folder numbering and naming:
  - Example: WiLMa ID 001 0001
  - Use spaces to separate the numbering codes.
  - WiLMa: Wikisource Loves Manuscripts.
  - ID: ISO 3166 code for Indonesia.
  - 001: manuscript owner code (3 digits).
  - 0001: digitized manuscript code (4 digits).
- File renaming:
  - Examples: WiLMa_ID_001_0001_001r and WiLMa_ID_001_0001_001v
  - Use underscores (_) to separate the numbering codes.
  - WiLMa: Wikisource Loves Manuscripts.
  - ID: ISO 3166 code for Indonesia.
  - 001: manuscript owner code (3 digits).
  - 0001: digitized manuscript code (4 digits).
  - 001: manuscript page (3 digits).
  - r/v: recto/verso.
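The naming provisions above can be expressed as a small helper that builds and checks names. This is an illustrative sketch only: the function names are hypothetical, the country code is assumed to be a two-letter ISO 3166 code, and the pattern covers only the basic recto/verso names (not the 'identity' codes or reshoot suffixes).

```python
import re

# Pattern for the basic file name, e.g. WiLMa_ID_001_0001_001r
FILE_NAME_PATTERN = re.compile(
    r"^WiLMa_(?P<country>[A-Z]{2})_(?P<owner>\d{3})_"
    r"(?P<manuscript>\d{4})_(?P<page>\d{3})(?P<side>[rv])$"
)


def folder_name(country, owner, manuscript):
    """Build a folder name; codes are separated by spaces."""
    return f"WiLMa {country} {owner:03d} {manuscript:04d}"


def file_name(country, owner, manuscript, page, side):
    """Build a file name (without extension); codes are separated by underscores."""
    if side not in ("r", "v"):
        raise ValueError("side must be 'r' (recto) or 'v' (verso)")
    return f"WiLMa_{country}_{owner:03d}_{manuscript:04d}_{page:03d}{side}"


def parse_file_name(stem):
    """Return the parts of a valid file name as a dict, or None if it is invalid."""
    match = FILE_NAME_PATTERN.match(stem)
    return match.groupdict() if match else None
```

A check such as `parse_file_name(...) is None` can flag files whose names were typed by hand and do not follow the scheme.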
The numbering is applied automatically with ACDSee Pro 8, which is already installed on the dedicated laptop used for Wikisource Loves Manuscripts digitization.
The software handles the numbering that the camera cannot do automatically. Note that this automatic numbering is used only for manuscripts that follow the recto-verso folder system. For the 'identity' folder, the numbering is done manually. The technical stages of numbering are as follows:
- Click the icon on the laptop's taskbar to open ACDSee Pro 8. Select the manuscript folder whose files you want to renumber. Start with the recto folder, then continue with the verso folder.
- Select all files (both CR2 and JPG types) in the folder by pressing Ctrl and A simultaneously. Then right-click and select 'Rename'. It will appear like this:
- Next, set up the file numbering. Select the 'Templates' tab and tick the checkbox 'Use Template to rename files'.
- Write a prefix in the form of a numbering code, followed by ### and ending with 'r' for recto or 'v' for verso.
- Select: Start at: Fixed value 1. Check the new numbering preview under New Name, and make sure it matches the file numbering system described above.
- If everything is in accordance with the provisions, click Start Rename. The numbering process is complete. Apply to all folders.
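To make the effect of the template (prefix, then ###, then 'r' or 'v') concrete, the sketch below computes the names such a template would produce. It is not a replacement for ACDSee Pro 8 and does not claim to reproduce its exact behavior. It assumes that the CR2 and JPG files of one shot share a base name and should receive the same page number; the function name and the camera file names in the example are hypothetical.

```python
from pathlib import Path


def plan_renames(file_names, prefix, side, start=1):
    """Map each original file name to its new name (prefix + 3-digit number + side).

    Files with the same base name (for example the CR2 and JPG of one shot)
    receive the same number. Nothing is renamed on disk; only a plan is returned.
    """
    # Number the shots in the sorted order of their base names.
    stems = sorted({Path(name).stem for name in file_names})
    number_for = {stem: start + i for i, stem in enumerate(stems)}
    plan = {}
    for name in file_names:
        path = Path(name)
        plan[name] = f"{prefix}{number_for[path.stem]:03d}{side}{path.suffix}"
    return plan


# Example with hypothetical camera file names from a recto folder.
plan = plan_renames(
    ["IMG_0002.JPG", "IMG_0001.CR2", "IMG_0001.JPG", "IMG_0002.CR2"],
    prefix="WiLMa_ID_001_0001_",
    side="r",
)
```

Reviewing such a plan before renaming plays the same role as checking the New Name preview in the rename dialog.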
Once the recto and verso numbering is complete, the next step is to change the numbering in the 'identity' folder. As stated previously, file numbering in this folder is done manually. The codes are as follows:
Manuscript pages | Numbering code |
---|---|
Front cover | WiLMa_ID_001_0001_fc |
Back cover | WiLMa_ID_001_0001_bc |
Front pastedown | WiLMa_ID_001_0001_fp |
Back pastedown | WiLMa_ID_001_0001_bp |
Watermark* | WiLMa_ID_001_0001_w |
*If there is more than one watermark, use the code w1, w2, w3, and so on.
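The codes in the table, including the rule for multiple watermarks, can be summarized in a short lookup. This is a sketch for illustration; the function name and the lower-case part labels are hypothetical.

```python
# Codes taken from the table above.
IDENTITY_CODES = {
    "front cover": "fc",
    "back cover": "bc",
    "front pastedown": "fp",
    "back pastedown": "bp",
    "watermark": "w",
}


def identity_name(prefix, part, index=None):
    """Build the file name for an 'identity' image.

    index is only meaningful for watermarks, when a manuscript has more
    than one (w1, w2, w3, and so on).
    """
    code = IDENTITY_CODES[part]
    if index is not None:
        if part != "watermark":
            raise ValueError("only watermarks are numbered")
        code = f"{code}{index}"
    return f"{prefix}_{code}"
```

For instance, `identity_name("WiLMa_ID_001_0001", "watermark", 2)` gives the name for the second watermark of that manuscript.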
Quality Control
Photographers are required to check digitization results regularly. In addition to monitoring image quality every 20 shots, checks are also carried out every time a bundle of manuscripts has been photographed. Checking during the mission saves time and cost compared with inspecting only after the digitization mission has been completed.
The first element to look at is image quality. This includes the display quality of digital images and color balance. Software such as Digital Photo Professional 4 and ACDSee Pro 8 (or later versions) can be used for this check.
The quantity of images also needs to be checked. Make sure the number of digitized files matches the number of original manuscript pages. If shooting uses the One Shoot/One Page principle, the number of files in the Recto and Verso folders must be identical. The following are some of the problems often found when digitizing manuscripts:
Problems | Troubleshooting |
---|---|
Image missing/no photo taken | Identify the missing pages. Then shoot each missing page and name the file according to its place in the sequence, adding the letter 'a'. |
Image captured more than once | Delete the duplicate image in both its CR2 and JPG versions, then correct the numbering. |
Image is out of focus | Reshoot the problem page. |
Another object is photographed | Reshoot the problem page. |
Disproportionate image | Reshoot the problem page. |
QP Card not visible or touching the manuscript | Reshoot the problem page. |
Part of the page is cut off or falls outside the photo | Reshoot the problem page. |
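The quantity check described above can be partly automated. The sketch below compares the number of pages in the Recto and Verso folders for the One Shoot/One Page case and, as an additional assumption not stated in the guideline, reports any shot that lacks its CR2 or JPG counterpart. The function names are hypothetical, and the functions work on lists of file names so that they can be tried without touching any files.

```python
from pathlib import Path


def pages(names, ext):
    """Return the set of base names that have a file with the given extension."""
    return {Path(n).stem for n in names if Path(n).suffix.lower() == ext}


def check_folder_pair(recto_names, verso_names):
    """Return a list of problems; an empty list means the counts are consistent."""
    problems = []
    counts = {}
    for label, names in (("recto", recto_names), ("verso", verso_names)):
        raw = pages(names, ".cr2")
        jpg = pages(names, ".jpg")
        # Shots present in only one of the two formats.
        for stem in sorted(raw ^ jpg):
            problems.append(f"{label}: {stem} lacks a CR2 or JPG counterpart")
        counts[label] = len(raw | jpg)
    if counts["recto"] != counts["verso"]:
        problems.append(
            f"recto has {counts['recto']} pages but verso has {counts['verso']}"
        )
    return problems
```

A check like this only covers quantity. Focus, framing, color balance, and the position of the QP Card still have to be inspected visually.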
Fill out the Quality Control Data Management Form
In addition to submitting digital manuscript files, photographers are required to fill out a Data Management Check-Sheet. This form is intended to ensure that all stages of digitization have been carried out properly.
This form is filled out each time a photographer completes a photo shoot of a bundle of manuscripts before moving on to the next photo shoot. This form can be downloaded online via:
PDF versions
The PDF document currently available is version 1.0, which is available in Arabic, English, Hindi, Indonesian, and Spanish. The content of this guide will be updated regularly to add to or improve its substance.
- Arabic
- English
- Hindi
- Indonesian
- Spanish
Supplements
Other supporting documents for the guideline are available under Category:Wikisource Loves Manuscripts guidelines.