בסיעתא דשמיא

How to Transcribe Jewish Liturgy and Liturgy Related Work

A fundamental value of our project is correctly attributing Jewish liturgy and liturgy related work. Even when the original author of a work is lost to history, we strive to record every adaptation and variation sourced within particular manuscripts and extant published works in the Public Domain.

To do this, our volunteers help produce transcriptions that are easily determined to be authentic witnesses of a given work, whether it is the earliest known version known, or some other variation. We use Wikisource, as the collaborative transcription and proofreading environment for transcribing Public Domain and free-culture licensed texts.

Completed transcriptions:

  • Stunden Der Andacht (Fanny Neuda, tkhines, German, 1858). Many thanks to Open Siddur Project volunteer, Chajm Guski!

Volunteers have begun to transcribe and proofread the following texts.

To begin transcribing and proofreading Jewish liturgy and liturgy related work, please complete the following steps:

  1. Install a Hebrew Keyboard layout for your Operating System. (We recommend the Biblical Tiro layout.) Refer to the key mapping images and familiarize yourself with the four levels of the Biblical Tiro keyboard layout.
  2. Download and install Unicode Hebrew Fonts supporting the full range of Hebrew diacritics. We recommend installing the Taamey Frank CLM font from the Culmus Project (available in the Open Siddur Font Pack).
  3. Configure your web browser to display Unicode Hebrew Fonts supporting the full range of Hebrew Diacritics. See below for specific details for changing the default Hebrew fonts displayed in Mozilla Firefox.
  4. Register a new user account with Wikisource.
  5. Login and set your preferred settings for the language and editing interface in Wikisource (see below).

Preparing Mozilla Firefox for Transcription in Wikisource

  • Download and install the fonts in the Open Siddur Unicode Hebrew Font Pack
  • Type “about:config” (without quotes) into the address bar. Click OK at any warnings.
  • Type “font*.he” (without quotes) into the search field.
  • Right click on the preference names appearing below, and replace their string value with that of your preferred Hebrew font supporting the full range of Hebrew diacritics. You can also change the integers of the font sizes that will appear in

Familiarizing Yourself with Wikisource

Login and Settings

  • To login to your account at Wikisource, click on the login link at the top right of the web page.
  • Click “My Preferences” in the top right corner. In Hebrew Wikisource, click “כניסה לחשבון” in the top left corner
  • Click “User Profile” to choose your preferred language for working within Wikisource’s interface. To navigate Hebrew Wikisource in English or another language, click “פרטי המשתמש” and the context menu next to “שפת הממשק” to select your preferred language.
  • Click “Editing” to set your preferred settings for using the editing interface. See below for my preferred settings to edit.

Using the Transcription Interface

The great things about collaborative transcription and proofreading is that you can correct other’s work and others can correct your errors. The key thing is to know how to navigate the Wikisource interface.

  1. To edit a page of text, click on the ‘Edit’ link (next to the ‘Read’ link) above the page image.
  2. When you are done editing or proofreading a page, don’t forget to indicate in what state you’ve left it. Reading the help page “Help:Editing” will help you better understand how Wikisource users track their transcription and proofreading.

What about OCR for Hebrew

With technology in its current state, manual transcription (typing) is the only reliable way to transcribe Hebrew text with vowels. Open source tools for the automated transcription of Hebrew are not capable of reliable conversion of images with Hebrew letters and diacritical marks into machine readable Hebrew text without requiring more work proofreading the text than would have been done transcribing it from scratch.[1] We have not found closed source tools to be any better. Until such tools improve, projects such as the Open Siddur must depend on the manual transcription of text by humans.

Notes:

  1. HOCR is available for testing on Linux. Unfortunately, an effort to continue Kobi Zamir’s work on hOCR stalled in 2010. A early version of hOCR compiled for use on Windows is available for download here.
 . Creative Commons Attribution-ShareAlike . 3.0 . Unported .
“How to Transcribe Jewish Liturgy and Liturgy Related Work” is shared by The Hierophant with a Creative Commons Attribution-ShareAlike 3.0 Unported license.
avatar

About The Hierophant


A hierophant is a person who invites participants in a sacred exercise into the presence of that which is deemed holy. The title, hierophant, originated in Ancient Greece and combines the words φαίνω (phainein, "to show") and ‏τα ειρα (ta hiera, "the holy"); hierophants served as interpreters of sacred mysteries and arcane principles. For the Open Siddur Project, the Hierophant welcomes new contributors and explains our mission: ensuring creatively inspired work intended for communal use is shared freely for creative reuse and redistribution. Aharon Varady, founding director of the Open Siddur Project, serves as hierophant, and administers opensiddur.org as its webmaster and editor-in-chief.

Related liturgy and liturgy-related work:

  • No related liturgy and liturgy-related work

1 comment to How to Transcribe Jewish Liturgy and Liturgy Related Work

Leave a Reply

בסיעתא דארעא