SpeakReader has been installed and tested on Windows 10 and Ubuntu 18.04 using Python 3.7.3.
Install the latest version of Python 3. Download the Windows installer and complete the installation with all the default options.
- Download SpeakReader from GitHub: https://github.com/zSeriesGuy/SpeakReader/zipball/master
- Extract the ZIP file.
- Open a CMD window.
- CD to the directory where you unzipped SpeakReader.
- Type:
python3 -m venv .\venv
- Type:
.\venv\scripts\activate
- Type:
python -m pip install --upgrade pip
- Type:
pip3 install --upgrade setuptools wheel pip-tools
- Install the appropriate PyAudio wheel package for your release of Windows and Python. Several are provided in the PyAudioWheels folder. For example, to install the 32-bit version for Python V3.7:
- Type:
pip3 install .\PyAudioWheels\PyAudio-0.2.11-cp37-cp37m-win32.whl
- Type:
pip3 install -r requirements.txt
- Type:
.\venv\scripts\python start.py
to start SpeakReader.
- SpeakReader will be loaded in your browser or listening on http://localhost:8880
- You can change the port in the Settings tab of the management page or by editing config.ini.
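For the PyAudio step above, the wheel filename has to match both the Python version and the bitness of the interpreter. The following is a minimal sketch that builds the expected filename using the standard wheel tag conventions. The helper names are hypothetical, and whether a matching file is actually present in the PyAudioWheels folder is an assumption you should check.

```python
import struct
import sys


def pyaudio_wheel_name(major, minor, bits, version="0.2.11"):
    """Build the expected PyAudio wheel filename for a Windows interpreter.

    Hypothetical helper: it only follows the wheel naming convention and does
    not check that the file exists in the PyAudioWheels folder.
    """
    py_tag = f"cp{major}{minor}"
    # CPython 3.7 and earlier append "m" to the ABI tag; 3.8 and later do not.
    abi_tag = py_tag + ("m" if (major, minor) <= (3, 7) else "")
    platform_tag = "win32" if bits == 32 else "win_amd64"
    return f"PyAudio-{version}-{py_tag}-{abi_tag}-{platform_tag}.whl"


def current_wheel_name():
    """Return the expected filename for the interpreter running this script."""
    bits = struct.calcsize("P") * 8  # pointer size in bits: 32 or 64
    return pyaudio_wheel_name(sys.version_info.major, sys.version_info.minor, bits)


if __name__ == "__main__":
    print(current_wheel_name())
```

Run it with the Python inside the virtual environment so that the reported version and bitness match the interpreter that pip3 will install into.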
NOTE: This method is preferred because it installs Git on Windows, which lets SpeakReader update itself more reliably. The Git installer adds extra shell extensions and adjusts your PATH environment variable.
- Go to https://gitforwindows.org/ and download git.
- Run the installation with the default options.
- Right click on your desktop and select "Git Gui".
- Select "Clone Existing Repository".
- In the "Source Location" enter: https://github.com/zSeriesGuy/SpeakReader.git
- In the "Target Directory" enter a new folder where you want to install SpeakReader to (e.g. C:\SpeakReader).
- Click "Clone".
- When it's finished, a Git Gui window will appear. Just close this window.
- Go to Method 1 and continue at the "Open a CMD window" step.
SpeakReader will be installed to /opt/SpeakReader.
- Open a terminal
- Install prerequisites:
- Ubuntu/Debian:
sudo apt-get install git-core
sudo apt-get install build-essential libffi-dev libssl-dev libxml2-dev libxslt1-dev libjpeg8-dev zlib1g-dev alsa-utils libatlas-base-dev portaudio19-dev
sudo apt-get install python3 python3-venv python3-dev
- Type:
cd /opt
- Type:
sudo git clone https://github.com/zSeriesGuy/SpeakReader.git
- Type:
cd SpeakReader
- Type:
python3 -m venv /opt/SpeakReader/venv
- Type:
source /opt/SpeakReader/venv/bin/activate
- Type:
pip3 install --upgrade pip setuptools wheel pip-tools
- Type:
pip3 install -r /opt/SpeakReader/requirements.txt
- Type:
sudo usermod -aG audio {userid}
where {userid} is the user that you signed in to the terminal with.
- Type:
/opt/SpeakReader/venv/bin/python3 /opt/SpeakReader/start.py
to start SpeakReader.
- SpeakReader will be loaded in your browser or listening on http://localhost:8880
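If the browser does not open on its own (for example on a headless server), you can confirm that something is listening on the port. This is a minimal sketch using only the standard library. The function name is hypothetical, and the default host and port are simply the ones quoted above.

```python
import socket


def is_listening(host="localhost", port=8880, timeout=2.0):
    """Return True if a TCP connection to host:port can be opened."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts, and name resolution failures.
        return False


if __name__ == "__main__":
    state = "is" if is_listening() else "is not"
    print(f"A server {state} listening on http://localhost:8880")
```

A successful connection only shows that some process accepts connections on that port. It does not prove that the process is SpeakReader.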
- Ubuntu/Debian:
sudo cp /opt/SpeakReader/init-scripts/speakreader.service /lib/systemd/system
sudo systemctl daemon-reload
sudo addgroup speakreader && sudo adduser --system --no-create-home speakreader --ingroup speakreader
sudo usermod -aG audio speakreader
sudo chown -R speakreader:speakreader /opt/SpeakReader
sudo systemctl enable speakreader
sudo systemctl start speakreader
If you configure SpeakReader to listen on port 80 or 443 (or any port below 1024), you will need to run SpeakReader as the root user.
- Edit
/lib/systemd/system/speakreader.service
and change user and group to root.
- Reload systemd and restart the service:
sudo systemctl daemon-reload
sudo systemctl restart speakreader
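To see whether a given port needs elevated privileges for the current user, you can try to bind it before changing the service file. This is a rough sketch, not part of SpeakReader: the function name and the returned labels are hypothetical.

```python
import socket


def check_bind(port, host="127.0.0.1"):
    """Try to bind host:port and report the outcome as a short label."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        try:
            sock.bind((host, port))
        except PermissionError:
            # Typical for ports below 1024 when not running as root.
            return "needs-privilege"
        except OSError:
            # Most often: the port is already in use by another process.
            return "unavailable"
    return "ok"


if __name__ == "__main__":
    for candidate in (80, 443, 8880):
        print(candidate, check_bind(candidate))
```

Stop SpeakReader before running the check. Otherwise the port it currently uses will be reported as unavailable.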
Google API, IBM Watson, and Microsoft Azure are supported.
Not all transcription service providers are available on all platforms. For example, Microsoft Azure Speech Services is currently only available for Windows 32-bit and 64-bit, macOS, and Linux 64-bit on non-ARM processors.
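The platform rule for Microsoft Azure can be written as a small check. The sketch below encodes only the sentence above, so it reflects availability as described in this document and may not match what Microsoft supports today. The function name is hypothetical.

```python
import platform


def azure_speech_supported(system, machine):
    """Apply the rule stated above: Windows, macOS, or 64-bit non-ARM Linux."""
    system = system.lower()
    machine = machine.lower()
    if system in ("windows", "darwin"):
        return True
    if system == "linux":
        # 64-bit x86 only; ARM names such as aarch64 or armv7l are excluded.
        return machine in ("x86_64", "amd64")
    return False


if __name__ == "__main__":
    print(azure_speech_supported(platform.system(), platform.machine()))
```

On a Raspberry Pi, for example, platform.machine() typically reports an ARM name such as armv7l or aarch64, so the check returns False.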
You won't be able to start the transcribe engine until you have provided a valid Google API credentials JSON file.
NOTE: The Google Speech-To-Text API service is not free, but it is inexpensive: less than $1 US per hour. See the pricing.
- Go to the Google API Console. Sign in with your Google Account if you have one or create one.
- Create a billing account by clicking on Billing and set up your payment method.
- Return to the Home page.
- At the top of the screen next to Google Cloud Platform, click Select Project.
- In the pop-up window, click NEW PROJECT in the upper right.
- Give the project a name or take the default and click CREATE.
- Back on the Home page, if the selected project is not your newly created project, select it from the top of the screen next to Google Cloud Platform.
- Select APIs & Services.
- Click ENABLE APIS AND SERVICES at the top of the screen.
- Search for Speech and click on Cloud Speech-to-Text API.
- Click on ENABLE to enable this API for your project.
- On the APIs & Services screen, click Credentials.
- Click CREATE CREDENTIALS at the top of the screen and choose Service account key.
- Create a Service Account with a Role of Service Usage Consumer. The Key Type should be JSON.
- After you click the Create button, a JSON file will be downloaded to your computer.
- Go to the SpeakReader management console Settings page and upload this JSON file.
- You should now be able to start the transcribe engine.
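Before uploading, you can sanity-check the downloaded file. The sketch below assumes the usual layout of a Google service account key file (a JSON object with "type", "project_id", "private_key", and "client_email" fields). The function name is hypothetical, and the check does not contact Google, so a file that passes may still be rejected.

```python
import json

# Fields normally present in a Google service account key file.
REQUIRED_KEYS = ("type", "project_id", "private_key", "client_email")


def check_service_account_file(path):
    """Return a list of problems found in the file; an empty list means it looks plausible."""
    try:
        with open(path, encoding="utf-8") as handle:
            data = json.load(handle)
    except (OSError, ValueError) as exc:
        # ValueError covers malformed JSON and undecodable bytes.
        return [f"cannot read JSON: {exc}"]
    if not isinstance(data, dict):
        return ["top-level JSON value is not an object"]
    problems = [f"missing key: {key}" for key in REQUIRED_KEYS if not data.get(key)]
    if data.get("type") and data["type"] != "service_account":
        problems.append(f"unexpected type: {data['type']}")
    return problems


if __name__ == "__main__":
    import sys
    for problem in check_service_account_file(sys.argv[1]):
        print(problem)
```

Pass the path of the downloaded file as the first argument. No output means no problems were detected.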
HINT: When you go to the Google API Console, there may be a Free Trial banner at the very top of the screen. You can activate it to get a $300 credit good for one year. This will allow you to test and use SpeakReader at no cost for one year or $300 worth, whichever comes first.
You won't be able to start the transcribe engine until you have provided a valid IBM Cloud API credentials ENV file.
NOTE: The IBM Cloud Speech-To-Text API service is not free, but it is inexpensive: less than $1 US per hour. See the pricing. There is a Lite level that includes 500 minutes per month FREE.
- Go to the IBM Cloud API Dashboard. Sign in with your IBM Cloud Account if you have one or create one.
- Set up your billing method by clicking on Manage->Billing and Usage. Then click on Payments.
- Next, set up your resource by clicking on Resource List. Then click Create Resource (upper right).
- Search for speech to text. It should find AI Speech to Text. Click on it. The free Lite plan is a good start.
- Once you have completed creating the resource, click Resource List at the top. Then choose Manage on the menu at the left.
- It should be showing you a screen with the API Key. Click Download to save the credentials ENV file to your computer.
- Go to the SpeakReader management console Settings page and upload this ENV file.
- You should now be able to start the transcribe engine.
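The credentials ENV file is a plain text file of KEY=VALUE lines, so it can be inspected before uploading. The sketch below is a generic parser. It assumes that the file contains keys ending in APIKEY and URL (for example SPEECH_TO_TEXT_APIKEY and SPEECH_TO_TEXT_URL), which is an assumption about IBM's format rather than something this document states. The function names are hypothetical.

```python
def parse_env_file(path):
    """Parse KEY=VALUE lines, skipping blank lines and # comments."""
    values = {}
    with open(path, encoding="utf-8") as handle:
        for raw in handle:
            line = raw.strip()
            if not line or line.startswith("#") or "=" not in line:
                continue
            key, _, value = line.partition("=")
            values[key.strip()] = value.strip()
    return values


def missing_settings(values, suffixes=("APIKEY", "URL")):
    """Return the suffixes for which no non-empty key was found (assumed names)."""
    return [
        suffix
        for suffix in suffixes
        if not any(key.endswith(suffix) and val for key, val in values.items())
    ]


if __name__ == "__main__":
    import sys
    print(missing_settings(parse_env_file(sys.argv[1])))
```

An empty list means both an API key and a service URL were found. It does not mean the key is valid.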
You won't be able to start the transcribe engine until you have provided a valid Microsoft Azure APIKEY and Region setting.
NOTE: The Microsoft Azure Speech-To-Text API service is not free, but it is inexpensive. See the pricing. There is a Free tier that allows for 5 audio hours per month.
- Check out the Getting Started page for more instruction detail.
- Go to the Azure Portal Dashboard. Sign in to your Azure Account.
- Click Create a resource at the top of the left menu.
- Search for Speech and click on Speech.
- Click Create.
- Once you have completed creating the resource, you can click on All resources in the left menu, then select your resource.
- In the resource page, you will find the APIKEY in Quick start.
- Copy and paste the APIKEY into the SpeakReader management console settings page after choosing the Microsoft transcription service.
- Set the Service Region that you chose when you created the Azure resource.
- You should now be able to start the transcribe engine.
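If the transcribe engine does not start, it can help to confirm that the key and region belong together. The sketch below asks the regional token endpoint of Azure Cognitive Services for a short-lived token. It is independent of SpeakReader, the function names are hypothetical, and the endpoint pattern is stated from memory of Azure's documentation, so treat it as an assumption and verify it for your resource.

```python
import urllib.error
import urllib.request


def build_token_request(api_key, region):
    """Build a POST request to the regional token endpoint (assumed URL pattern)."""
    url = f"https://{region}.api.cognitive.microsoft.com/sts/v1.0/issueToken"
    return urllib.request.Request(
        url,
        data=b"",  # empty body; the key is sent in the header
        method="POST",
        headers={"Ocp-Apim-Subscription-Key": api_key},
    )


def key_is_accepted(api_key, region, timeout=10):
    """Return True if the service issues a token, False on an HTTP error status."""
    request = build_token_request(api_key, region)
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status == 200
    except urllib.error.HTTPError:
        # For example 401 when the key and region do not match.
        return False


if __name__ == "__main__":
    import sys
    print(key_is_accepted(sys.argv[1], sys.argv[2]))
```

Network failures, such as no connectivity or a misspelled region that does not resolve, raise urllib.error.URLError instead of returning False.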
If you installed SpeakReader via Git, you will be notified when upgrades are available on the management console's Settings page.
Upgrading will run an environment sync, which may take a while on some platforms. If the upgrade process does not restart the management console, check the log in the data/logs folder.
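To look at the end of the most recent log without opening the whole file, something like the following can be used. It is a minimal sketch: the function name is hypothetical, the log file names are not assumed, and the newest file in the folder is picked by modification time.

```python
from collections import deque
from pathlib import Path


def tail_newest_log(log_dir="data/logs", lines=20):
    """Return the last `lines` lines of the most recently modified file in log_dir."""
    files = [path for path in Path(log_dir).glob("*") if path.is_file()]
    if not files:
        return []
    newest = max(files, key=lambda path: path.stat().st_mtime)
    with newest.open(encoding="utf-8", errors="replace") as handle:
        # deque with maxlen keeps only the final lines while reading.
        return [line.rstrip("\n") for line in deque(handle, maxlen=lines)]


if __name__ == "__main__":
    for line in tail_newest_log():
        print(line)
```

Run it from the SpeakReader installation directory so that the relative data/logs path resolves, or pass the full path to the folder.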