Skip to content

Visual-to-Voice is a lightweight, open-source project to convert PDF and image text into speech. Using advanced OCR and text-to-speech technology, it ensures fast, accurate, and natural voice output. Perfect for enhancing accessibility and productivity.

Notifications You must be signed in to change notification settings

Jakiur1234/VisualToVoice

Repository files navigation

Installation Guide

1. Install Homebrew

If you don’t have Homebrew installed, run the following command to install it:

/bin/bash -c "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/HEAD/install.sh)"

For more information, visit Homebrew's official website.

2. Install Tesseract

Once Homebrew is installed, you can install Tesseract OCR by running:

brew install tesseract

For more details about Tesseract installation, visit Tesseract Formula.

3. Install Composer

Ensure you have Composer installed, and PHP version >= 8.2. To install Composer, follow the instructions on getcomposer.org.

Usage Instructions

1. Clone the Repository

Clone the repository to your local machine:

git clone https://github.com/Jakiur1234/VisualToVoice

2. Navigate to the Project Directory

Move into the project directory:

cd VisualToVoice

3. Install Dependencies

Run the following command to install all the required dependencies:

composer install

4. Start the PHP Development Server

To start the PHP development server, run:

php -S localhost:8080 -t public

5. Access the Application

Open your browser and go to http://localhost:8080 to access the application and start using it.

About the Project

This project is built with plain PHP. After spending a long time working with Laravel, I’m revisiting PHP’s Object-Oriented Programming (OOP) and procedural programming paradigms.

In the near future, I plan to create a simple yet powerful PHP framework and rebuild this project using that framework. Stay tuned for updates, inshallah!

Features

  • OCR (Optical Character Recognition): Converts images and PDFs to text.
  • Multi-format Support: Handles PDFs, JPG, JPEG, PNG files.
  • Custom Language Support: Allows the user to specify the OCR language.
  • Easy Installation: Simple setup with just a few commands.

Contributing

If you'd like to contribute to this project, feel free to fork the repository, make changes, and create a pull request. All contributions are welcome!

License

This project is licensed under the MIT License - see the LICENSE file for details.

Enhancements Made:

  1. Structured the installation process with clear headings.
  2. Added links to the official resources (e.g., Homebrew, Tesseract, Composer).
  3. Improved readability with a clean format and added more context about the project.
  4. Included a "Features" section to highlight the project's capabilities.
  5. Included a "Contributing" section to encourage open-source collaboration.
  6. Added a "License" section to make the project's license clear.

Let me know if you need any more adjustments!

About

Visual-to-Voice is a lightweight, open-source project to convert PDF and image text into speech. Using advanced OCR and text-to-speech technology, it ensures fast, accurate, and natural voice output. Perfect for enhancing accessibility and productivity.

Topics

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages