YouTube Transcriber

Installation

This repository includes couple options for transcription;

YouTube video transcriber
A video transcriber
Record your voice and transcriber afterwards
Transcribe while speaking

For each use case's you need to install couple different softwares/libraries.

The main use case is downloading a video / videos (as audio) from YouTube and transcribe them. For this use case, these steps should be followed;

clone the repository

 $ git clone https://github.com/erolrecep/Transcriber.git

Install SoX Swiss army knife for audio processing things

 $ brew install sox      				# For Mac Os X
 $ sudo apt install sox  				# For Ubuntu

create a new Python virtual environment

 $ conda create --name transcriber python=3.6
 $ conda activate transcriber
 $ conda install tensorflow==1.13.1 			# if you have GPU, then install *conda install tensorflow-gpu==1.13.1*  (surprisingly I like this version of tensorflow :) )
 $ pip install youtube-dl deepspeech==0.7.4  		# if you have GPU, then install *pip install deepspeech-gpu==0.7.4*

Download pre-trained DeepSpeech models from here
- This repository uses 0.7.4 version of the DeepSpeech, you can try the same setup with newer models.
- You need to download 0.7.4 pdmm file
- If you want, you can also download and load scorer provided by Mozilla, scorer
Now, the virtual environment is ready, the next step is running the project. For your convenience, I provided a sample .wav file so you can test your setup if it's working. Also, you can download audio files from here
```
 $ python run.py 					# This will read audio files from the *audio_locations.txt* file.
 $ python run.py -a audio_files/sample.wav 		# This will only run inference on this input .wav file
```

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
.gitignore		.gitignore
README.md		README.md
audio_locations.txt		audio_locations.txt
download.py		download.py
requirements.txt		requirements.txt
run.py		run.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.gitignore

.gitignore

README.md

README.md

audio_locations.txt

audio_locations.txt

download.py

download.py

requirements.txt

requirements.txt

run.py

run.py

Repository files navigation

YouTube Transcriber

Installation

About

Releases

Packages

Languages

erolrecep/Transcriber

Folders and files

Latest commit

History

Repository files navigation

YouTube Transcriber

Installation

About

Resources

Stars

Watchers

Forks

Languages