Skip to content

Commit

Permalink
Update tafrigh version in Google Colab notebook
Browse files Browse the repository at this point in the history
  • Loading branch information
AliOsm committed Jul 1, 2024
1 parent bff5f04 commit cf8e30e
Showing 1 changed file with 29 additions and 20 deletions.
49 changes: 29 additions & 20 deletions colab_notebook.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -17,7 +17,7 @@
" <p>مرحبًا بك في تفريغ لتفريغ المواد الصوتية والمرئية باستخدام تقنيات الذكاء الاصطناعي. لاستخدام تفريغ:</p>\n",
"\n",
" <ol>\n",
" <li>قم بإدخال روابط المواد المطلوب تحويلها من منصة YouTube في حقل \"urls\" وتأكد من فصلها بمسافة، أو قم بترك الحقل فارغًا لتفريغ المواد التي قمت برفعها</li>\n",
" <li>قم بإدخال روابط المواد المطلوب تحويلها من منصة YouTube أو أي منصة أخرى في حقل \"urls\" وتأكد من فصلها بمسافة، أو قم بترك الحقل فارغًا لتفريغ المواد التي قمت برفعها</li>\n",
" <li>(اختياري) قم بتحديد أقل عدد من الكلمات في كل جزء من أجزاء التفريغ. يؤثر هذا في طول الأجزاء التي سيتم تفريغها</li>\n",
" <li>\n",
" إذا كنت تريد استخدام مجموعة نماذج Whisper:\n",
Expand Down Expand Up @@ -69,7 +69,7 @@
"print('جارٍ تجهيز بيئة العمل.')\n",
"\n",
"# Setup Tafrigh.\n",
"%pip install -U tafrigh[wit,whisper]==1.1.4 > install_logs.txt\n",
"%pip install -U tafrigh[wit,whisper]==1.5.0 > install_logs.txt\n",
"\n",
"# Start: Quick fix related to Colab, HuggingFace, and faster-whisper.\n",
"!apt install libcublas11 > fix_logs.txt\n",
Expand Down Expand Up @@ -134,24 +134,33 @@
" print('جارٍ تحويل المواد إلى نصوص باستخدام نماذج Whisper.')\n",
"\n",
"config = Config(\n",
" urls_or_paths=list(map(str.strip, urls.split(' '))) if len(urls.strip()) else ['.'],\n",
" skip_if_output_exist=False,\n",
" playlist_items='',\n",
" verbose=False,\n",
" model_name_or_path=model,\n",
" task='transcribe',\n",
" language=language,\n",
" use_faster_whisper=True,\n",
" beam_size=5,\n",
" ct2_compute_type='default',\n",
" wit_client_access_tokens=[wit_api_key],\n",
" max_cutting_duration=max_cutting_duration,\n",
" min_words_per_segment=min_words_per_segment,\n",
" save_files_before_compact=False,\n",
" save_yt_dlp_responses=False,\n",
" output_sample=0,\n",
" output_formats=['txt', 'srt'],\n",
" output_dir=output_dir,\n",
" input=Config.Input(\n",
" urls_or_paths=list(map(str.strip, urls.split(' '))) if len(urls.strip()) else ['.'],\n",
" skip_if_output_exist=False,\n",
" playlist_items='',\n",
" download_retries=3,\n",
" verbose=False,\n",
" ),\n",
" whisper=Config.Whisper(\n",
" model_name_or_path=model,\n",
" task='transcribe',\n",
" language=language,\n",
" use_faster_whisper=True,\n",
" beam_size=5,\n",
" ct2_compute_type='default',\n",
" ),\n",
" wit=Config.Wit(\n",
" wit_client_access_tokens=wit_api_key.split(),\n",
" max_cutting_duration=max_cutting_duration,\n",
" ),\n",
" output=Config.Output(\n",
" min_words_per_segment=min_words_per_segment,\n",
" save_files_before_compact=False,\n",
" save_yt_dlp_responses=False,\n",
" output_sample=0,\n",
" output_formats=['txt', 'srt'],\n",
" output_dir=output_dir,\n",
" ),\n",
")\n",
"\n",
"deque(farrigh(config), maxlen=0)\n",
Expand Down

0 comments on commit cf8e30e

Please sign in to comment.