add support for raw text file as input #49

mobarski · 2024-10-13T09:54:39Z

I've added the ability to use raw text file as the input as it makes podcastfy much more versatile.

Before we could either use a list of urls, for which the content generator was used, or the final transcript. There was to ability to use raw text for the content generator. This PR adds raw_text param to the process_content function and raw_file to the CLI (I've tried to keep the naming convention).

brumar · 2024-10-13T20:43:37Z

@souzatharsis @mobarski raw_text absolutely needs to be supported, I think local files already does support local files which are real urls in the end. but I think the signature of generate_podcast starts to look weird. As "urls" already do some magic by determining the parsing strategy, can we go one step further and replace url kwarg by something more generic like "content". It would do a best effort do determine content_type. Alternatively, it could be possible to pass something explicit like contents=[Content(type=podcastify.types.raw, target="raw stuff"), Content(type=podcastify.types.localfile, target "/..."]

Maybe this very simple content abstration would be useful in my design PR too @souzatharsis ?

souzatharsis · 2024-10-13T23:54:38Z

podcastfy/client.py

@@ -97,6 +103,9 @@ def main(
 	transcript: typer.FileText = typer.Option(
 		None, "--transcript", "-t", help="Path to a transcript file"
 	),
+	raw_file: typer.FileText = typer.Option(
+		None, "--raw-file", "-r", help="File containing raw text"


this is a bit confusing: the user is passing raw text but the flag is raw file. It should be consistent. Consider renaming the flag to simply --text

souzatharsis · 2024-10-13T23:56:23Z

podcastfy/client.py

@@ -182,6 +198,7 @@ def generate_podcast(
 	Args:
 		urls (Optional[List[str]]): List of URLs to process.
 		url_file (Optional[str]): Path to a file containing URLs, one per line.
+		raw_text (Optional[str]): Text to process.


how about calling it simply text, instead of raw_text

souzatharsis · 2024-10-13T23:57:39Z

podcastfy/client.py

 			else:
 				combined_content = ""  # Empty string if no URLs provided

 			# Generate Q&A content
 			random_filename = f"transcript_{uuid.uuid4().hex}.txt"
 			transcript_filepath = os.path.join(config.get('output_directories')['transcripts'], random_filename)
 			qa_content = content_generator.generate_qa_content(
-				combined_content, image_file_paths=image_paths or [], output_filepath=transcript_filepath
+				combined_content,
+				#image_file_paths=image_paths or [],  # FIXME


Wouldn't commenting this prevent podcastfy from generating audio from images?

souzatharsis

minor suggested update in CLI / package args

major question regarding line removed that would prevent podcastfy from generating audio from images

souzatharsis · 2024-10-14T00:01:47Z

Additionally, @brumar has a good point. The user interface, its args, their combination and validation are getting complicated. However, I'd suggest adding this suggested option (of input raw text) and then next and separately refactor the code to improve it following @brumar's recommendations.

add support for raw text file as input

5580a54

souzatharsis reviewed Oct 13, 2024

View reviewed changes

souzatharsis requested changes Oct 13, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add support for raw text file as input #49

add support for raw text file as input #49

mobarski commented Oct 13, 2024

brumar commented Oct 13, 2024

souzatharsis Oct 13, 2024

souzatharsis Oct 13, 2024

souzatharsis Oct 13, 2024

souzatharsis left a comment

souzatharsis commented Oct 14, 2024

add support for raw text file as input #49

Are you sure you want to change the base?

add support for raw text file as input #49

Conversation

mobarski commented Oct 13, 2024

brumar commented Oct 13, 2024

souzatharsis Oct 13, 2024

Choose a reason for hiding this comment

souzatharsis Oct 13, 2024

Choose a reason for hiding this comment

souzatharsis Oct 13, 2024

Choose a reason for hiding this comment

souzatharsis left a comment

Choose a reason for hiding this comment

souzatharsis commented Oct 14, 2024