Add CodexBackup class #32

elisno · 2025-02-11T22:53:22Z

Followup to #31.

Files copied from #11.

Specifically, from: 49f9a9d

axl1313 · 2025-02-12T02:07:23Z

src/cleanlab_codex/codex_backup.py

+        if self._primary_system is not None:
+            self._backup_handler(
+                codex_response=codex_response,
+                primary_system=self._primary_system,
+            )


If I was creating a RAG system for a production app, I think I'd be much more likely to implement handling replacement of original response either just within the chat method or as a separate instance method of the class. Without this CodexBackup class, I'd probably do something along the lines of:

class RAGChatWithCodexBackup(RAGChat): def __init__(self, client: OpenAI, assistant_id: str, codex_access_key: str): super().__init__(client, assistant_id) self._codex_project = Project.from_access_key(access_key) def replace_latest_message(self, new_message: str) -> None: <code from your handle_backup_for_openai_assistants method> ... def chat(self, user_message: str) -> str: response = super().chat(user_message) codex_response: str | None = None if is_bad_response(response=response, query=user_message): codex_response = self._codex_project.query(user_message) if codex_response is None: return response self.replace_latest_message(codex_response) return codex_response

Very unlikely that I'd define a method that fits the expected signature for _backup_handler

Using the CodexBackup class you've defined without providing backup_handler and primary_system, I'd probably end up with the following:

class RAGChatWithCodexBackup(RAGChat): def __init__(self, client: OpenAI, assistant_id: str, codex_access_key: str): super().__init__(client, assistant_id) self._codex_backup = CodexBackup.from_project(Project.from_access_key(codex_access_key)) def replace_latest_message(self, new_message: str) -> None: <code from your handle_backup_for_openai_assistants method> ... def chat(self, user_message: str) -> str: response = super().chat(user_message) backup_response: str | None = self._codex_backup.run(response=response, query=user_message) if backup_response is not None and backup_response != response: self.replace_latest_message(backup_response) return backup_response return response

which does save a couple lines of code, but not much.

Wondering:
a) Whether it's worth trying to do the backup_handler stuff within this class (maybe could modify the expected function signature to allow for class instance methods).
b) Whether it's worth having this class right now (doesn't really save much code). But could potentially make the second example a little cleaner by modifying CodexBackup.run() to return a pair of backup_response, codex_used and then could just do if codex_used: self.replace_latest_message(backup_response)

Discussed on slack. Will leave this implementation for now to avoid extra work of updating tutorials before soft launch deadline.

I'm putting this PR on hold then.
Moving the code into the tutorial now.

axl1313 · 2025-02-12T04:51:17Z

src/cleanlab_codex/codex_backup.py

+        tlm: Optional[TLM] = None,
+        is_bad_response_kwargs: Optional[dict[str, Any]] = None,


What's the reasoning for tlm being a separate argument but the rest of the arguments for is_bad_response being grouped into is_bad_response_kwargs?

This should now be is_bad_response_config: BadResponseDetectionConfig. You're right, the tlm argument (and fallback_answer) should be fetched from the config instead.

axl1313 · 2025-02-12T05:12:40Z

src/cleanlab_codex/codex_backup.py

+        if self._primary_system is not None:
+            self._backup_handler(
+                codex_response=codex_response,
+                primary_system=self._primary_system,
+            )


Discussed on slack. Will leave this implementation for now to avoid extra work of updating tutorials before soft launch deadline.

elisno added 2 commits February 11, 2025 22:34

move TLM protocol definition into types directory

a1f03a5

add CodexBackup class

05f7a09

elisno requested a review from axl1313 February 11, 2025 22:53

formatting

bdf8872

axl1313 reviewed Feb 12, 2025

View reviewed changes

jwmueller marked this pull request as draft February 13, 2025 06:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add CodexBackup class #32

Add CodexBackup class #32

elisno commented Feb 11, 2025

axl1313 Feb 12, 2025

axl1313 Feb 12, 2025

axl1313 Feb 12, 2025

axl1313 Feb 12, 2025

elisno Feb 12, 2025

axl1313 Feb 12, 2025

elisno Feb 12, 2025

axl1313 Feb 12, 2025

		tlm: Optional[TLM] = None,
		is_bad_response_kwargs: Optional[dict[str, Any]] = None,

Add CodexBackup class #32

Are you sure you want to change the base?

Add CodexBackup class #32

Conversation

elisno commented Feb 11, 2025

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment