feat: Add content filtering #33

Closed · wants to merge 5 commits into from

Conversation

@KavithaSiva (Contributor) commented Jul 26, 2024

Proposed Content filtering API:

Option 1: (Current approach)
This is based on the assumption that the orchestration service only respects the first configuration provided for a filter service provider.

client.chatCompletion({
  deploymentConfiguration: { deploymentId: 'id' },
  llmConfig: {
    model_name: 'gpt-35-turbo-16k',
    model_params: { max_tokens: 50, temperature: 0.1 }
  },
  prompt: {
    template: [{ role: 'user', content: 'Hello!' }],
    messages_history: [{ role: 'system', content: 'Test system message' }]
  },
  filterConfig: {
    input: { azureContentSafety: { Hate: 0, SelfHarm: 2 } },
    output: {
      azureContentSafety: { Sexual: 4, Violence: 6 },
      someOtherFilterServiceProvider: { Hate: 0, SelfHarm: 2 }
    }
  }
});

Other options:
Option 2:

client.chatCompletion({
  deploymentConfiguration: { deploymentId: 'id' },
  llmConfig: {
    model_name: 'gpt-35-turbo-16k',
    model_params: { max_tokens: 50, temperature: 0.1 }
  },
  prompt: {
    template: [{ role: 'user', content: 'Hello!' }],
    messages_history: [{ role: 'system', content: 'Test system message' }]
  },
  filterConfig: {
    input: { azureContentSafety: { Hate: 0, SelfHarm: 2 } },
    output: [
      { azureContentSafety: { Sexual: 4, Violence: 6 } },
      { someOtherFilterServiceProvider: { Hate: 0, SelfHarm: 2 } }
    ]
  }
});

Option 3:

client.chatCompletion({
  deploymentConfiguration: { deploymentId: 'id' },
  llmConfig: {
    model_name: 'gpt-35-turbo-16k',
    model_params: { max_tokens: 50, temperature: 0.1 }
  },
  prompt: {
    template: [{ role: 'user', content: 'Hello!' }],
    messages_history: [{ role: 'system', content: 'Test system message' }]
  },
  filterConfig: {
    input: { type: 'azureContentSafety', config: { Hate: 0, SelfHarm: 2 } },
    output: [
      { type: 'azureContentSafety', config: { Sexual: 4, Violence: 6 } },
      { type: 'someOtherFilterServiceProvider', config: { Hate: 0, SelfHarm: 2 } }
    ]
  }
});

@KavithaSiva (Contributor, Author) commented Jul 26, 2024

Even with formatting, the proposed API still looks verbose. What do you think, is this an issue?

client.chatCompletion({
  deploymentConfiguration: { deploymentId: 'id' },
  llmConfig: {
    model_name: 'gpt-35-turbo-16k',
    model_params: { max_tokens: 50, temperature: 0.1 }
  },
  prompt: {
    template: [{ role: 'user', content: 'Hello!' }],
    messages_history: [{ role: 'system', content: 'Test system message' }]
  },
  filterConfig: {
    input: {
      AzureContentSafety: {
        Hate: 0,
        SelfHarm: 2
      }
    },
    output: [
      {
        AzureContentSafety: {
          Sexual: 4,
          Violence: 6
        }
      },
      {
        SomeOtherFilterServiceProvider: {
          Hate: 0,
          SelfHarm: 2,
          Sexual: 4,
          Violence: 6
        }
      }
    ]
  }
});

@ChristophMatthies (Member) commented:

Suggestion:

filterConfig: { 
    input: { azureContentSafety: { Hate: 0, SelfHarm: 2 }},
    output: {
      azureContentSafety: { Sexual: 4, Violence: 6 },
      someOtherFilterServiceProvider: { Hate: 0, SelfHarm: 2 }
    }
}

@marikaner (Contributor) commented Jul 30, 2024

To be honest, I am not happy with any of these options. If we simplify whatever is passed in the config, I think it is confusing if it stays this close to the original but is still different; that opens up potential for errors. I don't know whether this was already discussed before, but I would suggest one of the following options:

  • Keep the original spec from the orchestration service. This has the disadvantage that it might be too verbose and difficult to use.
  • Break out of the object structure and pass it through a more high-level API. I could imagine something in this direction:
createFilterConfig({
  input: createInputFilter({
    type: 'azureContentSafety',
    config: { Hate: 0, SelfHarm: 2 }
  }),
  output: createOutputFilter({
    type: 'azureContentSafety',
    config: { Sexual: 4, Violence: 6 }
  })
})

or

createFilterConfig(
  createInputFilter({
    type: 'azureContentSafety',
    config: { Hate: 0, SelfHarm: 2 }
  }),
  createOutputFilter({
    type: 'azureContentSafety',
    config: { Sexual: 4, Violence: 6 }
  })
)
  • allow both (my favorite); see the sketch below for one way this could look.
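
A rough sketch of how "allow both" could look, assuming the convenience builders simply return the same shape as the vanilla orchestration spec. All names here (ContentFilter, createInputFilter, createOutputFilter, createFilterConfig) are hypothetical and not taken from the actual SDK:

// Hypothetical shape mirroring the orchestration service's filter spec.
interface ContentFilter {
  type: string;
  config: Record<string, number>;
}

interface FilterConfig {
  input?: ContentFilter;
  output?: ContentFilter;
}

// The builders just pass the vanilla structure through, so hand-written
// spec objects and built objects stay interchangeable.
const createInputFilter = (filter: ContentFilter): ContentFilter => filter;
const createOutputFilter = (filter: ContentFilter): ContentFilter => filter;
const createFilterConfig = (config: FilterConfig): FilterConfig => config;

// Both of these produce an equivalent configuration:
const viaHelpers = createFilterConfig({
  input: createInputFilter({ type: 'azureContentSafety', config: { Hate: 0, SelfHarm: 2 } }),
  output: createOutputFilter({ type: 'azureContentSafety', config: { Sexual: 4, Violence: 6 } })
});

const viaVanillaSpec: FilterConfig = {
  input: { type: 'azureContentSafety', config: { Hate: 0, SelfHarm: 2 } },
  output: { type: 'azureContentSafety', config: { Sexual: 4, Violence: 6 } }
};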

@jjtang1985 (Contributor) commented Aug 5, 2024

I would have the following:

  • [flexibility] Vanilla content filter payload structure (ref), which should be used by the SDK internally; this internal function might also be used by users directly, if they like.
  • [convenience] High-level API, so users can pass a ContentFilterConfig object (we also need to offer one or more ways to build the object, e.g. a constructor or factory).

This looks similar to what Marika mentioned.

@KavithaSiva (Contributor, Author) commented Aug 5, 2024

I would have the following:

  • Vanilla orchestration service payload structure (ref), which should be used by SDK internally and this internal function might also be used by the user, if they like.
  • High level API, so user can pass a ContentFilterConfig object. (we also need to offer one or multiple ways to build the object)

This looks similar to what Marika mentioned.

I am assuming then that we can add separate config objects for both llmConfig and prompt as well.
Clarified in a call: in this PR we only look at the content filtering module.

Comment on lines +54 to +55
input.filterConfig?.input &&
input.filterConfig?.output &&

Member:

I think this is an issue, because setting only one of the two, or even setting none, is allowed. But the current code leaves the filter config empty if one of the two is not defined, right?
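
One way the mapping could treat the two sides independently, as a minimal sketch only; mapFilterConfig and the types here are placeholders, not the actual SDK names:

// Placeholder types, for illustration only.
interface FilterServiceProvider {
  [provider: string]: Record<string, number>;
}

interface FilterConfig {
  input?: FilterServiceProvider;
  output?: FilterServiceProvider;
}

// Map whichever side was provided: only input, only output, both, or none.
function mapFilterConfig(filterConfig?: FilterConfig): FilterConfig | undefined {
  if (!filterConfig) {
    return undefined;
  }
  const mapped: FilterConfig = {};
  if (filterConfig.input) {
    mapped.input = filterConfig.input;
  }
  if (filterConfig.output) {
    mapped.output = filterConfig.output;
  }
  return Object.keys(mapped).length ? mapped : undefined;
}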

/**
* Input configuration for filtering provider.
*/
input?: FilterServiceProvider;

Member:

We could consider:

Suggested change
input?: FilterServiceProvider;
input?: FilterServiceProvider | [FilterServiceProvider];

That would also allow:

filterConfig: {
  input: [] as FilterServiceProvider
}

Not sure how best to implement this internally, but using if (Array.isArray(config.input)) was what Copilot suggested to me 😉

Only downside I can think of is that the type signature becomes a bit more complex.
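
If such a union type were adopted, the internal normalization could stay small. A rough sketch, assuming a placeholder FilterServiceProvider type and using a plain array here instead of the single-element tuple, for simplicity:

// Placeholder type, for illustration only.
interface FilterServiceProvider {
  [provider: string]: Record<string, number>;
}

// Accept either a single provider config or an array of them and
// always work with an array internally.
function normalizeFilters(
  config?: FilterServiceProvider | FilterServiceProvider[]
): FilterServiceProvider[] {
  if (!config) {
    return [];
  }
  return Array.isArray(config) ? config : [config];
}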

@@ -0,0 +1,15 @@
/*

Member:

Is this really generated code? I am wondering why it is only being added now.

@KavithaSiva closed this on Aug 8, 2024

@KavithaSiva (Contributor, Author) commented:

The latest state as per Junjie's recommendations is in #58.

@KavithaSiva deleted the feat/add-content-filtering branch on August 16, 2024 at 15:01