|
| 1 | +# dbt_zendesk v0.17.0 |
| 2 | + |
| 3 | +## New model ([#161](https://github.com/fivetran/dbt_zendesk/pull/161)) |
| 4 | +- Addition of the `zendesk__document` model, designed to structure Zendesk textual data for vectorization and integration into NLP workflows. The model outputs a table with: |
| 5 | + - `document_id`: Corresponding to the `ticket_id` |
| 6 | + - `chunk_index`: For text segmentation |
| 7 | + - `chunk`: The text chunk itself |
| 8 | + - `chunk_tokens_approximate`: Approximate token count for each segment |
| 9 | +- This model is currently disabled by default. You may enable it by setting the `zendesk__unstructured_enabled` variable to `true` in your `dbt_project.yml`.
| 10 | + - This model was developed with chunk sizes limited to approximately 5000 tokens for use with OpenAI. You can change this limit by setting the `zendesk_max_tokens` variable in your `dbt_project.yml`.
| 11 | + - See the README section [Enabling the unstructured document model for NLP](https://github.com/fivetran/dbt_zendesk/blob/main/README.md#enabling-the-unstructured-document-model-for-nlp) for more information. |
| 12 | + |
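As a minimal sketch of the two settings described above, the variables could be set in `dbt_project.yml` as shown here. The variable names come from this changelog. Placing them at the top level of `vars` and the example value `5000` are assumptions for illustration, so check the linked README section for the recommended scoping.

```yaml
# dbt_project.yml (sketch; scoping and values are assumptions)
vars:
  # Enables the zendesk__document model, which is disabled by default.
  zendesk__unstructured_enabled: true
  # Optional: approximate upper bound on tokens per chunk.
  zendesk_max_tokens: 5000
```

Omitting `zendesk_max_tokens` leaves the package's built-in chunk limit in place.
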
| 13 | +## Breaking Changes (Full refresh required after upgrading) |
| 14 | +- Incremental models running on BigQuery now use a monthly granularity in their `partition_by` logic. This change only impacts BigQuery warehouses and was applied to avoid the common `too many partitions` error some users have experienced when partitioning by day. Partitioning by month decreases the number of partitions created and allows for more performant querying and incremental loads. This change was applied to the following models ([#165](https://github.com/fivetran/dbt_zendesk/pull/165)):
| 15 | + - `int_zendesk__field_calendar_spine` |
| 16 | + - `int_zendesk__field_history_pivot` |
| 17 | + - `zendesk__ticket_field_history` |
| 18 | + |
| 19 | +- In the [dbt_zendesk_source v0.12.0 release](https://github.com/fivetran/dbt_zendesk_source/releases/tag/v0.12.0), the field `_fivetran_deleted` was added to the following models for use in the `zendesk__document` model ([#161](https://github.com/fivetran/dbt_zendesk/pull/161)):
| 20 | + - `stg_zendesk__ticket` |
| 21 | + - `stg_zendesk__ticket_comment` |
| 22 | + - `stg_zendesk__user` |
| 23 | + - If you have already added `_fivetran_deleted` as a passthrough column via the `zendesk__ticket_passthrough_columns` or `zendesk__user_passthrough_columns` variable, you will need to remove this field from the variable or alias it to avoid duplicate column errors.
| 24 | + |
| 25 | +## Bug Fixes |
| 26 | +- Fixed an issue in the `zendesk__sla_policies` model where tickets that were opened and solved outside of scheduled hours were not being reported, specifically for the metrics `requester_wait_time` and `agent_work_time`. |
| 27 | + - Resolved by adjusting the join logic in models `int_zendesk__agent_work_time_business_hours` and `int_zendesk__requester_wait_time_business_hours`. ([#164](https://github.com/fivetran/dbt_zendesk/pull/164), [#156](https://github.com/fivetran/dbt_zendesk/pull/156)) |
| 28 | +- Fixed an issue in the `zendesk__ticket_metrics` model where certain tickets had miscalculated metrics. |
| 29 | + - Resolved by adjusting the join logic in models `int_zendesk__ticket_work_time_business`, `int_zendesk__ticket_first_resolution_time_business`, and `int_zendesk__ticket_full_resolution_time_business`. ([#167](https://github.com/fivetran/dbt_zendesk/pull/167)) |
| 30 | + |
| 31 | +## Under the hood |
| 32 | +- Added integrity validations: |
| 33 | + - Test to ensure `zendesk__sla_policies` and `zendesk__ticket_metrics` models produce consistent time results. ([#164](https://github.com/fivetran/dbt_zendesk/pull/164)) |
| 34 | + - Test to ensure `zendesk__ticket_metrics` contains all the tickets found in `stg_zendesk__ticket`. ([#167](https://github.com/fivetran/dbt_zendesk/pull/167)) |
| 35 | +- Modified the `consistency_sla_policy_count` validation test to group by `ticket_id` for more accurate testing. ([#165](https://github.com/fivetran/dbt_zendesk/pull/165)) |
| 36 | +- Updated casting in joins from timestamps to dates so that the whole day is considered. This produces more accurate results. ([#164](https://github.com/fivetran/dbt_zendesk/pull/164), [#156](https://github.com/fivetran/dbt_zendesk/pull/156), [#167](https://github.com/fivetran/dbt_zendesk/pull/167)) |
| 37 | +- Reduced the number of weeks looked ahead from 208 to 52 to improve performance, as tracking ticket SLAs beyond one year was unnecessary. ([#156](https://github.com/fivetran/dbt_zendesk/pull/156), [#167](https://github.com/fivetran/dbt_zendesk/pull/167))
| 38 | +- Updated seed files to reflect a real world ticket field history update scenario. ([#165](https://github.com/fivetran/dbt_zendesk/pull/165)) |
| 39 | + |
1 | 40 | # dbt_zendesk v0.16.0
|
2 | 41 | ## 🚨 Minor Upgrade 🚨
|
3 | 42 | Although this update is not a breaking change, it will likely impact the output of the `zendesk__sla_policies` and `zendesk__sla_metrics` models. [PR #154](https://github.com/fivetran/dbt_zendesk/pull/154) includes the following changes: