79591854

Date: 2025-04-25 05:29:41
Score: 1

In an ADF data flow, I can see the expected output in the data preview section. However, when I run the pipeline and the final file is saved in Azure Blob Storage, I see duplicate records for some reason. Some records are duplicated 14 times, some 10 times, some 7 times, and so on. I tried tweaking the partition settings as well, but to no avail.

For the issue you are facing, I tried to find a workaround. You can try the following workflow to identify the duplicates. I created a data flow and added a duplicate record on the source side.

I added an Aggregate transformation to count how many times each record occurs.

After that, I added a Conditional Split transformation. I sent the duplicate records to one Blob Storage container; in that output you can see that Alice is a duplicate record. I sent the distinct records to a different Blob Storage container; in the data preview you can see that only distinct records appear.
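The aggregate-then-split logic above can be sketched in plain Python to make the routing rule explicit. This is only an illustration of the idea, not ADF data flow script: the sample rows, the `id` and `name` columns, and the function name `split_by_duplicate_count` are all hypothetical, and it assumes a record counts as a duplicate when every column matches.

```python
from collections import Counter

# Hypothetical sample rows standing in for the data flow source.
rows = [
    {"id": 1, "name": "Alice"},
    {"id": 2, "name": "Bob"},
    {"id": 1, "name": "Alice"},
    {"id": 3, "name": "Carol"},
]

def split_by_duplicate_count(rows):
    # Aggregate step: count how many times each full record occurs.
    counts = Counter(tuple(sorted(row.items())) for row in rows)
    duplicates, distinct = [], []
    # Conditional split step: route each grouped record by its count.
    for key, count in counts.items():
        record = dict(key)
        (duplicates if count > 1 else distinct).append(record)
    return duplicates, distinct

duplicates, distinct = split_by_duplicate_count(rows)
print("duplicates:", duplicates)
print("distinct:", distinct)
```

Because the grouping happens before the split, each record appears once in whichever output it is routed to, so the duplicates output lists Alice a single time rather than once per copy.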

Reasons:
  • Long answer (-0.5):
  • No code block (0.5):
  • Low reputation (1):
Posted by: Shraddha Pore