Reports

In ADF dataflow, I can see the expected output in the data preview section. However, when I run the pipeline, the final file is saved in Azure Blob Storage. I see that duplicate records for some reason. Some records are 14 times duplicate, some 10 times, some 7 times etc. I tried to tweak the partition settings as well, but to no avail.

For the issue you are facing I tried to find a work around. You can try following workflow to find. I have created one dataflow. In this source side I have added duplicate record. enter image description here

I have performed one aggregate condition to check the count of duplicate records. enter image description here

Then post that I have added conditional split. enter image description here I have sent the output of duplicate records to blob storage container. In the output you can check Alice is a duplicate record. output of distinct I have sent to blob storage different container in the data preview you can see only distinct records are visible.

79591854