Power Query: The Delete Dilemma
22 July 2020
Welcome to our Power Query blog. This week. I look at data from a sheet made up of a number of tables, with a random amount of data that needs to be excluded.
Mary, one of my imaginary salespeople, has sent in a slightly unusual report of the tents available from a particular supplier.
data:image/s3,"s3://crabby-images/35cfd/35cfd3b7dcbd04b679949cb3b0c9e7f39c16fc81" alt=""
I want to put the tent, supplier and report date into a table, and delete the lines at the top (there will be an unknown number of these). I begin by extracting my data to Power Query, using ‘From Table’ on the ‘Get & Transform’ section of the Data tab.
data:image/s3,"s3://crabby-images/7081c/7081c194916d2d1848af41bbf36f523da26e4567" alt=""
The ‘Create Table’ dialog box tries to default to the table I happen to be in, but I need to change this to include all of my data; instead of using ‘From Table’, I will adopt another approach.
data:image/s3,"s3://crabby-images/dff8f/dff8f70baf4a5f3fab2b352450f51c3f7f0798f6" alt=""
I select ‘From Workbook’ in the dropdown from the ‘From File’ option on the ‘New Query’ options, and select my current workbook.
data:image/s3,"s3://crabby-images/5c83d/5c83d5b86be781074f019452fc98f38414856d4e" alt=""
I select the sheet.
data:image/s3,"s3://crabby-images/cc5d2/cc5d250b246199658a7a776239fd1c038e18ddbb" alt=""
I can see that all my data has been included; I can now choose to ‘Transform Data’.
data:image/s3,"s3://crabby-images/6b10b/6b10b39f19b3336bcf96dd7a8db7452534842e83" alt=""
Power Query has automated some steps, but I don’t want to include most of them, so I delete all steps after the ‘Navigation’ step.
data:image/s3,"s3://crabby-images/5d31f/5d31f1c1879cc1cfc875706905d9111ec23d339a" alt=""
I need to remove the top rows, but I want to avoid specifying how many of these rows there are. I plan to delete all rows above the date value. I begin by adding an index from the ‘Add Column’ tab.
data:image/s3,"s3://crabby-images/3d69a/3d69a61ce050a037eb53d70ee7c63609decff442" alt=""
Next, I create a duplicate column of Column3 (which contains the report date value) on the ‘Add Column’ tab.
data:image/s3,"s3://crabby-images/503a5/503a59730e526de45bd70e1bd1f7b579fc3d2c8a" alt=""
In order to get rid of all the data apart from the date, I convert Column3 – Copy to Data Type ‘Date’. This may be done from the ‘Transform’ tab.
data:image/s3,"s3://crabby-images/eacbd/eacbd655378bd4dd260279d2836076a75c25aee8" alt=""
This means that apart from the date, the data values are now either null or Error. If I right-click my column, I can replace the errors with null values.
data:image/s3,"s3://crabby-images/c5075/c5075ffc40aba41028b7227ef24cb9a08026121c" alt=""
This gives me null values, and I can now fill down by right-clicking, choosing to ‘Fill’ and then choosing to ‘Fill Down’. The rows above the date do not matter as I want to delete them.
data:image/s3,"s3://crabby-images/628cd/628cd8f328310f1bbfb60543545715340f569ef2" alt=""
I need to delete the null value rows, so I filter and choose values that are not null.
data:image/s3,"s3://crabby-images/5bebd/5bebd154ba2388c5bbc95c9017845a1fb42bb439" alt=""
This gives me the rows with a date value.
data:image/s3,"s3://crabby-images/147c7/147c739b7fddfc6299b9b1b93163d6125472ca62" alt=""
I have the date column. Now, I want to get the supplier into a column. I create another duplicate of Column3:
data:image/s3,"s3://crabby-images/8e623/8e6234f7993ecd27b974e4fba73aa3bd36a315d6" alt=""
I convert the Data Type on my new column to ‘Date’ because I want to isolate the value with the date.
data:image/s3,"s3://crabby-images/91c0d/91c0d8790a6def5da3269d678ed9bd06d12de44f" alt=""
I create a new conditional column from the ‘Add Column’ tab.
data:image/s3,"s3://crabby-images/1c640/1c6401080f39c232c99c1a7afe09266c823c8921" alt=""
If the value is not null, then I want to put the supplier from Column2 into my new column, which I call Supplier. The values with errors will remain as errors.
data:image/s3,"s3://crabby-images/7cf07/7cf07d561f85050bee43cab2133fdb20ec3e0a49" alt=""
I can now right-click on Supplier and replace the errors with null.
data:image/s3,"s3://crabby-images/bf8a4/bf8a44d1986e7fbbdc83061ccba4cefc37529119" alt=""
Having done this, I can fill down my supplier, as before.
data:image/s3,"s3://crabby-images/32a49/32a49810654702286b0c13e9565661cf4099d602" alt=""
I have one more row to remove, and I can do this by filtering on Column1 to remove null values.
data:image/s3,"s3://crabby-images/12e84/12e840a006fa40218fdec43911c47bb6bfa92727" alt=""
I have one final row to delete, but this time it has values in all columns.
data:image/s3,"s3://crabby-images/6ec86/6ec864146af197f4aceb4c37672f7485d948ed5e" alt=""
The data I need in the top row has already been moved to columns. I could just delete the top row, but instead I choose to identify it by creating a new conditional column.
data:image/s3,"s3://crabby-images/a0776/a0776576e4b5bde32afbe07cc68280b838eac5f9" alt=""
I choose to create a new column which will indicate a row should be deleted if the value in Supplier matches the value in Column2.
data:image/s3,"s3://crabby-images/01645/01645042642022ef52aeb08bab1251e237d543bd" alt=""
I can then filter the Delete Me column to remove everything that isn’t null.
data:image/s3,"s3://crabby-images/944c4/944c447be221e55835fc4e5c2309f59b6d212dc4" alt=""
I now consider the column headings, but first I want to remove Column3 – Copy1 because the errors might cause problems. I also remove my other temporary columns at the same time.
data:image/s3,"s3://crabby-images/6dcd8/6dcd8e768b819da3352f6e02417079eb76043c6e" alt=""
I can now promote the first row to headers from the Transform tab.
data:image/s3,"s3://crabby-images/58477/58477cf110501424f700c3476d60818255054b28" alt=""
I just need to tweak the supplier and date headings.
data:image/s3,"s3://crabby-images/c9360/c93605249bd8655fb917d00994bf067a87e3e106" alt=""
My final step is to ‘Close & Load’ my data into Excel.
data:image/s3,"s3://crabby-images/e1ca3/e1ca3ef9c7306eb1cf2c793b42a9f59bf0837b43" alt=""
If I go back and alter the data by adding some extra rows to be deleted, I can check my transformation.
data:image/s3,"s3://crabby-images/2d29d/2d29dbd302873adf002101d1ae527581f4facb09" alt=""
I create another line at the top and an extra space before the tent data. Then, I refresh the query.
data:image/s3,"s3://crabby-images/36e97/36e97f1186b1cc2322952d2dc60952fa20c52478" alt=""
The extra lines appear in the source.
data:image/s3,"s3://crabby-images/09478/09478a7eadd6ab1c69c226ff6f7a1d99a47e0d2b" alt=""
The data is still transformed correctly.
Come back next time for more ways to use Power Query!