Power Query: Project Population – Part 10
28 June 2023
Welcome to our Power Query blog. This week, I transform a new query from a public data source.
I have found some information on population growth provided by The World Bank, which I am using as an example of how to transform real-life data.
data:image/s3,"s3://crabby-images/63fd1/63fd1893817cc7b61d2876f28f29deb153889c82" alt=""
I have been transforming the data and last week, I improved the steps created automatically by Power Query when I used ‘Merge Queries’ to combine the data from Country and Country-Series.
data:image/s3,"s3://crabby-images/babbe/babbe20ed4b87be93f9b5ed20819437a0628a347" alt=""
This time, I will move on to the Data query and prepare this data.
data:image/s3,"s3://crabby-images/3e0d5/3e0d574514b5c5b98f78f152a6fafe9155a0b1e2" alt=""
This query appears to contain information about the population of each country. I note that although I have a column with the heading ‘Country Code’ in the first row, when I look at the values underneath ‘Country Name’, I see that the first rows are for a region, not a single country.
I can look at the filter on Column1 to see what values are in this column:
data:image/s3,"s3://crabby-images/5dbef/5dbef6a27289a1b578eb0167ba276935c9b41f27" alt=""
I have a mixture of data, some for individual countries and some for regions. When I come to merge this data, I will need to ensure I don’t ignore the rows in Data that do not match a country on Country.
I start by cleaning up the Data query. I need to promote the first row to the headings, which I can do from the ‘Home’ tab:
data:image/s3,"s3://crabby-images/82f7f/82f7f613ab9da99893ea2cf89d1fc54613dbfe88" alt=""
This also generates a ‘Changed Type1’ step for me:
data:image/s3,"s3://crabby-images/9a681/9a6814889f9f4f98ce772e4f8cd6d80c786b1e6e" alt=""
I don’t need the earlier ‘Changed Type’ step, so I can delete it using the cross icon next to it:
data:image/s3,"s3://crabby-images/f78bf/f78bf7663c59575f6effa1afef9a5e14fc6a0b84" alt=""
I am prompted to confirm this choice:
data:image/s3,"s3://crabby-images/afc60/afc6031e1a049f84c8bcb3cda0483fa5865aef1d" alt=""
If this were a step that affected the name of a column, or if the subsequent steps relied on the data type being changed, then this could give me problems, but since I know that it will not affect the later steps I can go ahead. Power Query does not have an ‘undo’ option. Therefore, if I were uncertain about whether deleting a step would cause problems, I could make a copy of the query by right-clicking in the Queries pane and choosing to duplicate the query before proceeding.
Having deleted the step, a quick way to see all the columns without scrolling is to use the ‘Choose Columns’ feature on the Home tab:
data:image/s3,"s3://crabby-images/24dc0/24dc0e62d4ff0dc8cc71c9ba0e833f8b4b2a3302" alt=""
This allows me to see the complete list:
data:image/s3,"s3://crabby-images/f293a/f293a4d8a9e9decd427c111c58720a0079e46b64" alt=""
I have the Country Name, Country Code, IndicatorName, Indicator Code and then the years from 1960 projected to 2050. I will only uncheck the Indicator Code column, as it’s not giving me any useful information.
data:image/s3,"s3://crabby-images/19fb1/19fb184b0c7e89b0af5580e383fb23e6a2d83568" alt=""
Note that the step created is ‘Removed Other Columns’ as it is safer to keep the columns I need, since if any columns are added to Data, my query will be unaffected.
Looking at the data, I can see that there are some categories where all the year columns have null amounts:
data:image/s3,"s3://crabby-images/b335c/b335cfb06719d7b2347ff5923abc4e401a962304" alt=""
I can remove these rows. I start by selecting all the year columns. The easiest way to select all the years is to select 1960, scroll across to 2050, and then press SHIFT whilst I select column 2050, to select the complete range of years.
data:image/s3,"s3://crabby-images/03144/03144bb4b6d280f4effffd429c0620183d97e210" alt=""
Next, I merge the columns from the ‘Add Column’ tab:
data:image/s3,"s3://crabby-images/7c71f/7c71f1e7a8f97c2844754b3e10a8b7bcfb7a14ae" alt=""
I can take the default settings in the dialog box, as I will be deleting this column soon.
data:image/s3,"s3://crabby-images/828a0/828a0f61fdd07c1465811edee1f5f12e45f5fb71" alt=""
This gives me one column, which I can filter:
data:image/s3,"s3://crabby-images/d3b4c/d3b4c200ab45d4cef7ffd7beac59a87d26dfc814" alt=""
I choose to ‘Remove Empty’:
data:image/s3,"s3://crabby-images/c403f/c403fc4fbf37a65d8f387bf3dd1838f6833ecc5f" alt=""
I can now delete the Merged column.
data:image/s3,"s3://crabby-images/65cd0/65cd0cde81c08d39d461eda36d2d7f1f6965e4ce" alt=""
Next time, I will look at some ways to combine my Data and Country queries.
Come back next time for more ways to use Power Query!