Less Columns, More Rows = More Speed!

August 2, 2011 at 9:44 am

I’d seen the “minimize column count” scripture before, but never took it to heart. I had actually gone to the trouble of taking a data set with a single value column and breaking it out into 14 columns (360k rows) for a report redesign.
After reading this I gave in, relinked to the old data source (4.9 million records), and fixed the measures. Refresh time dropped from 8 minutes to a little over 4. In summary, all that redesign work I had done made my refresh take about twice as long. Seems about par for the course.

Thanks for beating the point into my head.

August 2, 2011 at 3:01 pm

Nothing new for us data warehouse thick heads. Add a measure type dimension to your fact table, with members such as Actual, Budget, and ForecastX through ForecastN. That will reduce the number of measures.

Thomas Ivarsson

August 7, 2011 at 4:50 am

You’ve suggested in previous posts that sorting the data results in even better compression and performance. Just wondered whether you tried sorting by ValueType before loading and, if so, whether it made a noticeable difference in performance?
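
As a rough intuition for why sort order could matter, here is a toy Python sketch. It assumes a simple run-length view of a column, where each unbroken run of identical values is stored once; a real engine's compression is more involved, so this is an illustration and not a benchmark. The `run_count` helper and the sample values are hypothetical.

```python
import random
from itertools import groupby


def run_count(values):
    """Count runs of consecutive identical values (fewer runs = less to store)."""
    return sum(1 for _ in groupby(values))


# Hypothetical ValueType column: 3 distinct values, 3,000 rows, shuffled.
random.seed(0)
value_types = ["Actual", "Budget", "Forecast"] * 1000
random.shuffle(value_types)

unsorted_runs = run_count(value_types)       # many short runs
sorted_runs = run_count(sorted(value_types))  # one run per distinct value
```

Sorting collapses the column to one run per distinct value, which is the best case for this kind of encoding. Whether that shows up as a measurable speedup in a given model is a separate question that only testing can answer.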

August 16, 2011 at 3:46 am

Like Dereck, I had spent time trying to minimise the number of records I was using at the expense of more columns, despite my brain telling me that it wasn’t quite right. I’ve reverted to a “skinny” structure, tuned up some measures, and ensured that sorting is applied to the dataset on the way in as far as possible, and performance is excellent.

Unfortunately, when deployed in SharePoint, I’m not getting the same response but maybe that’s due to network or some other factors. At least I’ve got another challenge!

Brilliant Rob. Thanks for another excellent post.

May 6, 2012 at 5:57 pm

Hi, how does that work with different data types, for example percentages, dollars, time durations, etc.? When a pivot table is created and you need to format each data type, would this work?

May 9, 2012 at 6:06 pm

Hello, I am still confused about how this was done. Was the SQL or code changed behind the scenes to combine the columns? I have very large files with 15 to 20 columns, so this would be very useful. Will the dataset still get new data from the database automatically, or will this break the connection?

Thanks for your help!

September 13, 2012 at 2:52 am

Hi Andrea,
The video https://www.youtube.com/watch?v=xmqTN0X-AgY shows how to normalize data in Excel.
I hope it helps.
Juergen
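
For readers who want to see the reshaping itself, here is a minimal sketch in plain Python. It is not the method used in the post or in the video. The `unpivot` helper, the column names, and the sample rows are all hypothetical. It turns each wide row into one tall row per value column while keeping the identifying columns.

```python
def unpivot(rows, id_columns, type_column="ValueType", value_column="Value"):
    """Turn each wide row into one tall row per non-id column."""
    tall = []
    for row in rows:
        ids = {col: row[col] for col in id_columns}
        for name, value in row.items():
            if name in id_columns:
                continue  # id columns are repeated on every tall row instead
            tall.append({**ids, type_column: name, value_column: value})
    return tall


# Hypothetical wide table: 2 id columns + 3 value columns.
wide = [
    {"Store": "A", "Date": "2012-09-01", "Sales": 10, "Returns": 1, "Discounts": 2},
    {"Store": "B", "Date": "2012-09-01", "Sales": 7, "Returns": 0, "Discounts": 1},
]

# Result: 6 rows of 4 columns (Store, Date, ValueType, Value).
tall = unpivot(wide, id_columns=("Store", "Date"))
```

If the reshaping is done in the source query and not by hand, the connection to the database should still refresh normally, but that depends on how the data is loaded.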

August 11, 2016 at 6:55 am

A much more efficient way:
https://www.excelguru.ca/blog/2013/11/14/un-pivoting-data-in-power-query/

August 11, 2016 at 6:27 pm

Yes. When this article was written, we didn’t have Power Query 🙂

January 11, 2014 at 11:55 am

BEST example I’ve seen! I’m glad you covered this.

August 12, 2015 at 2:06 pm

Question: If you had a table with 20 columns, would it be better to separate that table into 2 tables, one with 10 columns and the other with 11 (sharing a primary key)? Or could this actually make it slower?

October 8, 2016 at 9:47 am

Does anyone have an answer to adamflath’s query above? “Question: If you had a table with 20 columns, would it be better to separate that table into 2 tables, one with 10 columns and the other with 11 (sharing a primary key)? Or could this actually make it slower?”
