Team-BHP > The Indian Car Scene
Register New Topics New Posts Top Thanked Team-BHP FAQ


Reply
  Search this Thread
211,672 views
Old 6th June 2010, 20:47   #256
Team-BHP Support
 
SmartCat's Avatar
 
Join Date: Jun 2007
Location: Bangalore
Posts: 6,663
Thanked: 45,797 Times

But doc, no car is a "zero". Can't you get rid of this anomaly using a 0.1 to 1 scale?
SmartCat is online now  
Old 6th June 2010, 20:57   #257
Distinguished - BHPian
 
Dr.AD's Avatar
 
Join Date: Apr 2010
Location: Bangalore/Pune
Posts: 1,852
Thanked: 19,594 Times

Quote:
Originally Posted by NutsNBolts View Post
Dr AD, thanks for such a wonderful exercise and explanation. Of course, without the initial spadework by smartcat and echo77, it wouldn't have been achievable. My sincere thanks three of you and a request to mods to make this thread a sticky. This thread doesn't deserve to go un-noticed by people deciding to buy sedans in near future.
Thanks a lot. Yup, it will be great if this thread could be made sticky.

Quote:
Originally Posted by smartcat View Post
But doc, no car is a "zero". Can't you get rid of this anomaly using a 0.1 to 1 scale?
Smartcat, I completely agree with you. A score of "zero" does not make sense. It is a tempting thought to "alter" the method a bit to take care of the zero problem. However, unless we can find a very intuitive and fair method of doing this, just arbitrary alteration will end up losing the simplicity and logic behind doing this.

I will think about this problem and come up with an improvement in a day or two. But for now, I can say that the score of "zero" can be simply interpreted by the reader as "that car came last in at least one of the categories" (note: there can be cars that come last in more than one categories). We all know that no car is a zero, and we may very well go ahead and not just like but even end up buying a car the scores a zero. In my own case, I am in search for a sedan, and I may end up buying a Linea E+ Petrol, even though all logic says it scores a 0 in this analysis.
Dr.AD is online now   (1) Thanks
Old 6th June 2010, 20:57   #258
BHPian
 
arishi1's Avatar
 
Join Date: Apr 2005
Location: Thane - Mumbai
Posts: 305
Thanked: 22 Times

Dr. AD excellent compilation using the correct Statistical model, and as always thanks to the SMART (Smartcat) and Ecko77.

One request to Dr. AD, can you share the excel spreadsheet here.

MODS: Please make this thread sticky, its a wonderful comparison for all future car buyers.
arishi1 is offline  
Old 6th June 2010, 21:06   #259
Senior - BHPian
 
Join Date: Sep 2008
Location: Bangalore
Posts: 1,891
Thanked: 43 Times

Good cars will remain good irrespective of the statistical method used.

i see the Optras, Manza, City, Vernas, retaining their top dog rating becasue they do not have no real weak links and are all round performers.

where as Fiestas, Lineas, Suzukis have been bit more volatile compared to the above top dogs depending on the statistical method.

So, now we have our T-BHP's (un) official market research and benchmarking team consisting of smartcat, echo77 and Dr.AD.

Great work guys, i have rated this thread 5 star, certainly worth being a sticky thread.

StarVegabond is offline  
Old 6th June 2010, 21:10   #260
Distinguished - BHPian
 
Dr.AD's Avatar
 
Join Date: Apr 2010
Location: Bangalore/Pune
Posts: 1,852
Thanked: 19,594 Times

Quote:
Originally Posted by arishi1 View Post

One request to Dr. AD, can you share the excel spreadsheet here.
Yes, certainly. However, I am really bad at Excel and I would be embarrassed to share my current sheet. I did some haphazard and unorganized changes to echo77's sheet to quickly calculate the results table.

I will need to clean up my sheet and do it in a proper way before I can share it with you guys. Please give me a day or two to do this.

Quote:
Originally Posted by StarVegabond View Post
Good cars will remain good irrespective of the statistical method used.

i see the Optras, Manza, City, Vernas, retaining their top dog rating becasue they do not have no real weak links and are all round performers.

where as Fiestas, Lineas, Suzukis have been bit more volatile compared to the above top dogs depending on the statistical method.

Very good comment, and I completely agree with you. Sometimes, statistics is just a way of convincing ourselves of the obvious. In my own case, I am at pain to see Linea (especially Petrol) languishing at the bottom of the tables regardless of the method used.

However, we all "love" certain cars (which is difficult to capture in any statistical model), and I believe that is finally the deciding factor when we buy a car. If we did not have this unexplainable notion of loving some cars, we all would be driving only a few type of cars ;-)

By the way, let us not forget that factors like "ride and handling" are not even included in any of the methods used here. Is it merely a coincidence that some of the top cars in the table are not recognized on this forum as being good handlers, whereas some of the bottom cars are especially praised for their handling?

Thanks,
-AD

Last edited by Dr.AD : 6th June 2010 at 21:25.
Dr.AD is online now  
Old 6th June 2010, 22:13   #261
BHPian
 
bhunu AMG's Avatar
 
Join Date: Oct 2009
Location: coimbatore
Posts: 91
Thanked: 57 Times

Smart cat ...hats off to you man ,really great work and thanks a lot for taking this much pain for us and last but not the least thank you DR.AD and ecco77 for the great work
I have rated five star..yup you deserve it.regards

Last edited by bhunu AMG : 6th June 2010 at 22:14.
bhunu AMG is offline  
Old 6th June 2010, 22:52   #262
Distinguished - BHPian
 
Dr.AD's Avatar
 
Join Date: Apr 2010
Location: Bangalore/Pune
Posts: 1,852
Thanked: 19,594 Times

One more interesting example (of significant change in rankings due to different method of calculations) I just noticed is that of Tata Indigo XL 1.4 Diesel. This car gets the best score for "practicality", and hence a normalized score of a perfect 1.0, but unfortunately gets a very poor normalized score (0.0041) for "safety". Hence, the compounded score is very low, and the final ranking of this car is moved down to 20 from 9 (given earlier). The Compounded Score says that you can't compromise on safety and balance that out by better practicality, and hence penalizes this car heavily.

Thanks,
-AD
Dr.AD is online now  
Old 6th June 2010, 23:00   #263
Senior - BHPian
 
Join Date: Sep 2008
Location: Bangalore
Posts: 1,891
Thanked: 43 Times

Best method is the publish a Poll here on degree of importance for each category of benchmarking and assign the relevant weightage to each class of factors and use it as a multiplier.

this way we can include the populist sentiments as well as factors like handling etc.

The weightage so arrived would be from collective buyer (or member) opinion hence more or less democratic

Last edited by StarVegabond : 6th June 2010 at 23:02.
StarVegabond is offline  
Old 7th June 2010, 01:47   #264
BHPian
 
echo77's Avatar
 
Join Date: May 2010
Location: goa
Posts: 359
Thanked: 117 Times

Dr.AD - simply replace the 0 values by 0.01 and see the results.

I was trying a sum based method similar to yours, but it really messes things up! Marina moves up to 4th!

Attaching the sheet in case anyone wants to mess it up more. I think this includes what Dr.AD has done, except that I have replaced zero values by 0.01
Attached Files
File Type: xlsx sedans.xlsx (93.1 KB, 498 views)

Last edited by echo77 : 7th June 2010 at 01:49.
echo77 is offline  
Old 7th June 2010, 05:10   #265
BHPian
 
Join Date: Jun 2009
Location: Chennai
Posts: 437
Thanked: 643 Times

Aah..Was waiting to comment until the final rankings were out. I was pretty sure that Tata Manza will end up at the top. A true VFM car. However, it is funny when people comment on the rankings, most of us will talk about all other cars (Dzire, Cedia et al) and completely ignore Manza!!
chennai-indian is offline  
Old 7th June 2010, 11:59   #266
ACM
Distinguished - BHPian
 
ACM's Avatar
 
Join Date: Mar 2008
Location: Mumbai
Posts: 4,751
Thanked: 4,378 Times

Dr. AD - Excellent. you did keep your word.

Agree with smartcat and Echo that "0" somehow seems unfair so 0.1 rating for all zero values or a scale of 0.1 to 1 or 0.1 to 1.1 would be ideal.
ACM is offline  
Old 7th June 2010, 12:17   #267
BHPian
 
Join Date: Mar 2010
Location: Bangalore
Posts: 167
Thanked: 362 Times

Quote:
Originally Posted by Dr.AD View Post

Smartcat, I completely agree with you. A score of "zero" does not make sense. It is a tempting thought to "alter" the method a bit to take care of the zero problem. However, unless we can find a very intuitive and fair method of doing this, just arbitrary alteration will end up losing the simplicity and logic behind doing this.

I will think about this problem and come up with an improvement in a day or two. But for now, I can say that the score of "zero" can be simply interpreted by the reader as "that car came last in at least one of the categories" (note: there can be cars that come last in more than one categories). We all know that no car is a zero, and we may very well go ahead and not just like but even end up buying a car the scores a zero. In my own case, I am in search for a sedan, and I may end up buying a Linea E+ Petrol, even though all logic says it scores a 0 in this analysis.
Hi AD,

How about this: We are using '1' the multiplicative identity for the top car. How about using it as the (geometric) mean value. Suppose we decide the penalty factor, say '2' as an example. The best car gets "1 * 2" while the worst car gets "1 / 2" (or 0.5).

Will this satisfy your intuitive constraint

Regards,
Prasad.
pnredkar is offline  
Old 7th June 2010, 12:54   #268
BHPian
 
MihirC's Avatar
 
Join Date: Nov 2009
Location: Mumbai
Posts: 534
Thanked: 71 Times

Hey Smartcat, really awesome list you put up there. I dont mean to buttt in but had a suggestion, could we have a column on the high speed manners of these cars, say at speeds above 130km/hr or something.
MihirC is offline  
Old 7th June 2010, 16:35   #269
Team-BHP Support
 
SmartCat's Avatar
 
Join Date: Jun 2007
Location: Bangalore
Posts: 6,663
Thanked: 45,797 Times

Quote:
Originally Posted by MihirC View Post
Hey Smartcat, really awesome list you put up there. I dont mean to buttt in but had a suggestion, could we have a column on the high speed manners of these cars, say at speeds above 130km/hr or something.
I have data on top speed of each car. Does that help?

I guess at 130 kmph, a car that has been built for a top speed of 200 km/hr will be more stable, comfortable & confidence inspiring than a car that has been built for a top speed of 160 km/hr.
SmartCat is online now  
Old 7th June 2010, 17:02   #270
Distinguished - BHPian
 
Dr.AD's Avatar
 
Join Date: Apr 2010
Location: Bangalore/Pune
Posts: 1,852
Thanked: 19,594 Times

Quote:
Originally Posted by echo77 View Post
Dr.AD - simply replace the 0 values by 0.01 and see the results.
Somehow I do not like these arbitrary adjustments like replacing 0 by 0.01. In the normalized system, when a car gets a score of 0, it has a very intuitive and also mathematically sound meaning: that the car is the last (in that category) out of all cars in consideration. When we artificially "fudge" this result to make it 0.01, this meaning is lost. What if there is another car which is second from last and genuinely gets a score of 0.01? How do we differentiate this from our artificial 0.01 correction? Hence, I am not in favor of this. But certainly thanks for the suggestion!

Quote:
Originally Posted by pnredkar View Post

How about this: We are using '1' the multiplicative identity for the top car. How about using it as the (geometric) mean value. Suppose we decide the penalty factor, say '2' as an example. The best car gets "1 * 2" while the worst car gets "1 / 2" (or 0.5).

Will this satisfy your intuitive constraint
A very interesting suggestion and thanks for it. Yes, it does satisfy the intuitive constraint for the best and the worst case. However, I could see two potential issues:

1. What about the in-between cars? Currently, the formula is simple: normalized score = (x-xmin)/(xmax-xmin); Note that the final difference in normalized scores is linearly proportional to the actual values of "x". In other words, the distribution of normalized scores is the same as of original values. How do you get the geometric series for all in-between values? And also, won't this change the type of distribution? If so, it will introduce errors similar to those of using pure ranking positions as used by Smartcat in the first place. Note that the distribution of ranks (1 through 26) is an uniform distribution whereas the distribution of actual values under any category is not uniform (it could be normal, or something else also).

2. With scale of 0 to 1, the nice property is that multiplication of any numbers (for any number of categories) between 0 to 1 is also a result between 0 to 1. I somehow like this property. This will be lost if some cars are given points such as 2 or 1.5. But this may be a relatively minor point.

By the way, my apologies if I have misunderstood your suggestion and the above points are not true. Please clarify if that is the case. Anyway, thanks a lot for suggestion.

Actually I have thought of a method to take care of this. I will work on this tonight (hopefully will get some time) and will post new results here late tonight. That method has some advantages and some drawbacks, and we will see what happens to the results. May be another revision will be required to take care of the drawbacks

Thanks,
-AD
Dr.AD is online now  
Reply

Most Viewed


Copyright ©2000 - 2024, Team-BHP.com
Proudly powered by E2E Networks