Cost Analyses of inaccurate regression model

HaukeVHaukeV MemberPosts:5Newbie
Hi,

我已经建立了一个模型timate house prices, however I would like to classify the predictions as correct or incorrect based on a % difference with the correct price. Any idea how I can do that? I would then like to apply a cost and a benefit for every correct/incorrect prediction.

Kind regards,
Hauke
Tagged:

Answers

  • lionelderkrikorlionelderkrikor Moderator, RapidMiner Certified Analyst, MemberPosts:1,195Unicorn
    Hi@HaukeV,

    You can use theGenerates Attributesoperator to calculate the difference between the predicted price and the correct price :
    for examplediff = (pred(price) - true(price)) / true (price)and then apply a threshold to apply a cost or a benefit.

    Hope this helps,

    Regards,

    Lionel
    HaukeV
  • HaukeVHaukeV MemberPosts:5Newbie
    Hi Lionel, thanks for the answer. We now have 2 columns, one with true/false if the prediction is 15% or more lower, and a second column true/false if the prediction is 15% or more higher.

    Now trying when we try to run the cost analyses for classification, so that a true in either category is given the appropriate cost, but we keep running into issues. Tried manually redefining labels, predictions,.. Any help would be much appreciated.
  • MartinLiebigMartinLiebig Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University ProfessorPosts:3,421RM Data Scientist
    Hi,
    you may use Performance (Cost) as an operator here. Is this a commercial project? That way we can help you with presales ressources to make your point.

    Best,
    Martin
    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany
    HaukeV
  • HaukeVHaukeV MemberPosts:5Newbie
    Hi Martin,

    thanks for your information. This is not a commercial project, just practice for a paper. I changed my setup and now I have only one attribute which classifies as either 0, too low, or too high. I then tried to use the Performance (cost) operator on this to add a cost matrix. To make that work I first had to add a default model which just copies the too low, too high or 0 class to a predicted class. I set all values to 0 in the matrix, and then only if the predicted class = the actual class I add the required cost. However when I do the math manually (80 cases too low, 100 cases too high, multiplied by their respective costs)this does not at all match the math that the performance (costs) module does.
    Thanks for your help!!
Sign InorRegisterto comment.