Skip to content

Latest commit

 

History

History
502 lines (407 loc) · 21.9 KB

README.md

File metadata and controls

502 lines (407 loc) · 21.9 KB
title
Optimizing Housing under Lease Penalty

Collin McNulty

Background

Recently, my employer has moved offices from the East Side of Houston to the West. Because Houston’s East Side is almost entirely industrial, my wife and I currently live in an apartment in the town center, so my commute to the new office has not increased, but there is an opportunity to move closer to the new office and thus reduce our commutes. Additionally, the rent in the new area is lower than we are currently paying. However, we are currently in a lease agreement that lasts until June 1, 2017, so we would have to pay a penalty to break the lease.

The penalty is not a fixed amount, but rather is a function of when the lease is broken in the cycle and the time that it takes to re-rent the apartment. Therefore, it was not obvious to me what the optimal strategy is in order to maximize the time saved and minimize the additional expense incurred.

The three variables to optimize are when we officially break the lease, when we begin a new lease, and when we physically move to the new location. We are required to give 60 days notice, but we can overlap leases if it proves to improve the cost/time tradeoff.

A few fictionalizations have been introduced in the problem for the sake of the class. First, I considered that we can begin and break leases at any given day. In reality, only certain days are available for move-in and move-out. Also, all variables have been made continuous, while many of them are discrete by day or even by month. However, the shape of the problem is still preserved under these fictionalizations, and the insights are applicable to the real problem. I also considered a completely fictional version of the scenario that has a slightly more interesting optimization than the real scenario turned out to have.

Original Problem Formulation

Modelling

This first formulation of the problem is based strongly on the actual circumstances surrounding the move. The objectives were to maximize time saved and minimize additional cost. These independent objectives were combined into the following lexicographic minimum:

  1. All constraints met

  2. Maximize Time Saved per Additional Cost

  3. Maximize Time Saved

This enshrines the assumption that, in the area of the problem, utility is linear with both time saved and additional cost.

The penalty for the move was formulated as a retroactive increase in the amount of rent payed for the current apartment for all months in which we had stayed on the current lease, plus 85% of the rent in the months before the apartment was re-rented. The cost of the new apartment was, of course, also considered. Costs of physically moving were not considered, as we will move at the end of the lease in any scenario, so moving costs are not incremental. In that same spirit, the cost of the “do nothing” option was subtracted from the cost objective. Finally, all costs were converted from monthly costs to continuous time, to make the function differentiable and non-flat at all points.

$$Cost = RetroactivePenalty + RerentFee + NewRent - Current\ Rent$$ (1)
$$RetroactivePenalty = \left( CurrentLeaseStart - LeaseEnd \right)PenalizedRent$$ (2)
$$ReRent\ Fee = MonthsUnrented
0.85PenalizedRent$$ (3)
$$NewRent = \left( FinalDate - \ NewLeaseStart \right)
$ 825/month$$ (4)

For this first formulation, time saved was considered to be a linear function of the number of days in the new apartment.

$$TimeSaved = \left( Final\ Date - MoveDate \right)*SavedPerDay$$ (5)

Thus, there are 3 design variables: NewLeaseStart, LeaseEnd, and MoveDate.

Constants are shown in Table 1.

Table : Constants

Current Lease Start June 1, 2016 (-182)
Penalized Rent $1600/month
Final Date June 1, 2017 (183)
New Rent $825/month
Current Rent $1325/month
Months Unrented (assumption) 1.5 (45 days)
Notice Required 60 days
Current Date Dec 1, 2016 (0)

The constraints are also based on the real moving situation. They are as follows.

$$NewLeaseStart = < MoveDate = < LeaseEnd$$ (6)
$$LeaseEnd > CurrentDate + Notice\ Required$$ (7)
$$MoveDate > CurrentDate$$ (8)
$$NewLeaseStart > CurrentDate$$ (9)

Solving

The astute reader may notice that this problem has a simple obvious solution, or at least an obvious pareto frontier. I assure that astute reader that the solution seemed neither simple nor obvious to me at the time that I contemplated and formulated this problem. Nevertheless, it can be plainly seen that MoveDate should always be equal to NewLeaseStart as it is strictly advantageous to TimeSaved and has no impact whatsoever on Cost. This will prove to be the case in every formulation of the problem shown in this paper, so it can be argued that this is really a two design variable problem. However, my code treats it as a three variable problem, so I will give results for all three. Perhaps more simplifying though, is that both maximization of Time Saved and minimization of cost are improved with earlier dates for all three design variables, and the equations are monotonic on the domain in question. Once again, this was not apparent when the analysis was done.

Because the equations involve multiple pieces being combined in ways that I wanted to be able to edit quickly, I restricted the methods used to solve the problem to those that did not have to have an explicit derivative provided and could make do with only function evaluations. I used the SciPy package in Python for my optimization API.

Both the simplex algorithm and COBYLA were tested, and both arrived at the same answer. I also took several point estimates for reference. These points and the optimum are shown in Table 2. The code for this analysis is found in commit 4ed1c84.

Table : Values for Optimization of Original Formulation

Lease End New Lease Move Time Saved (minutes) Housing Cost ($) Ratio
183 183 183 0 3717.5 0
60 0 0 4967 2189.5 2.27
60 60 60 3338 540 6.18

Note that the point estimate for (183, 183, 183) corresponds to the “do nothing” option, which means that it does not make sense for the Cost, which is calculated relative to the do nothing, to be non-zero. This is because the modelling does not account for values that do not incur penalties. By changing the formulation to set the RetroactivePenalty to zero if LeaseEnd ≥ FinalDate and setting the Months Unrented to the maximum of the constant or (Final Date – LeaseEnd), this can be fixed. This changes the objective space to have two local optima, one corresponding to ending the lease early, and the other for not doing so. These local optima are shown below. Note that (183,183,183) now has an undefined ratio and thus objective function.

Table : Values for Correct Handling of Lease Penalty

Lease End New Lease Move Time Saved (minutes) Housing Cost ($) Ratio
183 0 0 4967.14 5032.5 0.99
60 60 60 3338.57 540 6.18
183 183 183 0 0 Undefined

The global optimum is unchanged. Note that the point (183, 0, 0) must use the entire lexicographic objective to be seen as the local optimum, any point (183, x1, x2) where x1=x2 will have the same ratio. The code for this altered analysis can be found in commit a2906cf.

Verification

The point (60,60,60) makes sense as the optimal point. Indeed, it is obvious once the problem is laid out clearly. The pareto frontier is defined by a straight line between the points (60,0,0) and (60,60,60) in the original objective function of maximize time, minimize cost. In the transformed space of the time/money ratio, only the point (60,60,60) is pareto optimal. This makes sense because, with increasingly earlier dates, up until (60, 60, 60), no tradeoffs need to be made between the objectives, but after the EndLease reaches its constraint, earlier dates in starting the new lease increase cost and increase time savings, a tradeoff. This tradeoff occurs at a lower ratio than is achievable at (60,60,60) so in the transformed space they are suboptimal.

The local optimum showed in Table 3 corresponds to the strategy of simply paying double rent and not ending the lease early. This makes sense because the penalty undergoes a step change, so there should be at least 2 locally optimal points, one on either side of the step. I had thought there might be some optimal point in which the two leases would overlap for only a small period, but because the time savings are linear, if they overlap for even one day, they may as well overlap as much as possible.

In summary, the results match the common sense solution of moving as fast as possible to minimize penalties while also avoiding the payment of double rent. The KKT conditions for an exterior point are satisfied because there is no feasible, improving direction around either local optimum.

Altered Problem Formulation

After completing the analysis above, I was concerned that one key objective had yet to be optimized: my grade in this class. The problem turned out to be much better behaved than I had initially imagined, and I was (and am) concerned that it would not demonstrate my learning from this class. Therefore, I envisioned an alternate cost penalty and time structure that would be more taxing, though not relevant to my real world problem.

Note on the Impact of Months Unrented

My first attempt to complicate the problem was to account for the uncertainty in Months Unrented to impact the solution. As a small test, I made the assumption that the apartment would never be re-rented within the duration of the original lease. This did not change the global or locally optimal strategies. The ratios got worse, but they did not add complexity. Thus, the uncertainty only affects the decision between the global optimum and the “do nothing”, whose objective is undefined, which is an uninteresting question. I then abandoned this line of inquiry.

Modelling

The objective, constraints and constants remained the same in this new problem formulation, but the equations for Cost and Time differ substantially.

For Cost, I imagined the penalty to be constructed in an opposite way, such that the penalty would be incurred on months that were abandoned rather than the months that were lived in. I also imagined that the penalty per day would be a function of the day on which the lease was broken, so the Cost would be a nonlinear function and would push the objective opposite the time savings.

$$Cost = Penalty + New\ Rent$$ (10)
$$Penalty = \ \sum_{1}^{183 - LeaseEnd}{(Penalty\lbrack day\rbrack)}\ \ \ + LeaseEnd\ mod\ 1*Penalty\ \lbrack last\ day\rbrack$$ (11)
$$\text{Penalty}\left\lbrack \text{day} \right\rbrack = \left( \frac{i}{183} \right)PenalizedRent - \left( 1 - \frac{i}{183} \right)\text{CurrentRent}$$ (12)

The last term in Equation 11 ensures that the function is not composed of step functions, which were found to confuse the solvers.

The Time Saved was reformatted to reflect the differences in time savings based on the day of the week according to Table 4.

Table : Time Savings Schedule

Sunday Monday Tuesday Wednesday Thursday Friday Saturday
Save 25 Lose 25 Save 20 Save 20 Save 20 Save 20 Save 25

This schedule results in the following shape for Time Saved:

Figure : Time Saved in New Formulation

This new formulation therefore has many local optima, and the surface is “bumpy”.

Solving

The COBYLA solver proved to be unable to navigate the bumps, and got stuck in local optima very frequently. I therefore switched to the SLSQP solver, which is also in the SciPy API. I also tried a wide variety of initial points within the space. This code is found in commit c9b77aa. Some of the local optima found are shown in Table 5. These points show that the non-linearity of the cost had very little impact on the optimum solutions. There was still a strongly linear component, so the solver has trouble differentiating between points which are along different parts of the cost curve. However, the non-linearity of time actually controls the globally optimal point. (183, 179, 179) is the global optimum because it never encounters the effect of losing time on a Monday. For earlier dates, the time saved appears more smoothed, and so the optima are all very similar ~=0.55.

One additional point is that in this case it is strictly better to not end the lease early. There’s no reason to incur the penalty at all, because it always costs more money and does not constrain from moving earlier. Therefore, only the two points highlighted in blue are truly local optima satisfying KKT conditions. All the other points have a feasible direction of improvement towards a later date for lease end.

Table : Local Optima for Reformulated Problem

Lease End New Lease Move Time Saved Housing Cost Ratio Solver Used
71 60 60 1850 8589 0.22 COBYLA
183 179 179 90 110 0.82 SQSLP
170 0 0 -2750 5565 0.5 SQSLP
183 0 0 2750 5032 0.55 SQSLP
161 67 67 1770 3201 0.55 SQSLP

Verification

The value for the global optimum makes sense as it exploits the direction and non-linearity of both objective components to the maximum degree possible. It suggests that penalties can be avoided and that a little extra time can be saved by moving out a little early. It also highlights a weakness in the model, as in order to get this most favorable tradeoff rate, one must trade only a little. However, if a ratio of 0.55 was still “worth it”, one could make that trade in a much higher volume. The pareto frontier in the original objective space is (183, x1, x2) where x1=x2, as cost can be strictly decreased by increasing LeaseEnd and TimeSaved can be independently increased by setting MoveDate equal to NewLease.

Conclusions & Learnings

The optimal strategy in the first scenario is to end the lease as soon as possible and begin a new lease and move as close to that date as possible as defined by the point (60, 60, 60). The optimal strategy in the second scenario is to not end the lease at all, and to move on the day after the last day that the new location would cause a loss of time as defined by the point (183, 179, 179).

The parameters of the problem were not nearly as entangled as I had though they were when coming up with the problem. In both formulations, the effects of time and cost were significantly more independent than I thought, and could be optimized more independently than it initially appeared. This has shown me the value of the modelling and exploration step of optimization, as once I truly understood the shape of the space, the optimum point was obvious.

Related to that unexpected simplicity, many of the effects that I had intended to model became unnecessary. I am glad that I kept it simple to begin with, as that simplicity enabled me to see that the core problem did not need to have much of its uncertainty modelled, even though that uncertainty is part of why I felt advanced techniques were needed to solve the problem. I was thus able to use off-the-shelf solvers and still gain a solid understanding of the problem’s dynamics.

(I wish I had gotten reason to demonstrate comparisons of more advanced techniques for purposes of the project originality and difficulty requirement, but the problem simply didn’t lend itself to that, despite my original confidence that it would)

Appendix

Formulation of Constraints

Equations 6-9 are presented in the most human readable form. The canonical version of each, which is used in the code, is always of the form: Constraint > 0 with the necessary algebra performed to be consistent with Equations 6-9.

Code

The optimal place to view the code is my GitHub repository. There, you can see every version of the code and can reference the commits in which certain results are taken from. I am also showing the text of the core files below

Github repository: https://github.com/collinmcnulty/moving-optimization

Naïve-Optimizer.py

# -*- coding: utf-8 -*-
"""
Created on Sat Dec 10 10:54:30 2016

@author: Collin
"""

import numpy as np
import scipy.optimize as optimize
from sample.input_def import MoveStrategy

class Optimizer:
@staticmethod
def objective(input_list, obj='tr'):
point = MoveStrategy(input_list)
return point.objective_value

def __init__(self, list, obj='tr'):
self.initial_point = MoveStrategy(list, obj)
self.cons = (
{'type': 'ineq',
'fun': lambda x: np.array([x[0] - x[1]]),
},
{'type': 'ineq',
'fun': lambda x: np.array([x[0] - 60])},
{'type': 'ineq',
'fun': lambda x: np.array([x[1]])},
{'type': 'ineq',
'fun': lambda x: np.array([x[2]])},
{'type': 'ineq',
'fun': lambda x: np.array([-x[0]+183])},
{'type': 'ineq',
'fun': lambda x: np.array([x[2] - x[1]])},
{'type': 'ineq',
'fun': lambda x: np.array([x[0] - x[2]])}
)
self.optimize_result = optimize.minimize(Optimizer.objective, list, args=(obj,), constraints=self.cons,
method='slsqp', options={'disp': False})
self.final_point = MoveStrategy(self.optimize_result.x)

def print_results(self):
print("Days until lease end: {:.0f}".format(self.optimize_result.x[0])),
print("Days until new lease: {:.0f}".format(self.optimize_result.x[1]))
print("Days until move: {:.0f}".format(self.optimize_result.x[2]))
print("Time Spent: {:.2f}".format(self.final_point.time))
print("Money Spent: {:.2f}".format(self.final_point.money))
print("Time/Money: {:.2f}".format(self.optimize_result.fun))

if __name__ == "__main__":
input_initial = [183, 180, 180]
a = Optimizer(input_initial, 'time')
a.print_results()

Input-Def.py

# -*- coding: utf-8 -*-
"""
Created on Sat Dec 10 14:00:52 2016

@author: Collin
"""
import numpy as np

class MoveStrategy:
def __init__(self, list, obj='tr'):
self.EL = list[0]
self.SL = list[1]
self.moving_time = list[2]
self.AR = 45
self.nparray = np.array([self.EL, self.SL, self.moving_time])
self.const_max_rent = 1600 / 30
self.const_new_rent = 825 / 30
self.const_normal_rent = 1325 / 30
if obj == 'tr':
self.objective_function = self.time_ratio
elif obj == 'time':
self.objective_function = self.time_ratio
self.find_time_spent()
self.find_money_lost()
self.calculate_objective_value()

def find_money_lost(self):
days_abandoned = 183. - self.EL
whole_days_abandoned = int(days_abandoned - (days_abandoned % 1))
dec1penalty = (1600 - 1325) * 182 / 365 + 1325
money = 0
for i in range(whole_days_abandoned + 1):
if i == whole_days_abandoned:
last_day = (days_abandoned % 1)**2 * (
i / 183 * dec1penalty + ((1 - i) / 183) * 1325) / 30 # Makes function continuous
money += last_day
else:
money += (i / 183 * dec1penalty + ((1 - i) / 183) * 1325) / 30
new_rent = self.const_new_rent * (183 - self.SL)
self.money = new_rent + money
return self.money

@staticmethod
def calculate_commute(move_date):
days_in_new_crib = 183 - move_date
whole_days_in_new_crib = int(days_in_new_crib - days_in_new_crib % 1)
time = 0
for i in range(whole_days_in_new_crib + 1):
if i == whole_days_in_new_crib:
mod = days_in_new_crib - whole_days_in_new_crib
else:
mod = 1
weekday = i % 7
if weekday in [0, 1, 5, 6]: # Thursday, Fri, Tues, Wed
time -= 20 * mod
elif weekday in [2, 3]:
time -= 25 * mod
else:
time += 25 * mod
return time

# TODO make commute times a probability distribution

def find_time_spent(self):
self.time = self.calculate_commute(self.moving_time)
return self.time

def calculate_objective_value(self):
self.objective_value = self.objective_function()
return self.objective_value

def time_ratio(self):
# Minimize time spent/money saved
if self.money > 1:
self.time_ratio_val = self.time / self.money
else:
self.time_ratio_val = -100
return self.time_ratio_val

def print_results(self):
print("Days until lease end: {:.0f}".format(self.EL)),
print("Days until new lease: {:.0f}".format(self.SL))
print("Days until move: {:.0f}".format(self.moving_time))
print("Time Spent: {:.2f}".format(self.time))
print("Money Spent: {:.2f}".format(self.money))
print("Time/Money: {:.2f}".format(self.time_ratio_val))

if __name__ == "__main__":
input_initial = [183, 182, 182]
a = MoveStrategy(input_initial)
a.print_results()