1 / 56

Mgmt 237H Lecture #7

Mgmt 237H Lecture #7. Professor Jason C. Hsu, Ph.D. Admin. Incorporating Signal Into Portfolios. Portfolio Mathematics. Portfolio weights: Sequence of portfolio weights over time : Stock returns: Portfolio return: Note: w _t is the vector of weights at the beginning of period t

bell-webb
Download Presentation

Mgmt 237H Lecture #7

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Mgmt 237HLecture #7 (C) Created by Jason C. Hsu for use by UCLA Anderson and Research Affiliates, LLC Professor Jason C. Hsu, Ph.D.

  2. Admin (C) Created by Jason C. Hsu for use by UCLA Anderson and Research Affiliates, LLC

  3. Incorporating Signal Into Portfolios (C) Created by Jason C. Hsu for use by UCLA Anderson and Research Affiliates, LLC

  4. Portfolio Mathematics Portfolio weights: Sequence of portfolio weights over time : Stock returns: Portfolio return: • Note: • w_t is the vector of weights at the beginning of period t • r_t is the vector of return over holding period t • A proper long only portfolio has weights sum to 1

  5. Some Simple Portfolios • Equal weighting: • Cap-weighting:

  6. Active Portfolio Benchmark portfolio: Active Portfolio: These long only portfolios have weights sum to 1 Active Weights: *This a long-short portfolio with weights sum to 0

  7. Active Portfolio Returns • E{R_a} = alpha + E{R_b} • Realized excess return relative to benchmark = R_a_t – R_b_t; • Arithmetic Average of excess return relative to benchmark is a proxy for alpha

  8. Portfolio Risk • TE: • TE can also be estimated by the volatility of the realized excess return • TE measures the active deviation from the benchmark • The larger is the TE the larger are the deviation in stock weights against the benchmark stock weights • Manager has greater conviction (will take on larger active weights, which leads to larger TE)

  9. Active Weights for Long Only Strategy • The active weights for a long only strategy can be expressed as a long-short portfolio (active weights sum to 0%). • The LS portfolio created in the previous sections can be used as the basis for the active weights. • Recall that the portfolio can be described by • So where the active weight portfolio (AW) is just a scaled version of the long-short portfolio where is chosen to satisfy the TE constraint of the long only portfolio.

  10. Active Weights for Long Only Strategy • To continue, we need to know the benchmark weight and the TE budget for the long only portfolio. • Recall that active weights and ex ante TE have the following relationship • Vol (active weight portfolio) = TE Where is the vector of the active weights and is the covariance matrix for the stocks in the benchmark index • Since ,

  11. Negative Weights • The active long portfolio that was just built may still have some short weights. This happens when the scaled short weights in the LS portfolio are larger than the benchmark portfolio. • There are a number of ways to deal with this: • Zero out the short weight and rescale the entire portfolio back to 100% • Find “similar characteristic” stocks and spread the excess short weights to those stocks. • Similar characteristics stocks will be other underweight stocks which are being shorted for “roughly” the same reason(s)

  12. Why don’t we optimize? • We will talk more about issues with optimization. • Generally, optimization doesn’t work. • The estimation errors are so large that the optimization generally gives too much weight to the most over-estimated stocks. (C) Created by Jason C. Hsu for use by UCLA Anderson and Research Affiliates, LLC

  13. Passive Strategies (C) Created by Jason C. Hsu for use by UCLA Anderson and Research Affiliates, LLC

  14. Passive Strategies • The first quant product was an index • Quant departments usually also handles index • BGI, SSgA and Mellon Capital were the very original index shops and have since become the biggest quant investment shops

  15. Index Fund Performance • Pure replication passive index fund has 0% chance of outperforming its benchmark • But it has 70% chance of outperforming an active manager! • The remaining part of the course will be focused on how to outperform a benchmark index.

  16. Passive Index Plus • Producing Index + returns • Securities Lending • Investment bank (keeps 50% of the lending income) • Collateral Management • Cash management service charges mgmt fee

  17. Portable Alpha • Gain index exposure through futures contract • Actively manage the cash collateral to take additional risk

  18. Enhanced Indexes • Producing Index + returns • Tilting toward quant factors to earn premium • Tilting toward value and small cap under a tight TE

  19. Passive Approach to Outperformance • This approach claims (empirically or theoretically) that the “cap-weighted” benchmark is sub-optimal. • It tries to build a better (more optimal) portfolio • This approach does not start with the benchmark and then try to create active weights against the benchmark, it just builds a new portfolio from scratch

  20. MVOOptimal Portfolio (C) Created by Jason C. Hsu for use by UCLA Anderson and Research Affiliates, LLC

  21. MVO as a method for outperformance • MV Optimal portfolio construction • Why do cap weighting? • We have no reason to believe that it is MVO • So you can create a better passive portfolio by directly trying to build an MVO portfolio

  22. Tangency Portfolio • What do we need to construct the tangency portfolio • Mean and covariance • Does it matter that we estimate them very poorly? • As it turns out MVO is very sensitive to small variations in the inputs • Stocks which we estimate with big positive errors in the expected return will get enormous weights • MVO can often be very undiversified as a result • Numerically, how tractable is this method? • Since we impose a long only constraint, we need to numerically solve for the MVO. This is very difficult in practice when we need to deal with hundreds of stocks. • Simply using the algebraic solution and cutting out the negative weight does not lead to a good approximation for the true tangency portfolio

  23. Naïve Tangent Portfolio • Let’s use the most naïve asset pricing model to set expected stock returns: using sample estimates • Empirically, how well does this method work? • Not so good! • Empirically, this method consistently underperforms equal weighting!

  24. Naïve Tangent Portfolio • Why doesn’t it work? • MVO is guaranteed to be ex ante optimal if inputs are correct. • However, what if inputs are not 100% correct? What if they are only “generally” correct? • Using some useful information should still be better than EW, which is almost entirely without information! • However, since sample averages tend to significantly over or under-estimate true mean, this information actually appears to be “almost useless” or even harmful when combined with MVO • We will discuss how to implement MVO better (later)

  25. MVO Portfolio • Ingredients: • Define a stock universe • Assign returns for stocks • Estimate the covariance matrix for stocks (C) Created by Jason C. Hsu for use by UCLA Anderson and Research Affiliates, LLC

  26. Estimating Future Stock Returns • If stock returns were ergodic (iid) then past return information is indicative of future returns • So historical realized returns tell us something about future likely returns; but just how accurate can we forecast?

  27. Mean Estimates with High Frequency Data • Great the more data the better. We can always go to higher frequency. • But that doesn’t rally help. Going from monthly data over 10 years (m=120), to 10 years of weekly data (m=520), the std err on the weekly expected return becomes smaller, but when you annualize, the std err on the annualized forecast remains the same!

  28. Estimating Returns • The average individual arithmetic stock return is about 10% per annum • The average stock vol is 30% • With 10 years of data, your std err on annual arithmetic return estimate is 9.5%! You couldn’t really say if the realized 10% return was different from 0% • Easier to estimate lower vol portfolios (like an index) • How much time would you need in order to conclude that a stock actually has positive expected return? • Need std err to be less than 5% (so that 10% is 2 stddev away from zero) • So T needs to be at least 36 years!

  29. What if expected returns are time-varying instead of ergodic? • If expected returns for stocks are time-varying, then having a long time history of data doesn’t help, because you are not just using the data to estimate the parameters of one stationary distribution.

  30. Modeling Expected Returns • Even if returns were ergodic, it is much too hard to estimate them with any reliability. So we will need to assume some structure—build a theory that allows us to use more data or to estimate fewer parameters, etc. • We will spend a lot of time working on these asset pricing models, which help us forecast returns.

  31. Asset Pricing Model • All reasonable asset pricing model tries to relate risk to expected return. • This then allows us to use higher moment distributional parameters to give us information on the first moment • If return is related to vol of a stock, then we can use the vol estimate to help us estimate expected returns. We can use high frequency data to improve on the expected return estimate then! • Think CAPM. Why is CAPM pricing equation more useful than historical average return for estimating future expected stock returns?

  32. APT • …+ • A few risk factors which drive much of the aggregate (undiversifiable risk) in the economy • Exposure to these factors usually pay a premium (some might pay no premium which others pay a lot of premium) • We can figure out the expected return for a stock by estimating its exposure to the factors • We need to estimate the factor premiums as well

  33. Return forecasting • The practice of return forecasting in excess of the APT model is generally involved with • Forecasting the idiosyncratic error component of stock returns (inside information, insights into mispricing) • Forecasting the time-varying risk premium associated with the factors.

  34. Estimating Returns and Using them! • So you have estimated returns. You probably want to use them for something useful! • This is when you need to do more work! • Your return estimates are plagued with outliers. These outliers will hurt your portfolio strategy. • Shrinkage approach. • Create biased estimates, but more useful estimate. • Idea is to reduce the effects of the outlier estimates by shrinking them toward the mean

  35. Estimating Factor Portfolio Mean Return • How do we identify the factor portfolio mean return? • Take the same arithmetic average return for each factor portfolio • This turns out not to be the best way • The better way is the Fama-MacBeth Approach • Intuitively, we want to use the cross-sectional stock information to help us improve our mean estimate

  36. APT Model and Fama-MacBeth First stage we run the following cross-sectional regression to estimate betas for each stock on the factors For the second stage, we take expectation of the APT model to get the following expression This is now a cross-sectional regression that we can use to estimate the factor premium

  37. MVO • Once you have the expected factor returns estimated, you can estimate the expected stock returns for each individual stocks • Now you can apply MVO! • But your optimizer probably will struggle to deal with optimizing 1000 stocks! • In fact, if you want to run a back test, MVO over 100 stocks will make your back test extremely slow. • The more stocks you add to the MVO, the less robust the output can become

  38. Estimating Covariance Matrix • We will need to estimate the covariance matrix for a variety of reasons • As input into computing min-var and MVO portfolios • As input for estimating ex ante portfolio volatility

  39. Issues with Estimating Covariance Matrix • The N x N covariance matrix contains • N unique variance terms • 0.5* N * (N -1) unique correlation terms • For S&P500 stocks, you will have to estimate 62,250 unique parameters • You will need at least that many data points (62,250) to estimate a full rank covariance matrix • If you want to have “small” standard error on the variance estimate, you will need to have at least 200 observations (per stock) • This is about 17 years of monthly data, which is a unwieldy in a backtest • Using daily data will solve this issue. *Recall the formula for standard error for vol:

  40. Issues with Cov Matrix Estimation • As with estimating the mean, the Cov matrix estimate can be noisy, though that issue is significantly reduced by high frequency data. • However, there are techniques for improving the accuracy of the Cov matrix which you should be aware of • Covariance shrinkage • PCA

  41. Covariance Shrinkage (1/2) • This is related to the Shrinkage which we described for shrinking sample mean estimates. • The shrinked Covariance Matrix has the form: • Where S is also N x N, but contains only two distinct numbers: • Diagonal elements are all = average(the N stock sample variance) • Off-diagonal elements are all = average (the 0.5 * N * (N-1) off diagonal covariances)

  42. Covariance Shrinkage (2/2) • The shrinkage parameter is “the” art • You can use an explicit solution recommended by Olivier and Ledoit (2003); it’s a very ugly beast. • You can also just estimate using a quick and dirty empirical step. • Step #1, define in-sample period for estimating sample covariance and shrinkage target • Step #2, use the out-of-sample period to estimate out of sample predicted ; optimize to minimize the squared deviation of the 0.5 * N * (N-1) elements from our shrinkage target.

  43. PCA Approach (1/3) • Start with APT framework • Stock movements can be modeled as driven by a few common (orthogonal) factors + idiosyncratic noise • …+ …+ …+ • You now only need to estimate N x k , k , and N ) • For 500 stocks and 5 APT factors, that’s only 2500+5+500 or 3005 elements instead of 62250 elements in a unrestricted model.

  44. PCA (2/3) • So how do you get the APT factors and how do you make them orthogonal? • We use a statistical approach called the Principal Component Analysis • This technique essentially examines the covariance matrix and extracts the eigenvectors from the covariance matrix and sort the eigenvectors by their eigenvalue • Intuitively, the method finds the linear combination of the N time series of stock returns such that the resulting portfolio explains the greatest total variance. • We the extract the next portfolio which explains the greatest amount of the residual variance.

  45. PCA (3/3) • Once you identify the PCs, you can estimate the N x k , k , and N ) parameters • by running regression to get • Taking the vol of the PC portfolios to estimate • And then backing out from the total variance of each stock

  46. MVO • Once you have the expected factor returns estimated, you can estimate the expected stock returns for each individual stocks • Now you can apply MVO! • But your optimizer probably will struggle to deal with optimizing 1000 stocks! • In fact, if you want to run a back test, MVO over 100 stocks will make your back test extremely slow. • The more stocks you add to the MVO, the less robust the output can become

  47. MVO in Factor Space • We apply MVO in the factor space • Since only the factors earn a risk premium, we can largely ignore idiosyncratic volatility • So why don’t we just optimize a portfolio of the k factors? • That’s exactly what we should do! • Apply MVO to the factors instead of the stocks will achieve a more robust portfolio than doing MVO on the 500 stocks!

  48. MVO and Estimation Errors • Michaud Resampling Technique (parameter uncertainty technique) • The issue with MVO is that your mean and covariance estimates are estimates with errors. They are not true distribution parameters known with certainty. So you need to adjust for that. • Use a bootstrap resampling technique • Start with the full sample of history • Randomly select T dates to form a new sub-sample of data • Use the sub-sample to compute all parameters (this works whether you use the average return model, the APT or CAPM); then form the MVO portfolio • Repeat this process hundreds of times (M). • Average over all M MVO portfolios

  49. Minimum Variance • Minimum Variance is clearly not an optimal portfolio and should generally underperform most portfolios in the SR space.

  50. Minimum Variance • Empirically, minimum variance has significantly higher SR than most known portfolio strategies! It outperforms cap-weighting handily. • Under what conditions would minimum-variance be optimal? • Optimal if expected returns are equal for all stock (since we are only focused on achieving the lowest vol portfolio)

More Related