1 / 21

Equity Portfolio Analysis using Data Mining

Data Mining Prof. Chris Volinsky. Equity Portfolio Analysis using Data Mining. Rajesh Shekhar. Objective. Use Data Mining techniques to build a portfolio with superior return/risk characteristics using technical indicators Maximize return Minimize risk

verdad
Download Presentation

Equity Portfolio Analysis using Data Mining

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Data Mining Prof. Chris Volinsky Equity Portfolio Analysis using Data Mining Rajesh Shekhar

  2. Objective • Use Data Mining techniques to build a portfolio with superior return/risk characteristics using technical indicators • Maximize return • Minimize risk • Build different momentum based strategies

  3. Approach • Risk Diversification • Select stocks across sectors for a natural diversification. • Virtual sectors created using k-means cluster algorithm • Return maximization • Use momentum based indicators to predict future returns • Try different trading algorithms

  4. Data Collection • Investment Universe: Large Market Cap Stocks (Top 100/300/500) • Data collected for everyday stock prices from WRDS (CRSP database) for the entire stock universe from 1999-2009 . • Custom benchmark of top 100/300/500 stocks was created as composition of S&P 500 was not known over the period

  5. Data Issues

  6. Virtual Sectors • Used k-means cluster to create virtual clusters • 11 clusters for 300/500 stock universe and 10 clusters for 100 stock universe • Input: β, Market Cap (Liquidity), P/E (Price/Earning) • βstock = cov(Rstock, Rmarket)/var(Rmarke) • β captures long term adjusted equilibrium rate of return

  7. Silhouette Plot

  8. Stock Selection Model • Different models tried for capturing momentum indicators (linear models (based on APT) • Best model to capture model momentum was: • Based on time decay of historical returns • r = kj*rj where r = predicted stock return j = time period (j=0 for the current time) k = constant achieved after calibration • More weights on recent data • Two years of moving window for prediction • Portfolio analysis and rebalancing every two weeks

  9. Strategies • Long Only • Short Only • Long-Short • Sector Rotation • Sector Portfolio Optimization

  10. Long Only • Basic Idea: Long top “n” performing stocks in each sector based on market cap • Portfolio Weights: All selected stocks are equally weighted in portfolio Short Only • Basic Idea: Short bottom “n” performing stocks in each sector based on market cap • Portfolio Weights: All selected stocks are equally weighted in portfolio

  11. Long / Short • Basic Idea: Combination of Long and Short • Portfolio Weights: All selected stocks are equally weighted in portfolio Sector Rotation Basic Idea: Long top performing sectors & short on bottom performing ones Portfolio Weights: Weight in each sector is proportional to return (More weight on the more outperforming sector; shorting allowed)

  12. Sector Portfolio Optimization • Basic Idea: Select stocks using long only strategy. • Portfolio Weights: Decided by Markowitz Portfolio optimization techniques • Sector Constraints : (weights vary from 1.1 to 0.9 of the target sector weights) • Asset Constraints (Shorting and leverage allowed): (weights vary from -0.1 to 1.1) • Allocation on the efficient frontier

  13. Implementation • MATLAB (Object Oriented) • SQL Server database (> 5 GB of raw data and with indexes 12GB) PortfolioReports &Graphs Portfolio Engine(MATLAB Code) Risk Analysis(MATLAB Code) Performance & Risk Report (SQL Server)Database

  14. Robustness (Testing) • Vary Input parameters • Stock universe (100/300/500) • Stock selected (10/20/40) • Running time window (2001-2002, 2005-2007) • Rebalancing period (15/21/30/45 days)

  15. Results: 100 stocks universe

  16. Results: 300 stocks universe

  17. Performance Measurement : Risk Metrics

  18. Performance Attribution • Benchmark : Custom Benchmark • Value-added Return = Pure sector allocation + Allocation/Selection interaction + Within-sector selection RV= + + Rv = the value-added return wP,j= portfolio weight of sector j wB,j= benchmark weight of sector j RP,j= portfolio return of sector j RB,j= benchmark return of sector j RB = return in the portfolio’s benchmark S = number of sectors

  19. Performance Attribution Results

  20. Other Issues • Transaction Costs: • Slippage cost and explicit costs are taken into account • Market impact and other implicit costs are ignored • Leverage costs are not taken into account • Portfolio Turnovernot taken into account

  21. Conclusions • Virtual sectors works reasonably well. • Time decay returns is a decent predictor of future returns in stable market for short time periods. • Statistically relevant for large market caps.

More Related