
Apartment Cloud

Apartment Cloud, by Noah Callaway, Zac Fleischmann, Zak Nelson, and Brandon Zahl. The aspiration: aggregate apartment listings from all across the internet to create a simple, one-stop apartment search. The reality: aggregate apartment listings from top sites (Washington state only).


Presentation Transcript


  1. Apartment Cloud
     Noah Callaway, Zac Fleischmann, Zak Nelson, Brandon Zahl

  2. Aspirations / Reality
     • Aspiration: aggregate apartment listings from all across the internet to create a simple, one-stop apartment search
     • Reality: aggregate apartment listings from top sites (Washington state only) for a mostly one-stop, mostly simple apartment search

  3. Building It
     • Brandon – Site-specific extractors, statistics
     • Noah – Server configuration, front-end development
     • Zac – Site-specific extractors, advanced search
     • Zak – Crawler / aggregator, commute distance feature

  4. Page Extraction Statistics

  5. Extraction Accuracy Statistics

  6. Experiment Conclusion
     • Much higher accuracy on the structured pages than on unstructured craigslist
     • Craigslist is a candidate for machine learning
     • Machine learning would likely do worse on the other sites
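
The slides do not say how extraction accuracy was measured. One plausible way to compare structured sites against craigslist is to score each extracted listing against a hand-labeled copy and average per site. The sketch below assumes exact-match scoring per field; the function names and the `(site, extracted, expected)` sample layout are hypothetical, not taken from the project.

```python
def field_accuracy(extracted, expected):
    """Fraction of hand-labeled fields whose extracted value matches exactly."""
    if not expected:
        return 0.0
    correct = sum(
        1 for key, value in expected.items() if extracted.get(key) == value
    )
    return correct / len(expected)


def accuracy_by_site(samples):
    """Average field accuracy per site.

    samples: iterable of (site, extracted_dict, expected_dict) tuples,
    a layout assumed for this sketch.
    """
    scores = {}
    for site, extracted, expected in samples:
        scores.setdefault(site, []).append(field_accuracy(extracted, expected))
    return {site: sum(vals) / len(vals) for site, vals in scores.items()}
```

Exact matching is deliberately strict; a real comparison would probably normalize prices and addresses before scoring.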

  7. What we learned
     • How to configure Amazon Web Services with a LAMP stack
     • How to create a web application with AJAX
     • How to use Jobo and Nutch for web crawling
     • How to parse HTML for pertinent data
     • The considerations of starting a web business
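
As an illustration of parsing HTML for pertinent data, here is a minimal sketch using Python's standard `html.parser` module. It is not the team's extractor: the language, the `ListingParser` class, and the CSS class names (`price`, `bedrooms`, `address`) are all assumptions made for the example, and a real listing site would need its own mapping.

```python
from html.parser import HTMLParser


class ListingParser(HTMLParser):
    """Collects the text of elements whose class names a listing field."""

    # Hypothetical class names; each real site would use different markup.
    FIELDS = {"price", "bedrooms", "address"}

    def __init__(self):
        super().__init__()
        self.listing = {}
        self._current = None  # field whose text is expected next

    def handle_starttag(self, tag, attrs):
        classes = (dict(attrs).get("class") or "").split()
        for name in classes:
            if name in self.FIELDS:
                self._current = name

    def handle_data(self, data):
        # Store the first non-blank text that follows a matching start tag.
        if self._current and data.strip():
            self.listing[self._current] = data.strip()
            self._current = None

    def handle_endtag(self, tag):
        self._current = None


def extract_listing(html):
    """Return a dict of the fields found in one listing page."""
    parser = ListingParser()
    parser.feed(html)
    parser.close()
    return parser.listing
```

This approach only works on pages with consistent markup, which fits the observation that structured listing sites were much easier to extract from than free-form craigslist posts.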

  8. Unexpected Outcomes
     • Amazon Web Services was slower than a $7/month virtual server
     • Most of the large listing sites were surprisingly easy to extract data from
     • Aggregating information from the web is legally tricky

  9. Things We’d Do Differently
     • Better version control
     • More pre-coding design
     • More quality control and testing
     • More extensible extractors (maybe an existing HTML parser)
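
One way to read "more extensible extractors" is a registry that maps each listing site to its own extraction function, so supporting a new site means adding one function rather than editing the crawler. The sketch below is an assumption about what that could look like, not the project's design; the `extractor` decorator, the `EXTRACTORS` table, and the `listings.example.com` host are all hypothetical.

```python
from urllib.parse import urlparse

# Maps a hostname to the function that extracts listings from that site.
EXTRACTORS = {}


def extractor(hostname):
    """Decorator that registers a site-specific extractor for a hostname."""
    def register(func):
        EXTRACTORS[hostname] = func
        return func
    return register


@extractor("listings.example.com")  # placeholder host, not a real source
def extract_example(html):
    # A real extractor would parse the page; this stub only records its size.
    return {"source": "listings.example.com", "length": len(html)}


def extract(url, html):
    """Dispatch a crawled page to the extractor registered for its host."""
    host = urlparse(url).hostname
    func = EXTRACTORS.get(host)
    if func is None:
        raise LookupError(f"no extractor registered for {host!r}")
    return func(html)
```

Pages from unregistered hosts raise `LookupError`, which makes gaps in site coverage visible instead of silently producing empty listings.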

  10. Demo
