80 likes | 206 Views
This project outlines a high-level architecture for improving estimated click-through rates (eCTR) using latent topic modeling, user encoding, and clustering frameworks. The pipeline includes encoding user data through an auto-encoder for dimension reduction and clustering users based on political affiliations. The output will be stored in a Hive table containing user IDs and their low-dimensional representations. Additionally, optional user clustering stages will be explored to refine prediction accuracy. Proposed models and algorithms are detailed alongside project plans and resource allocation.
E N D
High-Level Architecture Users Ads User Encoding UserEncoding UserClustering Prediction eCTR / FB Prediction
Existing Pipeline • Encoding • Auto-encoder for dimension reduction • Political affiliation clustering • Output: Hive table (user id + low-dim representation) • eCTR prediction • Optional: user clustering stage