RTB-research

RTB(Real-Time-Bidding) research walk through.

This is my material repo about the Computional Advertising. That's about the resource i read, some notes and memo.

Welcome to make friends in this industry.

Book

Display Advertising with Real-Time Bidding (RTB) and Behavioural Targeting by Jun Wang, Weinan Zhang and Shuai Yuan. ArXiv 2016.

A very detail introduction book about RTB industry and technic detail with many paper reference.
Computational Advertising by Peng Liu, Chao Wang. 2015

A good introduction book from almost all perspectives to talking about digital advertising. Copyright reserve. So no link..

Paper

Bidding

Optimal Real-Time Bidding for Display Advertising by Weinan Zhang. PhD Thesis 2016.

Very excellent work about bid optimalization. Detail research in the future.
Optimal Real-Time Bidding for Display Advertising by Weinan Zhang, Shuai Yuan, Jun Wang. KDD 2014.
This paper give theory foundation of optimal bidding algorithm which take budget and win rate into consideration. It prove to be an nonlinear bidding. The experiment result is better than the linear one.
Bid Optimizing and Inventory Scoring in Targeted Online Advertising by Claudia Perlich et al. KDD 2012. PPT is here

This paper put forward the theory of a bidding algorithm now known as linear bidding. Use CTR as user experience feedback to adjust the system. But the linear bidding doesn’t take budget into consideration and will spent the money quickly if there’s no pace control.

CTR

Simple and Scalable Response Prediction for Display Advertising by Olivier Chapelle Criteo, Eren Manavoglu, Romer Rosales. ACM TIST 2014.

Very specific CTR modeling process. Including model introduce, tricks of feature engineering, feature conjunctions, try multitasks learning, subsampling, regulazation, smoothing, calibration. Some useful experiment like Exploration/Exploitation. But a little obsolete for today (2017).
Practical Lessons from Predicting Clicks on Ads at Facebook by Xinran He et al. ADKDD 2014.

This paper formulate Normalized-Entropy as metrics and any other metrics explained. It also put forward the practice of GBDT+LR. (A method use GBDT(gradient boosting decision tree) to do feature transformation.
Ad Click Prediction: a View from the Trenches by H. Brendan McMahan. KDD 2013.

This paper have a lot of pratical skills and industry experience about CTR, including sparsification, feature enginering, validation and final calibration. It also put forward how to estimate the unconfidence pCTR. The calibration is about possion regression or isotonic regression.(Need further check.)
Predictive Model Performance: Offline and Online Evaluations by Jeonghee Yi, Ye Chen et al. KDD 2013.

This paper do the summary for now about the evaluation metrics of model performance including but not except sponsor search, RTB..et al. AUC is not good enough to evaluate model performance, RIG is not much good for model compare.

Estimating Conversion Rate in Display Advertising from Past Performance Data by Kuang-chih Lee et al. KDD 2012.

This paper explain a method like clustering the similar user whom has conversion to deal with the sparsity data. It also include some pratical technic skills like imbalanced dataset processing, result calibration. Experiment prove that the skewness would not affect the model performance but the probability scores it output and it still need calibrate.
T. Graepel, J. Quin ̃onero Candela, T. Borchert, and R. Herbrich. Web-scale bayesian click-through rate prediction for sponsored search advertising in Microsoft’s Bing search engine. In ICML, pages 13–20, 2010.

The Bayes methods of CTR.

Pacing

Budget Pacing for Targeted Online Advertisements at LinkedIn by Deepak Agarwal et al. KDD 2014. Pace control according to the incoming traffic. Cheap but effective. Useful information about the A/B experiment.

Other Technique Tricks

Cheng H T, Koc L, Harmsen J, et al. Wide & Deep Learning for Recommender Systems[J]. 2016

The paper have a detail explain the implimentation in Google Play. Use wide model to memorize data and deep model to generialize and using joint learning to learn the model. Google open source the code in tensorflow.
Steffen Rendle (2010): Factorization Machines, in Proceedings of the 10th IEEE International Conference on Data Mining (ICDM 2010), Sydney, Australia.

Factorization machine. Widely used in Information Retrieve field. It extend LR, take 2nd cross-feature into consideration. The idea is simple. But very useful.
Feature Hashing for Large Scale Multitask Learning, by Kilian Weinberger et al. 2010

This paper show great details about feature hash, including mathematic prove, application in spam email filter. NLP problem. It could also apply when there's need to reduce RAM. But google shows that, hash trick may induce great bias when there's great collision.
Logistic Regression in Rare Events Data by Gary King and Langche Zeng. Political Analysis 2001

This paper is about correction in LR model in rare events data, the imbalanced dataset. Two correction methods, afterward correction and weight correction.

Dataset

iPinYou Real-Time Bidding Dataset for Computational Advertising Research

This dataset is the PC traffic data about RTB. It’s about 3 seasons data. Detail about the benchmark and the data dictionary is in this paper Real-Time Bidding Benchmarking with iPinYou Dataset by Dr.Zhang.

And the benchmark data needed some preprocess at this site, the environment of script use python2.

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
SUMMARY.md		SUMMARY.md
paper.md		paper.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RTB-research

RTB(Real-Time-Bidding) research walk through.

Book

Paper

Bidding

CTR

Pacing

Other Technique Tricks

Dataset

About

Releases

Packages

License

RochaC/RTB-research

Folders and files

Latest commit

History

Repository files navigation

RTB-research

RTB(Real-Time-Bidding) research walk through.

Book

Paper

Bidding

CTR

Pacing

Other Technique Tricks

Dataset

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Packages