Train Your TV Zhi Li and Borja Peleato Term Project for CS229 Machine Learning, Stanford University Experimental Results Abstract

10

Prediction Using Video

9 8

RoI Euclidean Distance

We study the problem of predicting the users’ viewing behavior in an Interactive TV application, using soccer match as an example. Based on the information extracted from the video frames and the user’s Region of Interest (RoI) trajectory history, we make the prediction of user’s RoI in the frames ahead of time. We start with a generic probabilistic model, make simplifications and develop a tractable system. Lastly, we verify its performance through a set of experimental results and a live demo.

7 6 5 4 3 2

 t+n We discretize each video frame into blocks and for each block p we compute Pr It+n(p) | f , where It+n(p) is the indicator function of block p being in RoI in frame (t + n). We further make the following simplifications:

High-resolution video is available at the server. Both a low-resolution overview video and an enhancementlayer video of the RoI predicted by the system are streamed to the user. Accurate prediction of the RoI will lead to higher video quality (or less distortion) at the user side. We will allow some streaming start-up delay and send some number of frames of the thumbnail video ahead of time. Hence, the inputs to our module are the trajectory of the user’s RoI history and the thumbnail video up to the frame of prediction. The performance will be evaluated based on the Euclidean distance between the predicted RoI trajectory and the actual one.

The second equation makes the assumption that It+n(p) depends only on a local patch ξt+n around p and the actual location p. To compute Pr(It+n(p) | ξt+n(p), p), we select features to reflect ξt+n(p) and p, and use a logistic regression model. The features we select include: DIST BALL(p), M OV (p), N U M P LAY ERS(p) and LOCAL LABEL(p).

Prediction Using Trajectory The available information is the user’s recent RoI trajectory and his behavior from watching previous matches Φ1, Φ2, Φ3, ..., ΦN . We believe that there exist “typical trajectories” that users very commonly follow which would be difficult to capture under a single set of parameters. Hence, we divide the trajectories into clusters and perform an independent regression within each cluster. Parameters and cluster centroids are computed in an offline training phase and stored to be used at prediction time. While the user is watching the match, his behavior is periodically classified into one of the clusters and his future RoI center is predicted using the corresponding coefficients. His RoI window is modeled as a Gaussian convolved with the current window.

...

1 0

Ball

Audience

Players

Mixed

Distance between predictions and correct RoI centers for different user behaviors and prediction schemes. Users can follow the ball, scan the audience, look at the players or switch randomly. Predictions respectively use video features (good for well localized behaviors), recent trajectory (good for changing behaviors), a smart combination of the previous two, and linear extrapolation (comparison benchmark). 8 7 RoI Euclidean Distance

Pr(It+n(p) | f t+n) = Pr(It+n(p) | ft+n) = Pr(It+n(p) | ξt+n(p), p)

Environment Setup

Video Information Trajectory History Video Combined with Trajectory Simple Extrapolation

Video Frame Partioned Into Blocks (Frame #182)

6

Video Information Trajectory History Video Combined with Trajectory Simple Extrapolation

5 4 3 2 1

Trajectory

Current Time

Look-ahead Window

0 0

Time of Prediction Region Map Ceiling Scoreboard

2

Ball 4 Detection

Probabilistic Model Suppose we have a sequence of overview video frames {fi}. On each frame the user can indicate a RoI φi. Suppose currently we are at time t, we want to make a prediction of the user’s RoI at n steps ahead, i.e., φt+n. The information available for our prediction includes: 1) the overview video up to frame (t + n), i.e., f t+n = {. . . , ft+n−2, ft+n−1, ft+n}, and 2) the user’s RoI trajectory history up to frame t, i.e., φt = {. . . , φt−2, φt−1, φt}. Based on all the information available, we make a prediction using:   φˆt+n = arg max p φt+n | f t+n, φt φt+n     = arg max p φt+n | f t+n p φt+n | φt . φt+n

We assume Markovity (f t+n ↔ φt+n ↔ φt) and uniform prior probability p(φt+n).

Player Detection

Audience

Ceiling Scoreboard

Audience

4

Goalmouth

6 Field

8 10

6

Goalmouth 5 10 15

Ads 20

25

30

35

Segment

40

2

3

4 5 lookahead time

6

7

8

Field

8 10

Distance To Ball

Region Map

2

1

Error vs. lookahead time. For short times, trajectory predictions are better, but as we try to predict further in time, the video features offer a very reliable source of information. We used a lookahead of 3 seconds.

5

Movement 10 15 Intensity

Ads Number of 20 25

Players

Minimum Distance

35 Local 40

30

Labels

4

Centroids

K-Means Clustering

Video Information Trajectory History Video Combined with Trajectory Simple Extrapolation

3.5 Training Feature Vector

Least Squares

Label

Least Squares

Linear Regression

. . .

Logistic Regression

LSE Coeffs

Least Squares Least Squares

Training Offline

Query Feature Vector Probability Map

Prediction Using Trajectory History Prediction Using Trajectory History

2

2

4

4

6

6

8

8

10

10

10

15

20

3

2.5

2

Prediction Using Video

5

RoI Euclidean Distance

Training Window

25

30

35

40

Combining Information

5

10

15

20

25

30

35

2 4 6 8 10 5

10

15

20

25

30

35

40

40

1.5 3

4

5

6

7

8 9 history length

10

11

12

13

Error vs. training window duration. Longer training times slightly improve the accuracy of trajectory predictions, but at the cost of increased complexity and feature degradation. We decided to base our predictions on the past 4 seconds.

Experimental Results Prediction Using Video Prediction ...

RoI Euclidean Distance. Video Information. Trajectory History. Video Combined ... Training. Feature Vector. Logistic. Regression. Label. Query Feature Vector.

195KB Sizes 1 Downloads 303 Views

Recommend Documents

Anesthesia Prediction Using Fuzzy Logic - IJRIT
Thus a system proposed based on fuzzy controller to administer a proper dose of ... guide in developing new anesthesia control systems for patients based on ..... International conference on “control, automation, communication and energy ...

Experimental Validation of Channel State Prediction ...
Radio companion (GRC) runs on the host computer with Linux operating system. Using the ..... [14] Y. Tachwali, M. Chmeiseh, F. Basma, and H. H. Refai, “A frequency .... reliable and secure state estimation in smart grid,” in Proceedings of.

Fast Intra Prediction for High Efficiency Video Coding
adopted in the HEVC reference software HM [5]. First, a rough mode decision is performed to ... algorithm is implemented and reported on HM3.0 software, and claims 40% encoding time reduction. However, early SKIP ..... ciency video coding (hevc) text

Geometry-Based Next Frame Prediction from Monocular Video
use generative computer graphics to predict the next frame to be observed. ... Recent frame prediction methods based on neural net- works [28], [24], [26], [14] ..... three-dimensional point cloud C. The x,y,z coordinates of the projected points in .

Prediction markets
Management and Sustainable Development, Vol. ... The essential problem of management is to transform a company's strategic objectives .... used by Siemens to predict a large software project's completion date. .... Boca Raton, Florida, USA.

Prediction markets - CiteSeerX
management, logistics, forecasting and the design of production systems. ... research into and assessment of business applications of various forecasting ...

Prediction markets - CiteSeerX
aggregation and transmission of information through prices. Twenty years ... The first business application however took place some years later. In Ortner .... that will provide the environment for hosting such business games is already under.

Structured Prediction
Sep 16, 2014 - Testing - 3D Point Cloud Classification. • Five labels. • Building, ground, poles/tree trunks, vegetation, wires. • Creating graphical model.

Prediction markets
subjects such as data mining and prediction markets. I. Tatsiopoulos is a Professor in Production ... data-driven, self-adaptive method that comprises a universal non-linear functional approximation and has an extensive .... Other considerations conc

Protein location prediction using atomic composition ...
Dec 28, 2009 - subcellular localization and the second is the computational tech- nique employed for making prediction [1]. The biological features used for prediction include detection of protein sorting signal, ami- no acid composition, physiochemi

Program Behavior Prediction Using a Statistical Metric ... - Canturk Isci
Jun 14, 2010 - Adaptive computing systems rely on predictions of program ... eling workload behavior as a language modeling problem. .... r. LastValue. Table-1024. SMM-Global. Figure 2: Prediction accuracy of our predictor, last-value and ...

Rating Prediction using Feature Words Extracted from ...
“Seiichi," shown in the Course table, is known as a famous golf course designer who has designed many golf courses in Japan. The negative side of the Course table includes words such as “weed,". “river," and “sand pit." Because a customer's l

HEADS: Headline Generation as Sequence Prediction Using an ...
May 31, 2015 - tistical models for headline generation, training of the models, and their ... lems suffered by traditional metrics for auto- matically evaluating the ...

Knowledge Extraction and Outcome Prediction using Medical Notes
to perform analysis on patient data. By training a number of statistical machine learning classifiers over the unstructured text found in admission notes and ...

Feature Selection using Probabilistic Prediction of ...
selection method for Support Vector Regression (SVR) using its probabilistic ... (fax: +65 67791459; Email: [email protected]; [email protected]).

Russian Stress Prediction using Maximum ... - Research at Google
performs best in identifying both primary ... rived directly from labeled training data (Dou et al., 2009). ..... Computer Speech and Language, 2:235–272.

Single-Step Prediction of Chaotic Time Series Using ...
typical application of neural networks. Particularly, .... Equations (7) and (9) express that a signal 1РBС is decomposed in details ..... American Association for the.

Geolocation Prediction in Twitter Using Location ...
location-based recommendation (Ye et al., 2010), crisis detection and management (Sakaki et al., ... Section 2 describes our proposed approach, including data ..... Using friendship (bi-directional) and following (uni-directional) links to infer the 

Prediction of Channel State for Cognitive Radio Using ...
ity, an algorithm named AA-HMM is proposed in this paper as follows. It derives from the Viterbi algorithm for first-order. HMM [20]. 1) Initialization. âiRiR+1 ...

Program Behavior Prediction Using a Statistical Metric ... - Canturk Isci
Jun 14, 2010 - P(s4 | s3) s4. P(s4). P(s3). P(s2). P(s1). Probability. Figure 1: Model with back-off for n = 4. The statistical metric model is a conditional ...

Using Prediction Markets to Track Information Flows - Department of ...
Jan 6, 2008 - Using data on the precise latitude and longitude of employees' offices, we found that prediction market ... 4 As discussed below, in all data analyzed by the external researchers on this project, Google employees were anonymized and ...

Student Result and Placement Prediction Using ...
collage all of them are from different background to identify the student performance manually and which student provide which type of treatment manually it is very time consuming process. there are many attempts to address this challenge, automatic

Customer Churn Prediction Model using Machine ...
using machine learning algorithms. ... card programs are also more likely to churn. ... service providers, or companies providing them with cellular network ...