<!DOCTYPE html>
<html >
<head>
<!-- Site made with Mobirise Website Builder v5.3.10, https://mobirise.com -->
<meta charset="UTF-8">
<meta http-equiv="X-UA-Compatible" content="IE=edge">
<meta name="generator" content="Mobirise v5.3.10, mobirise.com">
<meta name="viewport" content="width=device-width, initial-scale=1, minimum-scale=1">
<link rel="shortcut icon" href="assets/images/output-onlinepngtools-160x160.png" type="image/x-icon">
<meta name="description" content="">
<title>Home</title>
<link rel="stylesheet" href="assets/web/assets/mobirise-icons/mobirise-icons.css">
<link rel="stylesheet" href="assets/bootstrap/css/bootstrap.min.css">
<link rel="stylesheet" href="assets/bootstrap/css/bootstrap-grid.min.css">
<link rel="stylesheet" href="assets/bootstrap/css/bootstrap-reboot.min.css">
<link rel="stylesheet" href="assets/tether/tether.min.css">
<link rel="stylesheet" href="assets/dropdown/css/style.css">
<link rel="stylesheet" href="assets/theme/css/style.css">
<link rel="preload" href="https://fonts.googleapis.com/css?family=Rubik:300,400,500,600,700,800,900,300i,400i,500i,600i,700i,800i,900i&display=swap" as="style" onload="this.onload=null;this.rel='stylesheet'">
<noscript><link rel="stylesheet" href="https://fonts.googleapis.com/css?family=Rubik:300,400,500,600,700,800,900,300i,400i,500i,600i,700i,800i,900i&display=swap"></noscript>
<link rel="preload" as="style" href="assets/mobirise/css/mbr-additional.css"><link rel="stylesheet" href="assets/mobirise/css/mbr-additional.css" type="text/css">
</head>
<body>
<section class="menu cid-sykHwG5qtz" once="menu" id="menu2-28">
<nav class="navbar navbar-expand beta-menu navbar-dropdown align-items-center navbar-fixed-top navbar-toggleable-sm bg-color transparent">
<button class="navbar-toggler navbar-toggler-right" type="button" data-toggle="collapse" data-target="#navbarSupportedContent" aria-controls="navbarSupportedContent" aria-expanded="false" aria-label="Toggle navigation">
<div class="hamburger">
<span></span>
<span></span>
<span></span>
<span></span>
</div>
</button>
<div class="menu-logo">
<div class="navbar-brand">
<span class="navbar-logo">
<a href="pens.html">
<img src="assets/images/output-onlinepngtools-160x160.png" alt="" style="height: 5rem;">
</a>
</span>
<span class="navbar-caption-wrap"><a class="navbar-caption text-black display-4" href="https://mobiri.se"></a></span>
</div>
</div>
<div class="collapse navbar-collapse" id="navbarSupportedContent">
<ul class="navbar-nav nav-dropdown nav-right" data-app-modern-menu="true"><li class="nav-item dropdown"><a class="nav-link link text-black dropdown-toggle display-4" href="#" aria-expanded="false" data-toggle="dropdown-submenu">
About PENS </a><div class="dropdown-menu"><a class="text-black dropdown-item text-primary display-4" href="pens_model.html" aria-expanded="false">MODEL</a><a class="text-black dropdown-item text-primary display-4" href="pens_data.html" aria-expanded="false">DATA</a></div></li></ul>
</div>
</nav>
</section>
<section class="header1 cid-sxdpQ08Ub1" id="header16-6">
<div class="container">
<div class="row justify-content-md-center">
<div class="col-md-10 align-center">
<h3 class="mbr-section-subtitle mbr-light pb-3 mbr-fonts-style display-2"><strong>PENS: PErsonalized News headlineS </strong></h3>
<p class="mbr-text pb-3 mbr-fonts-style display-5">An English Dataset for Personalized <br>News Headline Generation Research</p>
</div>
</div>
</div>
</section>
<section class="header1 cid-sxgucHlYoa" id="header16-z">
<div class="container">
<div class="row justify-content-md-center">
<div class="col-md-10 align-center">
<h3 class="mbr-section-subtitle mbr-light pb-3 mbr-fonts-style display-5"><strong>About PENS</strong></h3>
<p class="mbr-text pb-3 mbr-fonts-style display-7">PENS is an English dataset for personalized news headline generation research. It consists of two parts: a training set and a test set. The training set was collected from anonymized user impression logs of the <a href="https://microsoftnews.msn.com/" class="text-primary" target="_blank">Microsoft News</a> website, and the test set was manually created by native speakers to provide a fair testbed for evaluating models offline.<br><br>PENS contains about 113k English news articles spanning 15 topic categories, along with 500k impression logs generated by over 445k users for training. Each news article includes rich textual content: title, body, category, and corresponding entities. Each impression log contains the user's click events, non-clicked events, and historical news click behaviors before the impression. To provide an offline testbed, we invited 103 native English speakers to manually create a test set in two stages. In total, over 100k personalized news headlines were generated. <br><br>Note that each user was de-linked from the production system and securely hashed into an anonymized ID to protect user privacy. For more detailed information about the PENS dataset, please refer to the following paper:</p>
</div>
</div>
</div>
</section>
<section class="mbr-section content8 cid-sxgtxzi0Hb" id="content8-x">
<div class="container">
<div class="media-container-row title">
<div class="col-12 col-md-8">
<div class="mbr-section-btn align-center"><a class="btn btn-primary-outline display-4" href="https://www.microsoft.com/en-us/research/uploads/prod/2021/06/ACL2021_PENS_Camera_Ready_1862_Paper.pdf" target="_blank">PENS paper (Ao et al., ACL 2021)</a></div>
</div>
</div>
</div>
</section>
<section class="header1 cid-sxgturxTWr" id="header16-w">
<div class="container">
<div class="row justify-content-md-center">
<div class="col-md-10 align-center">
<h1 class="mbr-section-title mbr-bold pb-3 mbr-fonts-style display-5">
Download</h1>
<p class="mbr-text pb-3 mbr-fonts-style display-7">
The PENS dataset is free to download for research purposes under the <em><strong><a href="https://github.com/msnews/MIND/blob/master/MSR%20License_Data.pdf" class="text-primary" target="_blank">Microsoft Research License Terms</a></strong></em>. Please read these terms before downloading the dataset.<br><br>The dataset supports research on personalized news headline generation and can be downloaded here:<br></p>
</div>
</div>
</div>
</section>
<section class="mbr-section content8 cid-sxgwVDzid3" id="content8-11">
<div class="container">
<div class="media-container-row title">
<div class="col-12 col-md-8">
<div class="mbr-section-btn align-center"><a class="btn btn-primary-outline display-4" href="https://mind201910small.blob.core.windows.net/release/PENS.tar.gz"> PENS Dataset</a></div>
</div>
</div>
</div>
</section>
<section class="features3 cid-sxdqgZANAc" id="features3-9">
<div class="container">
<div class="media-container-row">
<div class="card p-3 col-12 col-md-6 col-lg-4">
<div class="card-wrapper">
<div class="card-box">
<h4 class="card-title mbr-fonts-style display-5">Dataset Description</h4>
<p class="mbr-text mbr-fonts-style display-7">
An introduction to the PENS dataset, including statistics and example cases.</p>
</div>
<div class="mbr-section-btn text-center"><a href="pens_data.html" class="btn btn-primary-outline display-4">
Learn More
</a></div>
</div>
</div>
<div class="card p-3 col-12 col-md-6 col-lg-4">
<div class="card-wrapper">
<div class="card-box">
<h4 class="card-title mbr-fonts-style display-5">Model Framework</h4>
<p class="mbr-text mbr-fonts-style display-7">We propose a basic, generic framework for personalized headline generation.</p>
</div>
<div class="mbr-section-btn text-center"><a href="pens_model.html" class="btn btn-primary-outline display-4">
Learn More
</a></div>
</div>
</div>
<div class="card p-3 col-12 col-md-6 col-lg-4">
<div class="card-wrapper">
<div class="card-box">
<h4 class="card-title mbr-fonts-style display-5">Code</h4>
<p class="mbr-text mbr-fonts-style display-7">The code for our work with the PENS dataset is available and will be updated over time. <br></p>
</div>
<div class="mbr-section-btn text-center"><a href="https://github.com/LLluoling/PENS-Personalized-News-Headline-Generation" class="btn btn-primary-outline display-4" target="_blank">
Learn More
</a></div>
</div>
</div>
</div>
</div>
</section><section style="background-color: #fff; font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', 'Roboto', 'Helvetica Neue', Arial, sans-serif; color:#aaa; font-size:12px; padding: 0; align-items: center; display: flex;"><a href="https://mobirise.site/e" style="flex: 1 1; height: 3rem; padding-left: 1rem;"></a><p style="flex: 0 0 auto; margin:0; padding-right:1rem;"><a href="https://mobirise.site/s" style="color:#aaa;">Site</a> was built with Mobirise website theme</p></section><script src="assets/web/assets/jquery/jquery.min.js"></script> <script src="assets/popper/popper.min.js"></script> <script src="assets/bootstrap/js/bootstrap.min.js"></script> <script src="assets/tether/tether.min.js"></script> <script src="assets/smoothscroll/smooth-scroll.js"></script> <script src="assets/dropdown/js/nav-dropdown.js"></script> <script src="assets/dropdown/js/navbar-dropdown.js"></script> <script src="assets/touchswipe/jquery.touch-swipe.min.js"></script> <script src="assets/theme/js/script.js"></script>
</body>
</html>