<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-4294268551164982318</id><updated>2012-02-16T09:51:13.836-08:00</updated><category term='quantile regression'/><category term='regression'/><category term='absolute loss'/><category term='squared loss'/><title type='text'>COMS 4771</title><subtitle type='html'>Machine Learning (Spring 2008), Columbia University</subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://coms-4771.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4294268551164982318/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://coms-4771.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><author><name>COMS 4771</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>12</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-4294268551164982318.post-8027483804741179118</id><published>2008-04-17T14:16:00.000-07:00</published><updated>2008-04-27T06:37:01.098-07:00</updated><title type='text'>Final</title><content type='html'>The final is worth 25 points (+7 bonus points), covering the lectures after the midterm.  The following lectures will not be covered:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Nearest neighbor methods (covered by the final assignment)&lt;/li&gt;&lt;li&gt;Prediction bounds (the last lecture before the midterm)&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Large-scale learning (exponentiated gradient, VW), but the standard gradient method may be covered.&lt;/li&gt;&lt;/ul&gt;If necessary, the final will be curved.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4294268551164982318-8027483804741179118?l=coms-4771.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://coms-4771.blogspot.com/feeds/8027483804741179118/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4294268551164982318&amp;postID=8027483804741179118' title='35 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4294268551164982318/posts/default/8027483804741179118'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4294268551164982318/posts/default/8027483804741179118'/><link rel='alternate' type='text/html' href='http://coms-4771.blogspot.com/2008/04/final.html' title='Final'/><author><name>COMS 4771</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>35</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4294268551164982318.post-7727438800901174793</id><published>2008-04-16T09:30:00.000-07:00</published><updated>2008-04-16T09:57:11.294-07:00</updated><title type='text'>Reading Assignments</title><content type='html'>Since a number of you chose this option, the oral quiz is not going to happen (even if it takes 10 minutes per paper, the quiz would take about 10 hours).  We can have a written quiz.  The quiz will take 30 minutes per paper and will start at 1pm on May 1.    (If you are doing only one paper, you can either come at 1pm and leave at 1:30pm, or come at 1:30pm and leave at 2pm.)  The final will be held at 2:40pm the same day.&lt;br /&gt;&lt;br /&gt;1) Please email me IF you want to do paper #2.  (As I recall, nobody expressed interest in reading it, so unless I hear from someone, the paper is off the list.)&lt;br /&gt;&lt;br /&gt;2) Please email me IF there is any time conflict.&lt;br /&gt;&lt;br /&gt;3) Please don't email me asking what type of questions there will be on the quiz.  You should understand the papers in depth.  You CAN use the papers on the quiz, but you can't use any other material.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4294268551164982318-7727438800901174793?l=coms-4771.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://coms-4771.blogspot.com/feeds/7727438800901174793/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4294268551164982318&amp;postID=7727438800901174793' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4294268551164982318/posts/default/7727438800901174793'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4294268551164982318/posts/default/7727438800901174793'/><link rel='alternate' type='text/html' href='http://coms-4771.blogspot.com/2008/04/reading-assignments.html' title='Reading Assignments'/><author><name>COMS 4771</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4294268551164982318.post-6737960588419844029</id><published>2008-04-05T08:26:00.000-07:00</published><updated>2008-04-06T16:54:31.840-07:00</updated><title type='text'>Questions about final projects</title><content type='html'>&lt;blockquote&gt;&lt;span style="font-style:italic;"&gt;&lt;span style="font-style:italic;"&gt;- in Problem #1, do we have to come up with a new method, or can we use &lt;br /&gt;an already existing one (I was thinking about Kmeans, for example) ?&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/blockquote&gt;&lt;br /&gt;You can use an existing method, provided that you are happy with how it performs on the problem.  The webpage describing the dataset has a list of test error rates achievable by different methods, so you can see how well you are doing.&lt;br /&gt;&lt;br /&gt;&lt;blockquote&gt;&lt;span style="font-style:italic;"&gt;how about the code? do we have to provide it as well? does it have to be &lt;br /&gt;100% original or can we use (and maybe adapt) toolboxes? (I saw for &lt;br /&gt;example you pointed us to weka). Are there restrictions on the language?&lt;/span&gt;&lt;br /&gt;&lt;/blockquote&gt;&lt;br /&gt;You have to provide the code.  You can use toolboxes.  The code doesn't have to be original if you can make it work well on the problem.  The grade *will* depend on the performance of other students.  There is no restriction on the programming language as long as you make it easy for me to run your solution to verify the reported test error rates. (Again, if you somehow use the test set to tune your solution, you will automatically get 0 points.)&lt;br /&gt;&lt;br /&gt;&lt;blockquote&gt;&lt;span style="font-style:italic;"&gt;- If we turn in some projects before May 1st, will they be graded &lt;br /&gt;earlier (so that we get an idea whether we should attempt others) ?&lt;/span&gt;&lt;br /&gt;&lt;/blockquote&gt;&lt;br /&gt;Yes, but every student will be given only one additional attempt.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;An important note&lt;/span&gt;: If you choose to do a reading assignment with a quiz, I will &lt;span style="font-weight:bold;"&gt;subtract points&lt;/span&gt; if you clearly don't understand an important concept from the paper.  So choose this option only if you are serious about it.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4294268551164982318-6737960588419844029?l=coms-4771.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://coms-4771.blogspot.com/feeds/6737960588419844029/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4294268551164982318&amp;postID=6737960588419844029' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4294268551164982318/posts/default/6737960588419844029'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4294268551164982318/posts/default/6737960588419844029'/><link rel='alternate' type='text/html' href='http://coms-4771.blogspot.com/2008/04/questions-about-final-projects.html' title='Questions about final projects'/><author><name>COMS 4771</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4294268551164982318.post-6199183879431610748</id><published>2008-04-02T20:24:00.001-07:00</published><updated>2008-04-02T20:43:54.480-07:00</updated><title type='text'>Nearest Neighbor Methods</title><content type='html'>&lt;a href="http://en.wikipedia.org/wiki/Nonlinear_dimensionality_reduction"&gt;Dimensionality reduction&lt;/a&gt;:&lt;br /&gt;&lt;a href="http://web.mit.edu/cocosci/isomap/isomap.html"&gt;Isomap&lt;/a&gt;&lt;br /&gt;&lt;a href="http://www.cs.toronto.edu/~roweis/lle/"&gt;Locally Linear Embedding (LLE)&lt;/a&gt;&lt;br /&gt;&lt;a href="http://www.weinbergerweb.net/publications/PDFs/AAAI0620WeinbergerK.pdf"&gt;Maximum variance unfolding&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.weinbergerweb.net/publications/PDFs/nips06.pdf"&gt;Distance Metric Learning for Large Margin&lt;br /&gt;Nearest Neighbor Classification&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Nearest neighbor schemes:&lt;br /&gt;Piotr Indyk's &lt;a href="http://people.csail.mit.edu/indyk/brown.ps"&gt;tutorial&lt;/a&gt;&lt;br /&gt;Ken Clarkson's &lt;a href="http://www.almaden.ibm.com/u/kclarkson/nn_survey/p.pdf"&gt;tutorial&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;a href="http://hunch.net/~jl/projects/cover_tree/cover_tree.html"&gt;Cover trees&lt;/a&gt;&lt;br /&gt;&lt;a href="http://www.mit.edu/~andoni/LSH/"&gt;Locality-Sensitive Hashing (LSH)&lt;/a&gt;&lt;br /&gt;&lt;a href="http://www.cs.umd.edu/~mount/ANN/"&gt;ANN: A Library for Approximate Nearest Neighbor Searching&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4294268551164982318-6199183879431610748?l=coms-4771.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://coms-4771.blogspot.com/feeds/6199183879431610748/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4294268551164982318&amp;postID=6199183879431610748' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4294268551164982318/posts/default/6199183879431610748'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4294268551164982318/posts/default/6199183879431610748'/><link rel='alternate' type='text/html' href='http://coms-4771.blogspot.com/2008/04/nearest-neighbor-methods.html' title='Nearest Neighbor Methods'/><author><name>COMS 4771</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4294268551164982318.post-1938425202626481071</id><published>2008-04-02T08:39:00.000-07:00</published><updated>2008-04-02T08:40:42.198-07:00</updated><title type='text'>Test set bound</title><content type='html'>Pantelis Monogioudis, a student in the class, implemented the test set bound in matlab.  The implementation is &lt;a href="http://hunch.net/~coms-4771/bound.zip"&gt;here&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4294268551164982318-1938425202626481071?l=coms-4771.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://coms-4771.blogspot.com/feeds/1938425202626481071/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4294268551164982318&amp;postID=1938425202626481071' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4294268551164982318/posts/default/1938425202626481071'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4294268551164982318/posts/default/1938425202626481071'/><link rel='alternate' type='text/html' href='http://coms-4771.blogspot.com/2008/04/test-set-bound.html' title='Test set bound'/><author><name>COMS 4771</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4294268551164982318.post-5230267602159030827</id><published>2008-04-01T18:26:00.000-07:00</published><updated>2008-04-01T18:36:12.492-07:00</updated><title type='text'>Large Scale Learning</title><content type='html'>A link to &lt;a href="http://hunch.net/~vw/"&gt;Vowpal Wabbit&lt;/a&gt; (Fast Online Learning at Yahoo! Research).&lt;br /&gt;A link to &lt;a href="http://hadoop.apache.org/"&gt;Hadoop&lt;/a&gt;, also &lt;a href="http://en.wikipedia.org/wiki/Hadoop"&gt;here&lt;/a&gt; and &lt;a href="http://developer.yahoo.com/blogs/hadoop/"&gt;here&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4294268551164982318-5230267602159030827?l=coms-4771.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://coms-4771.blogspot.com/feeds/5230267602159030827/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4294268551164982318&amp;postID=5230267602159030827' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4294268551164982318/posts/default/5230267602159030827'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4294268551164982318/posts/default/5230267602159030827'/><link rel='alternate' type='text/html' href='http://coms-4771.blogspot.com/2008/04/large-scale-learning.html' title='Large Scale Learning'/><author><name>COMS 4771</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4294268551164982318.post-1765917616092641403</id><published>2008-03-28T08:28:00.000-07:00</published><updated>2008-03-28T08:46:39.915-07:00</updated><title type='text'>Active Learning</title><content type='html'>D. Cohn, L. Atlas, R. Ladner. &lt;a href="http://www.cs.cmu.edu/~cohn/psyche/selsampling.ps.Z"&gt;Improving generalization with active learning&lt;/a&gt;, Machine Learning, 15(2), 1994.&lt;br&gt;&lt;br /&gt;S. Dasgupta, D.J. Hsu, and C. Monteleoni. &lt;a href="http://www.cs.ucsd.edu/%7Edjhsu/cal.html"&gt;A general agnostic active learning algorithm&lt;/a&gt;.&lt;br /&gt;&lt;em&gt; Neural Information Processing Systems (NIPS)&lt;/em&gt;, 2007.&lt;br /&gt;&lt;p&gt; S. Dasgupta. &lt;a href="http://www-cse.ucsd.edu/%7Edasgupta/papers/sample.ps"&gt;Coarse sample complexity bounds for active learning&lt;/a&gt;.&lt;br /&gt;&lt;em&gt; Neural Information Processing Systems (NIPS)&lt;/em&gt;, 2005. &lt;/p&gt;&lt;p&gt; S. Dasgupta, A. Kalai, and C. Monteleoni. &lt;a href="http://www-cse.ucsd.edu/%7Edasgupta/papers/DKMcolt05.ps"&gt;Analysis of perceptron-based active learning&lt;/a&gt;.&lt;br /&gt;&lt;em&gt;Eighteenth Annual Conference on Learning Theory (COLT)&lt;/em&gt;, 2005.&lt;/p&gt;&lt;p&gt;The algorithm A&lt;sup&gt;2&lt;/sup&gt; described in the class:&lt;br /&gt;M-F. Balcan, A. Beygelzimer, and J. Langford. &lt;a href="http://www.blogger.com/hunch.net/%7Ejl/projects/agnostic_active/agnostic-active.pdf"&gt;Agnostic Active Learning&lt;/a&gt;, ICML 2006.&lt;br /&gt;&lt;a href="http://videolectures.net/mlss06tw_langford_aal/"&gt;Video lecture&lt;/a&gt; on A&lt;sup&gt;2&lt;/sup&gt;. &lt;/p&gt;&lt;p&gt;Hanneke, S. (2007).  &lt;a href="http://www.cs.cmu.edu/%7Eshanneke/docs/2007/hanneke-teaching-dimension.pdf"&gt;Teaching Dimension and the Complexity of Active Learning.&lt;/a&gt; In proceedings of the 20&lt;sup&gt;th&lt;/sup&gt; Annual Conference on Learning Theory (COLT).&lt;br&gt;&lt;br /&gt;Hanneke, S. (2007).  &lt;a href="http://www.cs.cmu.edu/%7Eshanneke/docs/2007/hanneke-agnostic-active.pdf"&gt; A Bound on the Label Complexity of Agnostic Active Learning.&lt;/a&gt; In proceedings of the 24&lt;sup&gt;th&lt;/sup&gt; Annual International Conference on Machine Learning (ICML).&lt;br&gt;&lt;br /&gt;M-F. Balcan, A. Broder, and T. Zhang. &lt;a href="http://www.cs.cmu.edu/%7Eninamf/papers/active-ls.pdf"&gt;Margin Based Active Learning&lt;/a&gt;, COLT 2007.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4294268551164982318-1765917616092641403?l=coms-4771.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://coms-4771.blogspot.com/feeds/1765917616092641403/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4294268551164982318&amp;postID=1765917616092641403' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4294268551164982318/posts/default/1765917616092641403'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4294268551164982318/posts/default/1765917616092641403'/><link rel='alternate' type='text/html' href='http://coms-4771.blogspot.com/2008/03/active-learning.html' title='Active Learning'/><author><name>COMS 4771</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4294268551164982318.post-1886721251473543617</id><published>2008-03-14T19:59:00.000-07:00</published><updated>2008-03-14T20:13:18.475-07:00</updated><title type='text'>Midterm statistics</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://bp0.blogger.com/_8Lk5cF9qemw/R9s-LOJjp-I/AAAAAAAAAJQ/TAy0rqGVG3s/s1600-h/grades.png"&gt;&lt;img style="margin: 0pt 10px 10px 0pt; float: left; cursor: pointer;" src="http://bp0.blogger.com/_8Lk5cF9qemw/R9s-LOJjp-I/AAAAAAAAAJQ/TAy0rqGVG3s/s320/grades.png" alt="" id="BLOGGER_PHOTO_ID_5177800559090182114" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Midterm statistics:&lt;br /&gt;&lt;br /&gt;Average grade: 17.14&lt;br /&gt;Median grade: 18&lt;br /&gt;Min: 0&lt;br /&gt;Max: 33&lt;br /&gt;Total number of students: 53&lt;br /&gt;&lt;br /&gt;Deergha will email the individual grades.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4294268551164982318-1886721251473543617?l=coms-4771.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://coms-4771.blogspot.com/feeds/1886721251473543617/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4294268551164982318&amp;postID=1886721251473543617' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4294268551164982318/posts/default/1886721251473543617'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4294268551164982318/posts/default/1886721251473543617'/><link rel='alternate' type='text/html' href='http://coms-4771.blogspot.com/2008/03/midterm-statistics-average-grade-17.html' title='Midterm statistics'/><author><name>COMS 4771</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://bp0.blogger.com/_8Lk5cF9qemw/R9s-LOJjp-I/AAAAAAAAAJQ/TAy0rqGVG3s/s72-c/grades.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4294268551164982318.post-3981291794877943518</id><published>2008-03-08T12:37:00.000-08:00</published><updated>2008-03-08T13:10:25.201-08:00</updated><title type='text'>Midterm</title><content type='html'>The midterm is worth 25 points (25% of your final grade).&lt;br /&gt;&lt;br /&gt;There will be three problems (8 points each) and three simple exercises (3 points each).  Thus if you solve any two problems + all exercises, you will still get the maximum grade.  If you do everything, you will get 8 bonus points.&lt;br /&gt;&lt;br /&gt;The material covered in the following lectures will NOT be on the midterm:&lt;br /&gt;&lt;ol&gt;&lt;li&gt;Generalization bounds (the lecture before the midterm)&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Reductions&lt;/li&gt;&lt;li&gt;Sanjoy's talk on projections, Gaussian scale mixtures, and the EM&lt;/li&gt;&lt;/ol&gt;One of the problems will be on the semantics of loss functions.  You should be able to answer questions of the form: What loss function does the following quantity minimize?  (See this &lt;a href="http://coms-4771.blogspot.com/2008/02/conditional-means-medians-and-all-that.html"&gt;post&lt;/a&gt;, for example.)&lt;br /&gt;&lt;br /&gt;The other two problems will be on any two of the following algorithms:&lt;br /&gt;&lt;ol&gt;&lt;li&gt;Winnow&lt;/li&gt;&lt;li&gt;Halving&lt;br /&gt;&lt;/li&gt;&lt;li&gt;(Randomized) Weighted Majority&lt;/li&gt;&lt;li&gt;Perceptron&lt;/li&gt;&lt;/ol&gt;The exercises will cover the rest of the material.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4294268551164982318-3981291794877943518?l=coms-4771.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://coms-4771.blogspot.com/feeds/3981291794877943518/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4294268551164982318&amp;postID=3981291794877943518' title='6 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4294268551164982318/posts/default/3981291794877943518'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4294268551164982318/posts/default/3981291794877943518'/><link rel='alternate' type='text/html' href='http://coms-4771.blogspot.com/2008/03/midterm.html' title='Midterm'/><author><name>COMS 4771</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>6</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4294268551164982318.post-8359913828202472814</id><published>2008-02-04T18:01:00.001-08:00</published><updated>2008-02-04T20:02:21.086-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='absolute loss'/><category scheme='http://www.blogger.com/atom/ns#' term='quantile regression'/><category scheme='http://www.blogger.com/atom/ns#' term='squared loss'/><category scheme='http://www.blogger.com/atom/ns#' term='regression'/><title type='text'>Conditional means, medians and all that</title><content type='html'>&lt;span style="font-family: arial;font-family:times new roman;font-size:100%;"  &gt;Let D be a distribution over &lt;span style="font-style: italic;"&gt;X&lt;/span&gt; &lt;/span&gt;&lt;span style="font-family: arial;font-family:times new roman;font-size:100%;"  &gt;x&lt;span style="font-style: italic;"&gt; Y&lt;/span&gt;, where &lt;span style="font-style: italic;"&gt;X&lt;/span&gt; is some feature space and &lt;span style="font-style: italic;"&gt;Y&lt;/span&gt; is a real-valued label.  It is well known that the minimizer of the squared loss is the conditional mean mean(x) = &lt;/span&gt;&lt;span style="font-weight: bold; font-family: arial;font-family:times new roman;font-size:100%;"  &gt;E&lt;/span&gt;&lt;span style="font-family: arial;font-family:times new roman;font-size:100%;"  &gt;[y], where the expectation is over the draw of y from &lt;span style="font-style: italic;"&gt;D&lt;/span&gt;|x.  It's your homework problem number 3.  (For those of you who are overly concerned with rigor, assume that X and Y are closed subsets of R^m and R, respectively.  This is clearly true for any machine representation of X and Y.)&lt;br /&gt;&lt;br /&gt;In many situations, the mean is not sufficiently robust, and one is actually interested in the conditional median (see &lt;a href="http://books.google.com/books?id=nKVBXePWtSoC&amp;amp;printsec=frontcover&amp;amp;dq=quantile+regression&amp;amp;ei=eMmnR-bFDYbQiwHpnZSoCg&amp;amp;sig=pP3Tt_tVfFoxiaFdQkfSWxlw6qI"&gt;this&lt;/a&gt; book).  The conditional median is formally defined by median(x) = {q in R : D(Y ≤ q | X = x) ≥ 1/2 and D (Y ≥ q | X = x) ≥ 1/2}.   The median is also the solution to the problem of minimizing &lt;span style="font-weight: bold;"&gt;E&lt;/span&gt;|f(x)-y|, where the expectation is with respect to D.  &lt;/span&gt;&lt;span style="font-family: arial;font-family:times new roman;font-size:100%;"  &gt;Note that the median may not be unique when the conditional distribution has regions with zero mass.&lt;br /&gt;&lt;br /&gt;The argument goes like this:  Fix x.  Consider two equal point masses (with respect to D|x) at locations y1 and y2 in R.  The loss suffered by any prediction y in the interval [y1,y2] is (y-y1)+(y2-y) = (y2-y1).  Any y not in [y1,y2] always induces a larger loss, so the minimizer is in [y1,y2].  Since we can break D|x and break it into equal mass pairs with y2 above and y1 below the median (by the definition of the median), the absolute error loss is minimized when f(x) = median(x).  A similar argument holds for discrete random variables (special case).&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4294268551164982318-8359913828202472814?l=coms-4771.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://coms-4771.blogspot.com/feeds/8359913828202472814/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4294268551164982318&amp;postID=8359913828202472814' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4294268551164982318/posts/default/8359913828202472814'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4294268551164982318/posts/default/8359913828202472814'/><link rel='alternate' type='text/html' href='http://coms-4771.blogspot.com/2008/02/conditional-means-medians-and-all-that.html' title='Conditional means, medians and all that'/><author><name>COMS 4771</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4294268551164982318.post-8592299293097741001</id><published>2008-02-04T17:02:00.000-08:00</published><updated>2008-02-04T17:22:23.334-08:00</updated><title type='text'>Reductions</title><content type='html'>&lt;ul  style="font-family:arial;"&gt;&lt;li&gt;&lt;span style="font-size:100%;"&gt;John Langford's &lt;a href="http://hunch.net/%7Ejl/projects/reductions/reductions.html"&gt;reductions page&lt;/a&gt; (including a link to John's video lectures at different &lt;a href="http://www.mlss.cc/"&gt;MLSS&lt;/a&gt;)&lt;br /&gt;&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span style="font-size:100%;"&gt;Eun Bae Kong and Thomas Dietterich. &lt;a href="http://web.engr.oregonstate.edu/%7Etgd/publications/ml95-why.ps.gz"&gt;Error-Correcting Output Coding corrects bias and variance&lt;/a&gt;, ICML 95.&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span style="font-size:100%;"&gt;Amit Sahai and Venkat Guruswami. &lt;a href="http://www.cs.ucla.edu/%7Esahai/work/colt99boost.ps"&gt;Multiclass Learning, Boosting, and Error-correcting codes, COLT 1999.&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span style="font-size:100%;"&gt;See Yoram Singer's &lt;a href="http://www.cs.huji.ac.il/%7Esinger/pub.html"&gt;papers&lt;/a&gt; on multiclass learning, including the two below.&lt;br /&gt;&lt;/span&gt;&lt;/li&gt;&lt;li&gt;Erin Allwein, Robert Schapire, Yoram Singer. &lt;span style="font-size:100%;"&gt;&lt;a href="http://www.cs.huji.ac.il/%7Esinger/papers/ocmargin.ps.gz"&gt;Reducing Multiclass to Binary: A Unifying Approach for Margin Classifiers,&lt;/a&gt; Journal of Machine Learning Research, 2000&lt;/span&gt;&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;span style="font-size:100%;"&gt;  Koby Crammer and  Yoram Singer. &lt;a href="http://www.cs.huji.ac.il/%7Esinger/papers/ocd.ps.gz"&gt;On the Learnability and Design of Output Codes     for Multiclass Problems,&lt;/a&gt; Machine Learning.&lt;/span&gt;&lt;/li&gt;&lt;li&gt; Robert E. Schapire. &lt;a href="http://www.cs.princeton.edu/%7Eschapire/uncompress-papers.cgi/Schapire97.ps"&gt;&lt;strong style="font-weight: normal;"&gt;Using output codes to boost multiclass learning problems&lt;/strong&gt;&lt;/a&gt;, ICML 9&lt;span style="text-decoration: underline;"&gt;7.&lt;/span&gt;&lt;a href="http://www.cs.princeton.edu/%7Eschapire/uncompress-papers.cgi/Schapire97.ps"&gt;&lt;br /&gt;&lt;/a&gt;&lt;/li&gt;&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4294268551164982318-8592299293097741001?l=coms-4771.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://coms-4771.blogspot.com/feeds/8592299293097741001/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4294268551164982318&amp;postID=8592299293097741001' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4294268551164982318/posts/default/8592299293097741001'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4294268551164982318/posts/default/8592299293097741001'/><link rel='alternate' type='text/html' href='http://coms-4771.blogspot.com/2008/02/reductions.html' title='Reductions'/><author><name>COMS 4771</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4294268551164982318.post-8281762742613800457</id><published>2007-12-18T10:50:00.000-08:00</published><updated>2008-01-20T16:58:30.831-08:00</updated><title type='text'>First class</title><content type='html'>Tue, Jan 22, 2:40pm-3:55pm, Hamilton Hall, Room 503.&lt;span style="font-size:100%;"&gt;&lt;br /&gt;&lt;br /&gt;Please go through &lt;a href="http://hunch.net/%7Ecoms-4771/lectures.html"&gt;these&lt;/a&gt; notes to refresh your memory on probability and statistics (if you need a refresher).&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;span style="font-size:100%;"&gt;&lt;a href="http://www.netflixprize.com/"&gt;Netflix prize&lt;/a&gt; links:&lt;br /&gt;&lt;a href="http://www.research.att.com/%7Evolinsky/netflix/index.html"&gt;KorBell&lt;/a&gt;/BellKor winning team ($50,000 Progress Prize)&lt;br /&gt;&lt;a href="http://www.netflixprize.com/leaderboard"&gt;Leaderboard&lt;/a&gt;&lt;/span&gt;&lt;span style="font-size:100%;"&gt; (follow the links to learn about individual teams)&lt;br /&gt;&lt;br /&gt;Deanonymizing the &lt;/span&gt;&lt;span style="font-size:100%;"&gt;Netflix dataset:&lt;br /&gt;A couple security researchers &lt;a href="http://arxiv.org/abs/cs/0610105"&gt;claim to have cracked the netflix dataset&lt;/a&gt; (&lt;/span&gt;&lt;span style="font-size:100%;"&gt;&lt;a href="http://hunch.net/index.php?s=netflix"&gt;discussion at John Langford's blog&lt;/a&gt;).&lt;/span&gt;&lt;span style="font-size:85%;"&gt;&lt;span style="font-size:100%;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;--Alina&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4294268551164982318-8281762742613800457?l=coms-4771.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://coms-4771.blogspot.com/feeds/8281762742613800457/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4294268551164982318&amp;postID=8281762742613800457' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4294268551164982318/posts/default/8281762742613800457'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4294268551164982318/posts/default/8281762742613800457'/><link rel='alternate' type='text/html' href='http://coms-4771.blogspot.com/2007/12/first-class.html' title='First class'/><author><name>COMS 4771</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry></feed>
