<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/'><id>tag:blogger.com,1999:blog-3366935554564939610.post8182634975616321155..comments</id><updated>2008-12-06T14:47:03.848-05:00</updated><category term='Web analytics'/><category term='Netflix'/><category term='Not Data Mining'/><category term='SQL'/><category term='SAS Code'/><category term='MapReduce'/><category term='forecasting'/><category term='Ab Initio'/><category term='Ask a data miner'/><category term='Data Mining'/><category term='Assocation Rules'/><category term='In The News'/><category term='Interview'/><category term='health care'/><category term='Enterprise Miner'/><category term='user question'/><category term='Conferences'/><category term='Memory Based Reasoning'/><category term='hierarchical modeling'/><category term='Survival Analysis'/><category term='Hadoop'/><category term='Our Books'/><category term='Clustering'/><category term='Privacy'/><category term='marketing'/><category term='statistics'/><category term='Principal Components'/><category term='gordon'/><category term='J'/><category term='Michael'/><category term='database'/><category term='Excel'/><category term='Neural Networks'/><title type='text'>Comments on Data Miners Blog: Using SQL to Emulate MapReduce Functionality</title><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://blog.data-miners.com/feeds/8182634975616321155/comments/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3366935554564939610/8182634975616321155/comments/default'/><link rel='alternate' type='text/html' href='http://blog.data-miners.com/2008/02/using-sql-to-emulate-mapreduce.html'/><author><name>Michael J. A. Berry</name><uri>http://www.blogger.com/profile/06077102677195066016</uri><email>noreply@blogger.com</email><gd:image xmlns:gd='http://schemas.google.com/g/2005' rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='21' src='http://bp3.blogger.com/_3z621iWNCuE/R8wieIFZ7VI/AAAAAAAAAAM/JjoJs2TTnE8/S220/mjabBE.jpg'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>2</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>25</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-3366935554564939610.post-8534343603600246433</id><published>2008-12-06T13:56:00.000-05:00</published><updated>2008-12-06T13:56:00.000-05:00</updated><title type='text'>One can do it in Oracle SQL this way:&lt;br&gt;&lt;br&gt;creat...</title><content type='html'>One can do it in Oracle SQL this way:&lt;BR/&gt;&lt;BR/&gt;create table productx &lt;BR/&gt;(id number, description clob)&lt;BR/&gt;/&lt;BR/&gt;&lt;BR/&gt;create or replace type t_vc2_50 as table of varchar2(50)&lt;BR/&gt;/&lt;BR/&gt;&lt;BR/&gt;select b.column_value token&lt;BR/&gt;,      count(*)&lt;BR/&gt;from &lt;BR/&gt;(select &amp;#39;,&amp;#39;||description||&amp;#39;,&amp;#39; txt &lt;BR/&gt; from productx ) t&lt;BR/&gt;,table ( cast ( multiset &lt;BR/&gt; ( select substr (txt,&lt;BR/&gt;   instr (txt, &amp;#39;,&amp;#39;, 1, level  ) + 1,&lt;BR/&gt;   instr (txt, &amp;#39;,&amp;#39;, 1, level+1) &lt;BR/&gt;   - instr (txt, &amp;#39;,&amp;#39;, 1, level) -1 ) token&lt;BR/&gt;   from dual&lt;BR/&gt;   connect by level &amp;lt;= length(t.txt)&lt;BR/&gt;   -length(replace(t.txt,&amp;#39;,&amp;#39;,&amp;#39;&amp;#39;))-1&lt;BR/&gt; ) as t_vc2_50 ) ) b&lt;BR/&gt;group by b.column_value&lt;BR/&gt;/&lt;BR/&gt;&lt;BR/&gt;&lt;BR/&gt;The connect by level &amp;lt;= .. is a way to generate a table with numbers from 1 to n. (the enumerate table)&lt;BR/&gt;&lt;BR/&gt;One could also use user defined funtions. &lt;BR/&gt;&lt;BR/&gt;RC</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3366935554564939610/8182634975616321155/comments/default/8534343603600246433'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3366935554564939610/8182634975616321155/comments/default/8534343603600246433'/><link rel='alternate' type='text/html' href='http://blog.data-miners.com/2008/02/using-sql-to-emulate-mapreduce.html?showComment=1228589760000#c8534343603600246433' title=''/><author><name>Anonymous</name><email>noreply@blogger.com</email><gd:image xmlns:gd='http://schemas.google.com/g/2005' rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img1.blogblog.com/img/blank.gif'/></author><thr:in-reply-to xmlns:thr='http://purl.org/syndication/thread/1.0' href='http://blog.data-miners.com/2008/02/using-sql-to-emulate-mapreduce.html' ref='tag:blogger.com,1999:blog-3366935554564939610.post-8182634975616321155' source='http://www.blogger.com/feeds/3366935554564939610/posts/default/8182634975616321155' type='text/html'/><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='blogger.itemClass' value='pid-770764820'/></entry><entry><id>tag:blogger.com,1999:blog-3366935554564939610.post-7300886778865751642</id><published>2008-07-05T17:01:00.000-04:00</published><updated>2008-07-05T17:01:00.000-04:00</updated><title type='text'>Hi Gordon, &lt;br&gt;&lt;br&gt;Great post! This is one of the ...</title><content type='html'>Hi Gordon, &lt;BR/&gt;&lt;BR/&gt;Great post! This is one of the most interesting observations I've seen on the Web in many months. &lt;BR/&gt;&lt;BR/&gt;I think it can be done with SSIS/UDFs in 2008 and sparse tables.</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3366935554564939610/8182634975616321155/comments/default/7300886778865751642'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3366935554564939610/8182634975616321155/comments/default/7300886778865751642'/><link rel='alternate' type='text/html' href='http://blog.data-miners.com/2008/02/using-sql-to-emulate-mapreduce.html?showComment=1215291660000#c7300886778865751642' title=''/><author><name>Anonymous</name><email>noreply@blogger.com</email><gd:image xmlns:gd='http://schemas.google.com/g/2005' rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img1.blogblog.com/img/blank.gif'/></author><thr:in-reply-to xmlns:thr='http://purl.org/syndication/thread/1.0' href='http://blog.data-miners.com/2008/02/using-sql-to-emulate-mapreduce.html' ref='tag:blogger.com,1999:blog-3366935554564939610.post-8182634975616321155' source='http://www.blogger.com/feeds/3366935554564939610/posts/default/8182634975616321155' type='text/html'/><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='blogger.itemClass' value='pid-1558112172'/></entry></feed>
