<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Warp Speed Computing</title>
	<atom:link href="http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/feed/" rel="self" type="application/rss+xml" />
	<link>http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/</link>
	<description>Random samplings from a universe of ideas.</description>
	<lastBuildDate>Mon, 13 Feb 2012 22:30:13 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
	<item>
		<title>By: /home/avaloncio :) &#187; Blog Archive &#187; Lo que viene&#8230; los Procesadores Warp</title>
		<link>http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/comment-page-1/#comment-33009</link>
		<dc:creator>/home/avaloncio :) &#187; Blog Archive &#187; Lo que viene&#8230; los Procesadores Warp</dc:creator>
		<pubDate>Fri, 07 Dec 2007 21:36:53 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/#comment-33009</guid>
		<description>[...] (Vía .............................. ... ... ... ... ... ... ... ... ... ... Cosmic Variance) [...]</description>
		<content:encoded><![CDATA[<p>[...] (Vía &#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230; &#8230; &#8230; &#8230; &#8230; &#8230; &#8230; &#8230; &#8230; &#8230; &#8230; Cosmic Variance) [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: John Quantum</title>
		<link>http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/comment-page-1/#comment-33008</link>
		<dc:creator>John Quantum</dc:creator>
		<pubDate>Fri, 07 Dec 2007 07:26:01 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/#comment-33008</guid>
		<description>Technology like this, plus technology that is more quantum-field attuned would bridge to eacother pretty good. Someday, there can be file storage and some processing that is within the quantum field of a small computer that defines and processes data through the field around it. Just as we are processors and vehicles for the information that is stored in the quantum field around us. I envision a future of merging the divine with computers. This warp speed computing idea is pretty good. I foresee a program where someone can chat with higher dimensional beings with their computer or a whole array of applications. Since we live in an electrical universe, computers are electrical too.</description>
		<content:encoded><![CDATA[<p>Technology like this, plus technology that is more quantum-field attuned would bridge to eacother pretty good. Someday, there can be file storage and some processing that is within the quantum field of a small computer that defines and processes data through the field around it. Just as we are processors and vehicles for the information that is stored in the quantum field around us. I envision a future of merging the divine with computers. This warp speed computing idea is pretty good. I foresee a program where someone can chat with higher dimensional beings with their computer or a whole array of applications. Since we live in an electrical universe, computers are electrical too.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Kaleberg</title>
		<link>http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/comment-page-1/#comment-32994</link>
		<dc:creator>Kaleberg</dc:creator>
		<pubDate>Thu, 25 Oct 2007 23:24:50 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/#comment-32994</guid>
		<description>Sussman and his people at MIT used to talk about the dynamicist&#039;s workbench back in the 1980s. They were trying to raise the computer power to human reason ratio. There were quite interested in dynamic rewiring for special purpose computations. As a side project, Sussman and Wisdom built a digital orrery to study the orbit of Pluto. Basically, the idea was to wire up an old fashioned analog computer for planetary dynamics, but have it do digital computation, and be subject to digital error analysis. (They had to kludge the time interval so that numerical errors didn&#039;t break energy conservation).

Some of the old school analog computation people were really into this idea. They visualized computations as wires and nodes. I remember one mechanical engineering professor, who was a big fan of nomograms. He wanted a set of networkable blocks so he could build a computer program on his desk top and have it run digitally. We could probably do this today, but few people are used to thinking about algorithms that way anymore, despite the inherent parallellism that such thinking requires. This was back in the early 1970s, so this idea has old roots.</description>
		<content:encoded><![CDATA[<p>Sussman and his people at MIT used to talk about the dynamicist&#8217;s workbench back in the 1980s. They were trying to raise the computer power to human reason ratio. There were quite interested in dynamic rewiring for special purpose computations. As a side project, Sussman and Wisdom built a digital orrery to study the orbit of Pluto. Basically, the idea was to wire up an old fashioned analog computer for planetary dynamics, but have it do digital computation, and be subject to digital error analysis. (They had to kludge the time interval so that numerical errors didn&#8217;t break energy conservation).</p>
<p>Some of the old school analog computation people were really into this idea. They visualized computations as wires and nodes. I remember one mechanical engineering professor, who was a big fan of nomograms. He wanted a set of networkable blocks so he could build a computer program on his desk top and have it run digitally. We could probably do this today, but few people are used to thinking about algorithms that way anymore, despite the inherent parallellism that such thinking requires. This was back in the early 1970s, so this idea has old roots.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Procesadores inteligentes que se &#8220;auto-configuran&#8221; para ir más rápido &#171; The ZeRoX Blog</title>
		<link>http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/comment-page-1/#comment-32993</link>
		<dc:creator>Procesadores inteligentes que se &#8220;auto-configuran&#8221; para ir más rápido &#171; The ZeRoX Blog</dc:creator>
		<pubDate>Wed, 24 Oct 2007 21:38:34 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/#comment-32993</guid>
		<description>[...] (Vía .............................. ... ... ... ... ... ... ... ... ... ... Cosmic Variance) [...]</description>
		<content:encoded><![CDATA[<p>[...] (Vía &#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230;&#8230; &#8230; &#8230; &#8230; &#8230; &#8230; &#8230; &#8230; &#8230; &#8230; &#8230; Cosmic Variance) [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Torbjörn Larsson, OM</title>
		<link>http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/comment-page-1/#comment-33007</link>
		<dc:creator>Torbjörn Larsson, OM</dc:creator>
		<pubDate>Mon, 22 Oct 2007 19:49:13 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/#comment-33007</guid>
		<description>&lt;blockquote&gt;
Rather, a good idea frequently exposes yet another bottleneck that was hidden before.
&lt;/blockquote&gt;

Yes, but that isn&#039;t a matter of complaint as such.

When you say that DRAMs are substantially slower, don&#039;t you mean that large memory space and subsequent deep data channels are the problems? I&#039;m not sure how to tackle that one though - it seems as least as hard a problem to break down as the serial-to-parallel challenge is.</description>
		<content:encoded><![CDATA[<blockquote><p>
Rather, a good idea frequently exposes yet another bottleneck that was hidden before.
</p></blockquote>
<p>Yes, but that isn&#8217;t a matter of complaint as such.</p>
<p>When you say that DRAMs are substantially slower, don&#8217;t you mean that large memory space and subsequent deep data channels are the problems? I&#8217;m not sure how to tackle that one though &#8211; it seems as least as hard a problem to break down as the serial-to-parallel challenge is.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: links for 2007-10-22 &#171; Chatquah and Galoshes</title>
		<link>http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/comment-page-1/#comment-33006</link>
		<dc:creator>links for 2007-10-22 &#171; Chatquah and Galoshes</dc:creator>
		<pubDate>Mon, 22 Oct 2007 15:24:17 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/#comment-33006</guid>
		<description>[...] another &#8220;advance&#8221; in computing basically more specialized processors attached to your microprocessor (tags: computing technology) [...]</description>
		<content:encoded><![CDATA[<p>[...] another &#8220;advance&#8221; in computing basically more specialized processors attached to your microprocessor (tags: computing technology) [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: jt</title>
		<link>http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/comment-page-1/#comment-33005</link>
		<dc:creator>jt</dc:creator>
		<pubDate>Mon, 22 Oct 2007 15:07:23 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/#comment-33005</guid>
		<description>In general (for traditional FPGAs) you&#039;re talking milliseconds to seconds for full configuration.  Reconfiguration can be much faster.  For an architecture like this, though, the logic fabric is smallish, and it&#039;s very tightly coupled to the CPU.  You&#039;d still be looking at a it, but nowhere near as big.


Given that before the FPGA gets programmed, enough statistics have to be collected by the profiling unit to determine if it&#039;s worthwhile, it&#039;ll only be reprogrammed if it&#039;s worth it.  This is (roughly)  the scheme used for Just-In-Time compilation for CPU emulation, and it works there.

--jt</description>
		<content:encoded><![CDATA[<p>In general (for traditional FPGAs) you&#8217;re talking milliseconds to seconds for full configuration.  Reconfiguration can be much faster.  For an architecture like this, though, the logic fabric is smallish, and it&#8217;s very tightly coupled to the CPU.  You&#8217;d still be looking at a it, but nowhere near as big.</p>
<p>Given that before the FPGA gets programmed, enough statistics have to be collected by the profiling unit to determine if it&#8217;s worthwhile, it&#8217;ll only be reprogrammed if it&#8217;s worth it.  This is (roughly)  the scheme used for Just-In-Time compilation for CPU emulation, and it works there.</p>
<p>&#8211;jt</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Haelfix</title>
		<link>http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/comment-page-1/#comment-32992</link>
		<dc:creator>Haelfix</dc:creator>
		<pubDate>Mon, 22 Oct 2007 15:00:46 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/#comment-32992</guid>
		<description>How long does it take the main cpu to write the instruction code to the FPGA in general?  I&#039;d imagine that would be the huge bottleneck for mainstream apps.</description>
		<content:encoded><![CDATA[<p>How long does it take the main cpu to write the instruction code to the FPGA in general?  I&#8217;d imagine that would be the huge bottleneck for mainstream apps.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: hostab &#187; Blog Archive &#187; Comment on Warp Speed Computing by Jason Dick</title>
		<link>http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/comment-page-1/#comment-33004</link>
		<dc:creator>hostab &#187; Blog Archive &#187; Comment on Warp Speed Computing by Jason Dick</dc:creator>
		<pubDate>Mon, 22 Oct 2007 14:43:32 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/#comment-33004</guid>
		<description>[...] the details here  Author  Comments [...]</description>
		<content:encoded><![CDATA[<p>[...] the details here  Author  Comments [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: jt</title>
		<link>http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/comment-page-1/#comment-33003</link>
		<dc:creator>jt</dc:creator>
		<pubDate>Mon, 22 Oct 2007 14:13:32 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/#comment-33003</guid>
		<description>No FH, you&#039;re right, if it all works.

But such automatic compilation of designs has been the &quot;holy grail&quot; of not just the FPGA world, but the entire parallel/scientific computing field for 40 years.  It&#039;s just really hard to extract more than a few way parallelism from non-trivial problems that are stated serially, even without the memory bottleneck.  And it&#039;s hard as hades for most humans to express parallel algorithms as such.

There has been a LOT of work done here, and every high-level or automatic design compiler I&#039;ve seen makes a major tradeoff in either flexibility, performance, ease of use.

Usually, you can only pick one (or at most 2).
(e.g. Mitrionics, which focuses on ease-of-use,Handel-C which goes for flexibility, Viva, which is pretty good trades flexibility for a decent enough performance and )

The trouble I see here is in the target audience-- if you have a major scientific computation to perform that can be accelerated with FPGAs, why would you use this custom processor, when for a little more effort you can get an FPGA designer who can make a custom design which will probably outperform this guy.  If you have a webserver with many trivially parallel tasks, unless they map to what the desing compiler wants to see, you&#039;ve wasted your silicon-- it probably makes sense to buy another COTS box &#039;o processors.

You&#039;re spending a lot of silicon on a design tool that you&#039;ll need once in a while-- don&#039;t get me wrong, it&#039;s very cool to have it on a chip, and it&#039;s a step towards &quot;evolvable, self-programming hardware&quot;, but I&#039;m not sure of the practicality.

Bioinformatics &amp; network security is one place where this might do very well.  You have lots of string and pattern matching which avoids floating point (which currently kills FPGAs), very regular memory accesses and very simple operations (counters, comparators and address generators).  But agian, the simplicity of the app makes me question whether it isn&#039;t worth the designer time vs. this automatic solution.

--jt</description>
		<content:encoded><![CDATA[<p>No FH, you&#8217;re right, if it all works.</p>
<p>But such automatic compilation of designs has been the &#8220;holy grail&#8221; of not just the FPGA world, but the entire parallel/scientific computing field for 40 years.  It&#8217;s just really hard to extract more than a few way parallelism from non-trivial problems that are stated serially, even without the memory bottleneck.  And it&#8217;s hard as hades for most humans to express parallel algorithms as such.</p>
<p>There has been a LOT of work done here, and every high-level or automatic design compiler I&#8217;ve seen makes a major tradeoff in either flexibility, performance, ease of use.</p>
<p>Usually, you can only pick one (or at most 2).<br />
(e.g. Mitrionics, which focuses on ease-of-use,Handel-C which goes for flexibility, Viva, which is pretty good trades flexibility for a decent enough performance and )</p>
<p>The trouble I see here is in the target audience&#8211; if you have a major scientific computation to perform that can be accelerated with FPGAs, why would you use this custom processor, when for a little more effort you can get an FPGA designer who can make a custom design which will probably outperform this guy.  If you have a webserver with many trivially parallel tasks, unless they map to what the desing compiler wants to see, you&#8217;ve wasted your silicon&#8211; it probably makes sense to buy another COTS box &#8216;o processors.</p>
<p>You&#8217;re spending a lot of silicon on a design tool that you&#8217;ll need once in a while&#8211; don&#8217;t get me wrong, it&#8217;s very cool to have it on a chip, and it&#8217;s a step towards &#8220;evolvable, self-programming hardware&#8221;, but I&#8217;m not sure of the practicality.</p>
<p>Bioinformatics &amp; network security is one place where this might do very well.  You have lots of string and pattern matching which avoids floating point (which currently kills FPGAs), very regular memory accesses and very simple operations (counters, comparators and address generators).  But agian, the simplicity of the app makes me question whether it isn&#8217;t worth the designer time vs. this automatic solution.</p>
<p>&#8211;jt</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: fh</title>
		<link>http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/comment-page-1/#comment-33002</link>
		<dc:creator>fh</dc:creator>
		<pubDate>Mon, 22 Oct 2007 12:58:41 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/#comment-33002</guid>
		<description>SW but isn&#039;t that the point though? FPGAs have been around for a long time but the need to &quot;handwrite&quot; its code limits it to specialized fields.
Automation would mean that it would run transparently to the application programer and a sleuth of current programs could benefit.

In other words, its main achievement is to change the economics of speeding up programs using RC, because it shifts the weight of coding the reconfiguration from the application developer (who is human, and thus very expensive) to the CPU (which is silicon and very very cheap, albeit of course much less capable then the human)

Or am I misreading this?</description>
		<content:encoded><![CDATA[<p>SW but isn&#8217;t that the point though? FPGAs have been around for a long time but the need to &#8220;handwrite&#8221; its code limits it to specialized fields.<br />
Automation would mean that it would run transparently to the application programer and a sleuth of current programs could benefit.</p>
<p>In other words, its main achievement is to change the economics of speeding up programs using RC, because it shifts the weight of coding the reconfiguration from the application developer (who is human, and thus very expensive) to the CPU (which is silicon and very very cheap, albeit of course much less capable then the human)</p>
<p>Or am I misreading this?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Puppy</title>
		<link>http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/comment-page-1/#comment-33001</link>
		<dc:creator>Puppy</dc:creator>
		<pubDate>Mon, 22 Oct 2007 12:14:40 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/#comment-33001</guid>
		<description>I believe that transmeta tried to use software translation to traslate from i.e. x86 intructions-set to their own native VLIW-instruction-set. In other words, the operating system and the CPU didn&#039;t actually run the same intruction-set.

I think that this chip lives on as VIA EDEN, but I don&#039;t know if it still features the software translation that was the main feature of the transmeta chip. If I remember, VIA EDEN is featured as producing low power consumption (and of course not much heat).</description>
		<content:encoded><![CDATA[<p>I believe that transmeta tried to use software translation to traslate from i.e. x86 intructions-set to their own native VLIW-instruction-set. In other words, the operating system and the CPU didn&#8217;t actually run the same intruction-set.</p>
<p>I think that this chip lives on as VIA EDEN, but I don&#8217;t know if it still features the software translation that was the main feature of the transmeta chip. If I remember, VIA EDEN is featured as producing low power consumption (and of course not much heat).</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Arun</title>
		<link>http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/comment-page-1/#comment-33000</link>
		<dc:creator>Arun</dc:creator>
		<pubDate>Mon, 22 Oct 2007 11:31:46 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/#comment-33000</guid>
		<description>To reiterate Jick&#039;s point, unless your data set is small or the required fetches from memory are extremely predictable, your warp speed processor is going to be spending most of its time twiddling its thumbs.</description>
		<content:encoded><![CDATA[<p>To reiterate Jick&#8217;s point, unless your data set is small or the required fetches from memory are extremely predictable, your warp speed processor is going to be spending most of its time twiddling its thumbs.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jason Dick</title>
		<link>http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/comment-page-1/#comment-32999</link>
		<dc:creator>Jason Dick</dc:creator>
		<pubDate>Mon, 22 Oct 2007 10:26:51 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/#comment-32999</guid>
		<description>I thought Transmeta focused on VLIW for their processors?

In any case, yeah, this seems to be a new algorithm based on an old idea.  I first heard about this sort of thing with Starbridge Systems about eight years ago or so:
http://www.starbridgesystems.com/

What seems interesting about this FPGA idea, to me, is that it seems like they are attempting to use an FPGA to speed up general processing, without reprogramming.  Now, this sort of thing rarely speeds up programming by a significant portion, but given how challenging it is to make new, faster processors, this sounds like an excellent idea to improve performance of general-purpose processors a few percent more.</description>
		<content:encoded><![CDATA[<p>I thought Transmeta focused on VLIW for their processors?</p>
<p>In any case, yeah, this seems to be a new algorithm based on an old idea.  I first heard about this sort of thing with Starbridge Systems about eight years ago or so:<br />
<a href="http://www.starbridgesystems.com/" rel="nofollow">http://www.starbridgesystems.com/</a></p>
<p>What seems interesting about this FPGA idea, to me, is that it seems like they are attempting to use an FPGA to speed up general processing, without reprogramming.  Now, this sort of thing rarely speeds up programming by a significant portion, but given how challenging it is to make new, faster processors, this sounds like an excellent idea to improve performance of general-purpose processors a few percent more.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ben</title>
		<link>http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/comment-page-1/#comment-32991</link>
		<dc:creator>Ben</dc:creator>
		<pubDate>Mon, 22 Oct 2007 10:17:23 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/#comment-32991</guid>
		<description>If memory serves, FPGAs were one of the unique aspects of Transmeta&#039;s microchip program back in the late 90s. Their site doesn&#039;t have much info on it any more (http://www.transmeta.com/tech/microip.html), but I believe this was the basis of their &quot;software-based microprocessor.&quot; At any rate, this does not seem to be a new idea, at least on the surface. Perhaps a new application of an existing idea. Transmeta&#039;s chip were renowned for optimizing themselves over time as they adapted to the host OS that was installed.</description>
		<content:encoded><![CDATA[<p>If memory serves, FPGAs were one of the unique aspects of Transmeta&#8217;s microchip program back in the late 90s. Their site doesn&#8217;t have much info on it any more (<a href="http://www.transmeta.com/tech/microip.html" rel="nofollow">http://www.transmeta.com/tech/microip.html</a>), but I believe this was the basis of their &#8220;software-based microprocessor.&#8221; At any rate, this does not seem to be a new idea, at least on the surface. Perhaps a new application of an existing idea. Transmeta&#8217;s chip were renowned for optimizing themselves over time as they adapted to the host OS that was installed.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: &#8220;Warp speed&#8221; computing? &#171; WCIT WIS</title>
		<link>http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/comment-page-1/#comment-32998</link>
		<dc:creator>&#8220;Warp speed&#8221; computing? &#171; WCIT WIS</dc:creator>
		<pubDate>Mon, 22 Oct 2007 07:23:24 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/#comment-32998</guid>
		<description>[...] &#8220;Warp speed&#8221;&#160;computing?  Zoals je het vaker hoort, zijn de beste ideeën bizar simpel. Onder deze &quot;trekkie&quot; term zou goed de toekomst van de computer zich kunnen verschuilen.FPGA’s (field-programmable gate array) zijn chipjes die gespecialiseerd zijn in een taak. Zijn kunnen maar een ding, maar dat kunnen ze heel erg snel. Ongeveer 1000 keer sneller dan een normale processor.2 onderzoekers van een universiteit in California hebben een architectuur bedacht waar zij beide technieken combineren: als een generieke CPU herkend dat een bepaalde taak vaak wordt herhaald, verplaats hij dat stukje code naar een FPGA. Ik zei toch dat het simpel was!   Meer info: hier. [...]</description>
		<content:encoded><![CDATA[<p>[...] &#8220;Warp speed&#8221;&nbsp;computing?  Zoals je het vaker hoort, zijn de beste ideeën bizar simpel. Onder deze &#8220;trekkie&#8221; term zou goed de toekomst van de computer zich kunnen verschuilen.FPGA’s (field-programmable gate array) zijn chipjes die gespecialiseerd zijn in een taak. Zijn kunnen maar een ding, maar dat kunnen ze heel erg snel. Ongeveer 1000 keer sneller dan een normale processor.2 onderzoekers van een universiteit in California hebben een architectuur bedacht waar zij beide technieken combineren: als een generieke CPU herkend dat een bepaalde taak vaak wordt herhaald, verplaats hij dat stukje code naar een FPGA. Ik zei toch dat het simpel was!   Meer info: hier. [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: jick</title>
		<link>http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/comment-page-1/#comment-32997</link>
		<dc:creator>jick</dc:creator>
		<pubDate>Mon, 22 Oct 2007 06:55:08 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/#comment-32997</guid>
		<description>That is certainly an interesting idea, but what computer scientists (or rather, computer engineers) have found is that a single idea never solves all the performance issues. Rather, a good idea frequently exposes yet another bottleneck that was hidden before.

ILP (instruction-level parallelism) is a related idea, and also an old one. Even in the PC world, the original Pentium processor introduced a &quot;superscalar&quot; architecture, which meant that it can occasionally execute &lt;i&gt;two&lt;/i&gt; instructions at a single cycle.  Still, today&#039;s monster chips don&#039;t have a hundred instructions running parallel, and not because we don&#039;t have enough chip space to do that. Program codes have inherent interdependency, which limits the amount of things you can do at a time... unless you find really really clever tricks.

There&#039;s also the memory bottleneck issue. Modern DRAMs are usually hundreds of times slower than the CPU, which means the CPU often sits idle for hundreds of cycles just waiting for data. If the CPU suddenly becomes 1000x faster, the situation just becomes much worse (relatively speaking), unless we can redesign the code in a special way such that it does a LOT of computation with only a SMALL amount of data. (It could be feasible for some science applications, but impossible/impractical for most end-user applications.)

Wow, I can&#039;t believe I just wrote an informative reply to Cosmic Variance! *giggle*</description>
		<content:encoded><![CDATA[<p>That is certainly an interesting idea, but what computer scientists (or rather, computer engineers) have found is that a single idea never solves all the performance issues. Rather, a good idea frequently exposes yet another bottleneck that was hidden before.</p>
<p>ILP (instruction-level parallelism) is a related idea, and also an old one. Even in the PC world, the original Pentium processor introduced a &#8220;superscalar&#8221; architecture, which meant that it can occasionally execute <i>two</i> instructions at a single cycle.  Still, today&#8217;s monster chips don&#8217;t have a hundred instructions running parallel, and not because we don&#8217;t have enough chip space to do that. Program codes have inherent interdependency, which limits the amount of things you can do at a time&#8230; unless you find really really clever tricks.</p>
<p>There&#8217;s also the memory bottleneck issue. Modern DRAMs are usually hundreds of times slower than the CPU, which means the CPU often sits idle for hundreds of cycles just waiting for data. If the CPU suddenly becomes 1000x faster, the situation just becomes much worse (relatively speaking), unless we can redesign the code in a special way such that it does a LOT of computation with only a SMALL amount of data. (It could be feasible for some science applications, but impossible/impractical for most end-user applications.)</p>
<p>Wow, I can&#8217;t believe I just wrote an informative reply to Cosmic Variance! *giggle*</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Torbjörn Larsson, OM</title>
		<link>http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/comment-page-1/#comment-32996</link>
		<dc:creator>Torbjörn Larsson, OM</dc:creator>
		<pubDate>Mon, 22 Oct 2007 06:20:18 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/#comment-32996</guid>
		<description>Oh, and I forgot to add that another driver for reconfigurable processing is its power efficiency, especially if one can convert synchronous processing in a local microprocessor to asynchronous in a more generalized net topology. So as the importance of mobility (and energy saving/prizes) goes up, the threshold for using it should decrease one may hope.</description>
		<content:encoded><![CDATA[<p>Oh, and I forgot to add that another driver for reconfigurable processing is its power efficiency, especially if one can convert synchronous processing in a local microprocessor to asynchronous in a more generalized net topology. So as the importance of mobility (and energy saving/prizes) goes up, the threshold for using it should decrease one may hope.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Torbjörn Larsson, OM</title>
		<link>http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/comment-page-1/#comment-32990</link>
		<dc:creator>Torbjörn Larsson, OM</dc:creator>
		<pubDate>Mon, 22 Oct 2007 06:14:20 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/#comment-32990</guid>
		<description>I have always fancied idea of reconfigurable processing, but the field has never taken off. Hopefully approaches such as these works, they may be based on affordable and scalable realizations.

Speaking of wet dreams, I&#039;m just reading an article on cloud processing, among other background referring to the modular and minimal Xios/3 OS based on xml. Everything on scalable internet, and internet on everything modular. (And Google ogling the proceeds which is both promising and scary.) Keep your fingers crossed for survival in the hard world of software.</description>
		<content:encoded><![CDATA[<p>I have always fancied idea of reconfigurable processing, but the field has never taken off. Hopefully approaches such as these works, they may be based on affordable and scalable realizations.</p>
<p>Speaking of wet dreams, I&#8217;m just reading an article on cloud processing, among other background referring to the modular and minimal Xios/3 OS based on xml. Everything on scalable internet, and internet on everything modular. (And Google ogling the proceeds which is both promising and scary.) Keep your fingers crossed for survival in the hard world of software.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Carl Brannen</title>
		<link>http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/comment-page-1/#comment-32989</link>
		<dc:creator>Carl Brannen</dc:creator>
		<pubDate>Sun, 21 Oct 2007 23:45:20 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.discovermagazine.com/cosmicvariance/2007/10/21/warp-speed-computing/#comment-32989</guid>
		<description>I spent years working with FPGAs, mostly Xilinx. If you design programs for them by hand, than 1000s  of times faster is a slight understatement.

A Xilinx&lt;a href=&quot;http://direct.xilinx.com/bvdocs/publications/ds100.pdf&quot; rel=&quot;nofollow&quot;&gt; Virtex-5 FPGA&lt;/a&gt; has as many as 51,840 slices, which, if it was all devoted to arithmetic would give 207,360 single bit adders or 25,920 8-bit adders. They run at 550 MHz, so the theoretical maximum for 8-bit addition is 14,256 billion operations per second.

Actual ratios depend on the operation. Generally, the CPU will have advantages at complicated operations like floating point, especially multiplication and addition, while the FPGA will be blindingly fast on simple 1-bit wide logic operations on as many as 4 variables. Such a logic operation could be, for example, ((A or B) and (C or D)) or ((not A) and (not B)), that is, any logic function of four variables.

FPGAs are widely used in experimental particle physics.</description>
		<content:encoded><![CDATA[<p>I spent years working with FPGAs, mostly Xilinx. If you design programs for them by hand, than 1000s  of times faster is a slight understatement.</p>
<p>A Xilinx<a href="http://direct.xilinx.com/bvdocs/publications/ds100.pdf" rel="nofollow"> Virtex-5 FPGA</a> has as many as 51,840 slices, which, if it was all devoted to arithmetic would give 207,360 single bit adders or 25,920 8-bit adders. They run at 550 MHz, so the theoretical maximum for 8-bit addition is 14,256 billion operations per second.</p>
<p>Actual ratios depend on the operation. Generally, the CPU will have advantages at complicated operations like floating point, especially multiplication and addition, while the FPGA will be blindingly fast on simple 1-bit wide logic operations on as many as 4 variables. Such a logic operation could be, for example, ((A or B) and (C or D)) or ((not A) and (not B)), that is, any logic function of four variables.</p>
<p>FPGAs are widely used in experimental particle physics.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Minified using disk
Page Caching using disk

Served from: blogs.discovermagazine.com @ 2012-02-13 22:34:20 -->
