Login

ARM The Architecture For The Digital World  

ARM Community: SoC Design - ARM Community

Jump to content

ARM Performance Analysis Made Easy

I hate reading manuals. Maybe it’s a guy thing or maybe it’s the engineer in me, but it’s pretty rare that I’ll bring home a shiny new device and then sit down and read the manual. Give me a few pictures or examples on how something should be used and I’m good to go. Not surprisingly, a lot of customers take this same approach with our software products. Immediately after installation they’ll start things up and look around for an examples directory long before they’ll ever click on the “Help” button.

This approach works well for a lot of tasks but when it comes to more complex tasks that example needs to be very sophisticated to be of value. System on chip (SoC) performance analysis, certainly meets that “complex task” threshold. The best way to analyze the performance of an SoC before it is built is by using a virtual prototype. Before any performance analysis work can be performed however, there is a long list of tasks to be performed. Models need to be assembled and configured for all of the important design elements, or even written if the models don’t already exist. These models need to be pulled together into a system configuration which matches the end design. Finally, software must be written to initialize all of the components in the system and then generate some meaningful traffic. Hopefully your virtual prototyping tool has some good data visualization tools. If not, you can add data extract...

Foundry and IP Business Model: Alive and Well

In my role, I serve as one of the members of the Global Semiconductor Alliance (GSA) Steering Committee on Intellectual Property, where we work to share best practices and continue to improve the IP ecosystem for the benefit of the entire semiconductor industry. As part of this role, I’ve observed a trend in the news speculating on the future of the foundry and IP industry, and I recently posted my thoughts on the GSA blog site, and I’d like to share them with you here as well.

In 1897, after a journalist erroneously reported the passing of famed author and humorist Mark Twain, Twain replied in his typical wit with the now famous retort “The rumor of my death has been greatly exaggerated.” Like the then very alive author, recent reports have speculated on the demise of the foundry and IP business model. I similarly think such talk is pure nonsense. Across many metrics the foundry and IP space is alive and well and providing unprecedented capabilities to semiconductor companies.

Let’s put the tabloid down and look at the facts from a few different perspectives.

...

Virtual Platforms Accelerate Embedded Software Development

Transcending “Reality” with Virtual Platforms
"Virtual reality is the first step in a grand adventure into the landscape of the imagination.”
--Communication in the Age of Virtual Reality by Frank Biocca, Taeyong Kim and Mark Levy

So what is the reality, and virtual reality, for embedded software developers? Virtual platforms will be increasingly important, especially for multi-core designs. The April 2012 EE Times Embedded Software Survey reported that 33% of software developers say virtual platforms are becoming more important to accelerate their schedules.

Attached Image

Source: UBM Electronics -- Embedded.com & EE Times.


Why Use Virtual Platforms?
When hardware is not available, you can still start software development early and with confidence. Virtual platforms can be programming-model hardware accurate, and allow you to run the exact binary you code for the real device. They deliver the level of abstraction and performance needed to run operating systems and application software and integrate external system components and interfaces.

Even when hardware is available, virtual platforms offer significant advantages. Virtual platform changes can be instant...

The Next Big Thing: From the Newton to Smartphone... to a Smartchip Implant?

Just last week while at a friend’s home, I noticed an interesting device from the past – an Apple Newton, circa mid-1990s – a device way ahead of its time. An early personal digital assistant (PDA) based on the 32-bit ARM 610 processor on 1.2um CMOS process, running at 20-33Mhz with battery life of over 30 hours of continuous use. There’s still quite a following on this extraordinary device that even featured handwriting recognition.

Fast forward to present day. Now, we have the latest smartphones – mobile computing platforms featuring a PDA, camera, GPS, WiFi, web browser, touchscreen and of course, cell phone. The typical advanced smartphone is based on the ARM® CortexTM-A9 processor core on 45nm to 28nm process technologies, running at or greater than 1GHz with limited battery life of approximately 10 hours max.

Whoa …..so what happened to the battery life?? Granted there are phenomenally greater features and performance, at the cost of power, i.e. 40x in performance from the Newton but one-third ...

Instant Replay of Your ARM Cortex-A Series Simulations and Emulations

If you've watched any sporting event on television lately, you've seen the pressure put on referees and umpires. They have to make split-second decisions in real-time, having viewed ultra-high-speed action just a single time from a single vantage point. But watching at home on television, we get the luxury of viewing multiple replays of events in question in high-definition super-slow-motion, one frame at a time, and even in reverse. We also get to see many different views of these controversial events, from the front, the back, the side, up close, or far away. Sometimes it seems there must be twenty different cameras at every sporting event.

Wouldn't it be nice if you could apply this same principle to simulation and emulation runs of your ARM® Cortex™-A series processor-based SoC designs? What if you had instant replay from multiple viewing angles in your functional verification toolbox? It turns out that such a technology indeed exists, and it's called "Codelink Replay".

Mentor Graphics’ ...

Squaring the circle - Optimizing power efficiency in a Cortex-A15 processor

It is entirely appropriate that ARM will announce technical details of its latest hard macro product, the Cortex™-A15 MP4 Hard Macro for TSMC 28HPM node at COOL Chips XV, the IEEE Symposium on Low-Power and High-Speed Chips, being held this week in Yokohama, Japan (18-20th April, 2012). This exciting new hard macro not only perfectly encapsulates the theme of the symposium, but also pulls together the contemporary and divergent design challenges of offering extremely high-performance compute engines within a conservative power budget.

The Cortex-A15 MP4 Hard Macro is a high performance, power-optimized quad-core hard macro implementation of our flagship Cortex-A15 processor, on leading 28nm process. It delivers three significant firsts for the ARM hard macro portfolio, as not only is this the first quad–core hard macro, but also the first hard macro based on the highest performance ARMv7 architecture-based Cortex-A15 processor, and it is also the first hard macro based on 28nm process.

In terms of configuration, the Cortex-A15 MP...

Early Power, Performance, Area Analysis & AMBA Designer: A Winning Combo

Power, performance and area or “PPA,” as it is called, has become a universally interesting topic to system-on-chip (SoC) designers around the world. Atrenta – an ARM Connected Community Partner and a leading provider of SoC Realization solutions for the semiconductor and electronic systems industries – showcased a new and innovative design flow for early PPA estimation at last year’s ARM TechCon.

Design complexity now demands that all aspects of the design be co-optimized. If you reduce power, you will impact performance and area and so on. A holistic approach that balances all requirements of the chip is needed to deploy SoCs successfully.

Improving PPA by analyzing interconnect fabric earlier
Anyone will tell you that the interconnect...

Spreading the Multicore Message 'Down Under'

ARM is heading to Multicore World 2012 to discuss ARM MPCore solutions and grow the development ecosystem in New Zealand.

If you were to ask most people what New Zealand is famous for, you’d probably hear it has more sheep than people, suggesting that the most that small country down-under can offer is open spaces and woolly clothing – how wrong.

Later this month, the balance will be put a little out of shape, starting on the 27th March for two days a number of international speakers and researchers and I will be taking time to specifically travel to New Zealand to address and participate within their growing community of businesses and technology innovators addressing the international challenge of multicore at Multicore World 2012.

ARM today has very little of its development ecosystem located in the southern hemisphere, however this opportunity to inform the community about the ARM MPCore solutions and vision was too good to be missed. As such, I will be travelling fr...

Memory BISTing Your SoC

Co-authored by Stephen Pateras, Mentor Product Marketing Director, Silicon Test
---
Memory Built-In Self-Test (BIST) has existed for as long as there have been modern SoCs. With today’s advanced SoCs containing literally hundreds and even thousands of SRAM memories, clustered in many memory subsystems throughout the SoC, both memory BIST as well as Built-In-Self-Repair (BISR) technology are increasingly important to insure the highest possible yield. System-on-chip designs are tailored for the optimal power, performance, area (PPA) and cost for a specific target application. To meet these increasingly challenging goals, designers are implementing solutions using multiple power domains, allowing the memory to run at high speed while conserving power in the periphery. These factors, combined with the different types of memories, including high density and high speed; single port, multi-port, register files and ROMs add to the complex task of ensuring the SoC performs to specification, and doesn’t limit yield in high volume production.

ARM has teamed up with ...

ARM, Digilent, and Xilinx Co-sponsor SoC Design Competition at MIT

As this is my first blog, maybe I should I introduce myself… My name is George and I’m responsible for a variety of marketing communications for the ARM® University Program.

Every year during the month of January, the Massachusetts Institute of Technology (MIT) gives their students a full month off from regular courses to pursue their own personal ingenuity, something they like to call the Independent Activities Period (IAP). From improv-comedy workshops to specified bio-chemistry lectures, a whole slew of activities are made available to students. Included in the mix this year for the very first time was a System-On-Chip (SoC) design competition sponsored by ARM and two ARM partners: Digilent and Xilinx.

“This is very exciting for ARM, Xilinx, and Digilent. A design competition like this is not only unprecedented, but is bein...

Getting to Market Quicker with a POP

Find out more about what ARM’s Processor Optimization Pack (POP) is and its advantages.

Like many people who work in high tech, from time to time I’ve had trouble explaining my profession to family and friends. Unless you are pretty tech savvy it is hard to understand exactly what a SoC engineer does everyday. The other day I was driving in my car dropping my son off at Cub Scouts when he asked me, “Dad what do you do at work?” It’s the kind of question that comes out of the blue and you struggle to answer. I paused for a few moments searching for a simplified answer and responded “I make ARM processors go faster.” He seemed satisfied with that answer, and quickly became distracted when his favorite song came on the radio; when you’re a seven year old you just love singing in the car!

When I joined ARM in 1997 I was a SoC engineer in ARM’s consulting group, the group’s main focus was to enable a Partner to get to market faster by taking advantage of AR...

Streamlining the migration to an ARM processor based platform

Now available design strategies and documentation to ease the migration to an ARM platform - From one programmer to another.

You could say I’m an “Acorn boy” – I cut my programming teeth on personal computers from Acorn in the late ‘70s and early ‘80s. All were based around the 6502 and, perhaps because, it’s where I started, I still look on that processor with great affection. I’ve used many other devices in my 25-year career in embedded software – 6502, 6809, 6811, Z80, 68000, 8051, 8086 to name a few. For the last eleven years, I’ve worked at ARM and have worked almost exclusively with various flavours of the ARM architecture. I had worked with ARM processors before that, though. In my life as a consultant software developer, I was part of the team which ported BSD Uni...

Using Cache Coherency to Verify the AMBA4 Protocol

The Jasper User Group Meeting was held on November 8 & 9 and was full of presentations on the diverse ways that users are applying formal techniques – some in areas where never before thought possible. Paul Martin from ARM was one of those users who presented on this topic. ARM discussed how modern multi-core processors now require much more sophisticated cache control than before, ensuring that all devices in the system have the same view of shared data, known as cache coherency. ARM in particular has created some quite sophisticated protocols, AXI Coherency Extensions (ACE), under the AMBA 4 umbrella, that they announced at DAC.

The need to move cache management to hardware
In the old days, cache coherency management was largely done in software, invalidating large parts of the cache to ensure no stale data could get accessed, and forcing the cache to gradually be reloaded from main memory. There are several reasons why this is no longer appropr...

FINFET: Has its time finally come for a sub - 20nm 3D device?

Tri-gate or Fin-FET devices have been scrutinized for about 10 years and are considered a viable solution only when conventional planar MOS devices are not able to deliver the expected performance while keeping a low enough leakage current. When scaling down the gate length it is increasingly difficult to control the vertical electric field between the gate and the substrate, while maintaining the channel depletion below threshold and then minimize the leakage current (and consequently the static power). As we move to 20nm and beyond process technology, Fin-FET design may earn its place as the technology path of the future.

The SOI community has worked for more than 20 years on fully-depleted transistors (Figure 1). Thanks to the limited silicon thickness between the gate and the buried oxide, the channel region is fully depleted (hence the name) below threshold, an inversion region forming above threshold to enable the current flow between source and drain. Unlike the planar bulk device, the fully-depleted device brings an additional control knob; the vertical electric ...

There’s more to “Low” in Low VDD than meets the eye

In text books, academia, and articles there is a consensus that to ensure the material is understood it helps to define terminology. I will define Low VDD in the context of a low power supply being connected to my Integrated Circuit of Choice. It does not necessarily imply low power or low energy which is where I think we, in the IP business need to continue directing our focus.

Mathematically, yes power, both dynamic and static, are functions of “V” and you can find enough literature out there with many equations of sorts. So in theory, the lower we make the VDD the better, right? Well it’s not just about V or Vmin, it’s also about other entities like t(time) and RC (resistance and capacitance). If you operate at 0.6V and it takes you twice as long to switch your circuit you will be consuming potentially more crowbar current as you arduously try to switch, so in the end you haven’t really saved that much on either your battery or your PG&E bill. And hiding behind the “t” issue is the RC. It will hinder the propagation of signals through your elegantly designed circuits, boards and systems. So unless you have a lossless system, which doesn’t exist, RC will always be there slowing you down - interfering with the ideal world that Lowering VDD is your solution to lowering power. To make matters worse, RC does not often scale as much as we would like. Another issue is it’s not just about wire RC, but ...

将大小计算引擎完美地整合在-起 - ARM Cortex-A7

今天,ARM Cortex-A7 的隆重推出…标志着 big.Little 处理架构的最终实现!

我驾驶着一辆节能型的本田飞度行驶在 20 英里的市区上班途中。有时我会突发奇想,希望在自己的座驾上安装一个更快的引擎,但大部分时间我还是对自己驾驶的节能车辆感到很满意。但我必须承认我曾抗拒换一辆经济型轿车;我时常梦想自己拥有像保时捷或宝马车那样的性能,但只希望在驾车的小部分时间里拥有这种卓越的性能。如果我能驾驶一辆平均能效保持在四缸引擎范围内,而在需要最高性能的片刻可迅速切换到高性能的涡轮增压式 V8 引擎汽车,那该有多好啊?如果平均燃油能效接近于 4 缸引擎,而最高性能接近于涡轮增压式 V8 引擎的性能,会怎样?
...

big.LITTLE and AMBA 4 ACE keep your cache warm and avoid flushes

High performance and power efficiency are critical to the latest mobile devices, and AMBA® 4 ACE™ is a fundamental technology supporting ARM’s big.LITTLE processing. In case you missed the announcements, the big.LITTLE technology offers an innovative way to run the ‘always on’ tasks on the highly efficient Cortex™-A7 processor, while the high performance and responsive applications are predominantly executed on the Cortex™-A15 processor. So what does this have to do with AMBA 4? Well AMBA 4 ACE and the CoreLink™ CCI-400 Cache Coherent Interconnect offer the critical glue to join these processors together into a big.LITTLE multi-processing (MP) system. Let me explain…

Earlier this year ARM announced the public release of the AMBA 4 phase 2 specification including AC...

Energy Efficiency and Air Conditioning - Part 2: ARM Cortex-A7

ARM Cortex-A7 processor…It's all about right-sized equipment.
In Part 1 of this blog we saw how right-sizing of air conditioning is vitally important because it performs three different functions simultaneously: Cooling, dehumidification and ventilation. Increases in efficiency could be obtained by separating out these three functions and optimizing them independently. As we saw last time, pumping air through ducts is inefficient due to the wasted pumping energy. You could use a hydronic system like a traditional underfloor heating system as is common in northern Europe, but with cooling in the ceiling as well as heating in the floor. Pumping water is a more efficient way to move energy than air. Even though the water pipe is physically smaller than an air duct, in terms of thermal energy transfer capacity it's a fatter pipe. But this wouldn't dehumidify or ventilate, so you'd still need a very small A/C system with air ducts to provide these functions. Duct size and pumping losses would be much lower than a pure ducted central air system though...

Simplifying SoC's with Hard Macros - New solutions for old problems

At the TSMC Open Innovation Platform (OIP) forum in San Jose this week Mike Inglis, the General Manager of the Processor Division at ARM, presented a keynote speech succinctly titled “Enabling Smart System Design Through Collaboration, Optimization and Scalability”. This presentation outlined many of the challenges faced by silicon developers around the globe, and noted some solutions that can help reduce these design pains.

For me, the most important aspect of this talk was the public announcement of the availability of a new Cortex™-A5 Hard Macro for the TSMC 40nm Low Power node (40LP) which can achieve a whopping speed of over 1GHz in a tiny footprint of just 1mm2.

This Hard Macro is based on a uni-processor implementation of the Cortex-A5, and includes NEON™, Flo...

Combining large and small compute engines - ARM Cortex-A7

Today the ARM Cortex-A7 processor was announced…the power of big.Little processing is finally realized!

I drive a Honda Fit, mainly for the fuel efficiency, on a 20 mile city street commute. Sometimes I wish my car had a faster engine, but most of the time I’m happy to drive for high gas mileage. But I have to say I was a reluctant convert to economy cars; I often find myself longing for the performance of a Porsche or BMW, but I only really want that performance a small percentage of the time I’m driving. Wouldn’t it be great if it were possible to drive a car with the average efficiency of a 4-cylinder engine, a car that could switch to a high performance of a turbocharged V8 engine for the small percentage of time you actually wanted peak performance? What if the average fuel economy was closer to the 4-cylinder and the peak performance was closer to that of the turbo V8?

...

Energy Efficiency and Air Conditioning - Part 1

I’ve recently realized just how similar mechanical and electronic engineering are when it comes to energy conservation. You see, I've recently begun construction of an energy-efficient, Frank-Lloyd Wright -inspired architect-designed house in Lakeway, Texas just outside Austin where the ARM US engineering team is based. We're coming to the end of a long and very hot summer here in Austin so naturally I spent quite an amount of time researching energy-efficient solutions to air-conditioning (A/C). To minimize energy consumption we're putting in an unusual heating and air-conditioning system which consists of eight ceiling cassettes all connected via refrigerant pipes to a single, external electronic inverter-controlled infinitely variable compressor. Each ceiling cassette A/C outlet can be individually controlled from an iPhone App from anywhere in the world and can run at any one of four speeds. Instead of pumping air around the house in ducts as with a traditional A/C system we pump refrigerant around in pipes, which is more efficient as it wastes less energy pumping. The total power output capacity of the eight outlets can total up to 130% of the maximum compressor output, but obviously you can't have all eight outlets on full blast at the same time.

In explaining the system to my friend and ARM colleague ...

Clean Sweep at 28nm for ARM Artisan Physical IP

On October 6th, UMC announced the selection of the ARM® Artisan® Physical IP Platform for the UMC foundry sponsored IP program. This new platform for UMC’s 28nm high-K metal gate (HKMG) process is a natural continuation of the long standing relationship between ARM physical IP division and UMC. ARM Artisan IP has been successfully used in millions of SoCs produced at UMC for more than 10 years on 180nm, 130nm, 90nm, 65nm and 55nm process technologies. The addition of UMC to ARM’s family of 28nm Physical IP platforms has a larger meaning than just a high quality set of IP on a technology-leading process. ARM Artisan IP is now the only physical IP platform available at all four of the 28nm commercial foundries in the world: TSMC, UMC, ...

Avoiding rush hour traffic jams in your SoC design

Thank goodness for the new packetization, virtual networking and clock gating features of the ARM CoreLink NIC-400. I’m fed up with hearing about spurious comparisons between a single humungous crossbar switch and NoC solutions. No design of any complexity uses a single cross bar switch! Modern SoC designs integrate large numbers of IP cores supporting the de facto industry standard AMBA AXI interface (open to all) using a network of small switches to allow very high operating frequencies and reduced routing congestion while still maintaining very low latencies and simplicity and flexibility of design.

Smart customers evaluate the alternatives to compare performance (bandwidths and latencies) vs. cost in terms of silicon area, routing congestion, power and price.

Packetization is great to increase wire utilization, but comes with a packetization/depacketization overhead that costs latency, gates and power. So should be used where it helps most to cross long distances with multi-cycle timings, f...

Is Moore's Law Done?

In 1965 Gordon Moore first published his famous observation that the number of transistors on an integrated circuit doubles every year (in 1975 revised the timeline to every two years). Moore based his observation on just a handful of integrated circuits, the biggest of which had less than 100 transistors. Today, several production chips contain over a billion transistors , serving as one of the most astonishing examples of exponential growth in the history of technology. Superlatives abound for this trend, but one of my favorites is this: last year, human beings produced about 100 transistors for every ant on the planet. Predictions that Moore’s Law is running out of gas or is otherwise finished have been around for almost as long as the law itself, but the trend keeps on going. Lately, though, things are looking ominous. Is Moore’s law finally done?

To understand the future of the law, it helps to look back at its history. In its basic form, Moore’s law refers to the most cost-efficient transistor count, but in practice we typically include performance and power scaling as wel...

Watch out for the potholes and road bumps in 20nm design

As Moore’s Law continues to drive the semiconductor industry to smaller and faster transistors, 40nm chips are state-of-the-art, 32nm/28nm cores are right around the corner, and companies are now planning their 20nm flows, methodologies, and products. Foundries have been working tirelessly over the last few generations, adding new, sophisticated steps to the manufacturing process to prolong the power and performance scaling that has defined the semiconductor industry over the last 30 years.

Around the 40nm node, manufacturers began adding in new materials to permanently put silicon transistors under compressive or tensile stress. By adding the correct type of material and inducing the right type of stress, foundries now provide increased carrier mobility in NMOS and PMOS devices. At 32nm/28nm, high-k metal gates are being introduced into the manufacturing flow of all the top foundries. High-k metal gates reduce transistor leakage without reducing performance, and enable further oxide scaling in fu...

Designing an ARM Based SoC: How to Meet Your Power Budget

ARM IP and ARM processor usage is pervasive across multiple segments of the electronics industry. As shown in Figure 1, each of these market segments have unique design challenges and analysis drivers. For example, a SoC targeting a mobile handset or tablet will require high-performance, while still meeting the overall power budget. Memory and I/O IP on the other hand, must be designed for immunity to noise that is coupled via the power grid routing, the package, or the substrate. SoCs targeting automotive or medical industries must meet high reliability standards and minimize their electromagnetic interface (EMI) signature.

Attached Image

Figure 1: Market segment specific design considerations for an ARM IP based SoC.


Using a power and noise budgeting methodology...

Stirring up debate at GTC 2011

Last week ARM participated in the GLOBALFOUNDRIES 2011 Global Technology Conference (GTC11), ARM was involved in two notable panel sessions that covered a range of important industry issues. I wanted to share with you a few highlights of those discussions.

The first is the annual CEO panel, something that is becoming a GTC tradition. This one was entitled “Design Enablement Challenges and Future Solutions”. At this event, right after lunch, CEO’s, get together to talk about industry trends. ARM participated in the panel last year as well, with Simon Segars (EVP/GM of ARM’s Physical IP Division) representing ARM along with nearly the same other panelists. Also like last year, the panel was moderated by Mojy Chian from GLOBALFOUNDRIES, their Senior VP of Design Enablement. The panelists were:
Warren East, CEO, ARM...

Hot Chips Takes a Walk Down Memory Lane - Part 1

Recently I gave a keynote presentation at the Hot Chips conference at Stanford University entitled “ARM Processor Evolution: Bringing High Performance to Mobile Devices”. In my talk I covered three main topics. First, how computing has evolved over the last 30 years from its desk bound origins to the ultra mobile world we know today. Second I talked about ARM’s role in that evolution and some of our early experiences of mobile devices. Finally, I spoke about some of the challenges I see ahead of us. In this blog I will give a recap of my presentation. Here in part 1 I’ll cover the history, and in part 2 look to the future.

Back to the 80’s
There is no doubt that personal computing has come a long way in the last 30 years. In my presentation I started by looking at the leading mobile devices from the early 80’s, namely the Osborne 1 and the Motorola DynaTAC. Launched in 1981, the Osborne 1 is widely recognized as the first portable computer. Weighing in at a solid 24.5lbs, it would be a brave man who carried that on a round-the-world business trip, esp...

Mainstream Technologies Fuel the Internet of Things Revolution

There is an old saying that no matter where you are in the world you are never more than three feet away from a spider. I can’t tell you if this is true, although my wife hopes it isn’t. However it is probably a fact of life that you are never more than three feet away from a silicon chip manufactured on a mainstream technology process (250nm-90nm). Take a look around you right now. The computer screen you are reading this blog on is driven by display drivers manufactured on a mainstream technology node. That mobile phone you just used has several chips manufactured on mainstream nodes for battery management, power control and display functions. Your GPS device, tablet, coffee maker, microwave oven, PMP, video game player, and even the toys your kids are playing with are all powered by mainstream technology. As more of these products incorporate ARM-based microcontroller and microprocessors combined with wireless technology the application goes beyond an internet of ...

Thanks, Big Blue!

A couple months ago IBM celebrated the 100th anniversary of its founding on June 16, 1911. The most advanced computational technology in 1911 was mechanical tabulation machines and punched card data processing equipment. That’s right; a few of you remember these infernal beasts. My first encounter with punched cards and IBM’s JCL (job control language) was in graduate school while trying to get my simulation of the Motorola 68000 CISC processor to execute on the IBM mainframe. As I reflect on that late night frustration I am reminded of two key outcomes of the experience. First, the classmate that helped me through the accounting procedures on IBM mainframe became a lifelong friend. Secondly, I began to appreciate the simplicity of RISC processors and their potential for efficient computation but that is for a later story.

IBM continued under the leadership of Thomas J. Watson, Sr., who adopted the slogan, “THINK,” as his vision of the company culture. This culture of explorat...

Cache Coherency and Verification Seminar at DAC – Now Online!

If you missed DAC, then you missed the seminar on cache coherency and verification of cache coherency given by ARM and Jasper. Learn how to overcome the typical challenges of capturing the intent, reviewing possible scenarios and how to correct errors in functional terms as they relate to the specification.

The seminar (on-line recording) kicked off with a discussion on how the implementation of hardware-based coherency in high-performance parallel compute environments is not new. The seminar quickly revealed that architects and designers of high-performance, heterogeneous, embedded multi-processing SoC’s, particularly those with one or more caches and when many masters share a single area of memory, now require robust specifications, design & verification tools and systems IP, to ensure their devices minimize off-chip memory transactions, while maximizing performance and power efficiency. The seminar then went on to explain why ARM has chosen to include Coherency Extensions...

Elba - Bringing it all together

In the previous three blogs (Parts 1, 2 and 3) I’ve outlined the background and key decisions involved in the development and implementation of the Elba testchip. Now we’ll look at the final steps taken to bring Elba to life.

As our understanding of the various components broadened, the actual SoC architecture design activity then knew the details it needed in order finalize the design. We knew we had two Cortex-A9 macros, a Mali GPU and the various other components, but not too much about how we would best plug them together. Since we also wanted to investigate system level power management, many of the large system components were also placed into their own independent power domains. The layout of the design also became rather ...

Elba Processor Power Management

Having chosen the optimal implementation, as described in the previous blog, we now turned our attention to power management.

Simulations of Elba at this point of the program were starting to supply some rather noticeable power levels for the processor, especially at the design corner we were most familiar with. The worst case design corner is a statistical point across the variations that you could potentially see from a silicon process at a temperature that is assumed to not exist. Remember, ARM's primary market was mobile devices, so for these devices a manufacturer wanted to know that every chip delivered from the fab would achieve the defined performance. So speed would always be defined by the statistically slowest piece of silicon, and the power would be defined by the statistically fastest and hottest piece of silicon – neither of which would ever exist in reality, but allowed the manufacturer to maximise device yield without testing each part for its performance. You may know that around 1 billion phones were sold last year – and that would be a lot of cost to ‘speed-bin’ parts across that market.

There are various ways to speed-bin a SoC, but basically the two main o...

Elba - How do we know it works?

In part 1 of this blog, I outlined the thought process behind the Elba program. Here I’ll look at the implementation decisions for the project.

In ARM there are various stages of maturity of a new processor development, reaching silicon implementation in various fabrication processes is one of those and it made sense to us that Elba must also be a full silicon implementation. In fact, just in case this does work, and what we think may happen does, we’ll implement the Cortex-A9 processor in a way such that ARM could commercialize and promote these “G” implementations as a new product. But what type of silicon? In ARM we often build silicon devices, but these typically are no more functional as a device than something that can execute a little code from on-chip memories. Great, so our goal to build a multi-GHz Cortex-A9 will be able to run Dhrystone – we need more than that . . . How much more? As it ended up, quite a bit more. ARM also develops the Mali 3D graphic processors, so the device should include the...

ARM DAC Review

The 48th DAC was held in San Diego this year and whenever DAC is not in Silicon Valley there is always a concern about how many people are going to show up. Fortunately the sun shone on San Diego and there seemed to be a good number of people in attendance.

This year, for only the second time in history, there was an ARM booth at DAC. In previous years we have shown demos within our partners’ booths but this year we had our own space as well as working with our partners as usual. One of the messages we were promoting at DAC the need for a complete system solution approach to design, as opposed a collection of components designed in isolation of each other. We call this Smarter Systems Solutions and John Heinlein gave a presentation on this subject on the Chip Estimate booth. During DAC, ARM also ...

"Wouldn't it be interesting if we..." - Giving Birth to 'Elba'

“Wouldn’t it be interesting if we....” That’s the way many step changes have started in ARM. In the next four blogs I’d like to take this opportunity to tell you about just one of those, we called it “The Elba Program”. The Elba program is just coming to its end, but its effect is starting to be seen in various places across the web as new markets and devices are starting to appear.

First, a very basic history lesson. ARM processors really took off as the processor that powered the first mobile phones. Phones as I’m sure you know need to be turned on to receive a call, and until more recently most calls were fairly short in comparison to the time a phone sat in what was called standby mode. This requirement quickly became the dominant characteristic for embedded processors and drove an entire branch of the semiconductor industry to look at manufacturing devices that consumed as little energy as possible when in standby at the cost of higher energy consumption when active. At the fabrication level, this technology gained a geometry label “LP” (low power), to differentiate it from the “G” (generic) process that was used by the more general purpose microprocessors.
...

Upgrading Your Verification For Cache Coherency With Jasper!

Are you using a mobile device? Chances are you’re probably reading this blog post on one. You’re also probably reading this with the confidence that your device is doing what you intend it to do. As consumers, we place a lot of demands on not only our mobile devices but also the rest of our personal electronics. We want them to perform all sorts of tasks efficiently, accurately, and with minimal power consumption. As an engineer, you’re aware of the complex embedded SoC’s used in your device to make sure that it does what’s intended. Today’s embedded SoC’s are high performance, heterogeneous, and multi-processing systems.

In the future, most of these embedded SoC’s are likely to contain multiple caches that share a single memory resource. At a high level, cache coherency means that two caches cannot have same cache line in a dirty state and that if a cache contains a cache line in a unique state, that line must not be in another cache. In addition, at least one transaction must always be able make forward progress (no deadlock.) To prevent these cache coherency issues, ARM has included cache coherence extensions in their AMBA 4 protocol specification.

While this is a tremendou...

Building A New ARM Community

This week marks a significant step forward for ARM SoC (System-on-Chip) designers, as we roll out the newest ARM user community. For the past few years, we have been incredibly active in spreading the word about the ARM architecture and building communities with a focus on multimedia, software design, embedded applications, and of course, smart mobile devices. What is different about today’s rollout is that we have peeled back the lid behind ARM Powered devices to build a community around the SoCs and the designers that make these applications possible: the ARM SoC Design Community

...

  • (24 Pages)
  • +
  • 1
  • 2
  • 3
  • Last »
All company and product names appearing in the ARM Blogs are trademarks and/or registered trademarks of ARM Limited per ARM’s official trademark list. All other product or service names mentioned herein are the trademarks of their respective owners.