Skip to content

bit0fun/Senior_Design

Repository files navigation

University of Rochester Senior Design Project:

Low Power Server

About the Project

We have designed and assessed the feasibility of a low power server that can display a website page within minimal latency (1-10 seconds) of a standard web server. Due to the complexity and time constraints, we have created designs for the motherboard, toolchain for the custom processor, and simulated programs of necessary drivers. The motherboard is ready to be sent for first prototypes, and the assembler and linker function flawlessly. The simulator runs Fusion-Core executable ELF files, with a proper memory view and register dump of the running program.

The motherboard

The motherboard adheres to the Mini-ITX standard, so it is compatible with any existing computer case that supports Mini-ITX motherboards. In order to realize the custom processor, we are using a Xilinx Artix-7 35T FPGA. This FPGA also contains DDR3 hard IP as well as 4 TX and RX 3.125 Gb/s transceivers, which we have utilized. The simulated power draw of the system met our expectations with a total power draw under maximum load around 15W, with approximately 2W of dissipated power. This does not include the external ATX power supply. The devices and IO connectivity are listed below.

  • Xilinx Artix-7 35T 484 pin FPGA
  • 4Gb of DDR3L RAM
  • 10/100 Mb/s Ethernet MAC and PHY controller
  • 4x link Expansion Card interface, maximum of 25 Gbps transfer (TX/RX combined)
  • Micro SD Card connector
  • HyperBus Flash and RAM IC
  • Custom power conditioning design (88% efficient, 2W of dissipated power, ~15W total)
  • ATX 2.0 compatible Power Connector
  • SPI BIOS configuration flash, and FPGA configuration flash
  • 6 pin debug pin header
  • Standard Xilinx JTAG connector

More information about the design can be found in the Reports folder in the repository.

Fusion-Core ISA

The Fusion-Core ISA was designed with two goals in mind: Simple Decoding of instructions and the ability to easily add custom instructions. Due to the nature of this design, the main instruction set was chosen to be very small and compact, which allows for implementation in a wide variety of applications. Specifically relating to servers, the ability of adding additional instructions is useful in that external processor cards, such as PCIe FPGA expansion cards, are not necessary. This design does not require such functionality, but we felt it is important to include expandability and scalability in such a design even if it is not specifically made for the purpose of high performance. More information about the Fusion-Core ISA can be found here.

Fusion-Core Simulator

In order to verify our design before testing the implementation of the Fusion-Core processor, we created a simulator to ensure proper execution of our programs. This freed us from requiring a hardware debug unit in the immediate time frame, as we can check execution by either running the program in full, or stepping through a program. With access to register views and a text file simulating the whole memory, we can effectively emulate the processor as close as possible without requiring additional hardware or software. More information about the Simulator can be found in the Reports directory within the repository.

Server Side Software

While we were unable to run the software on actual hardware due to the limited budget and amount of time we had, we utilized the simulator to emulate the memory mapped components such that it would function in the same way on real hardware. We created drivers for the ethernet controller and SD card, as well as the beginnings of an TCP/IP stack and HTTP server. At the time we were working on this project, there was no GCC port for the Fusion-Core ISA, so all of the programs were written in Assembly language. More information about the software can be found in the Reports directory within the repository.

Use Case

For low to mid range performance applications, we propose this to be a better solution than commercial servers. As the power consumption will be far lower, we take the trade off of performance as web applications are not CPU intensive. Outside of server farms and high performance computing, having systems that are underutilized yet draw a large amount of power, is a waste in terms of energy and space.

We do wish to clarify that this solution will not work for all use cases and we do not recommend use for a high number of accesses to the server, nor high compute power applications. In these use cases the benefits of a lower power server will not energy efficient compared to a commercial system due to the amount of time the program will run. While the Fusion-Core ISA does allow for custom instructions allowing for hardware acceleration, that part of the specification is still under development at the time of finishing this project. Due to this fact, we focused strictly on the main core instructions.

Differences between Existing Solutions

The internet connection bandwidth and data transfer speed between the processor and Ethernet controller is the main bottleneck for these applications, so by reducing the performance of the processor we can maintain similar speeds or continue to lower performance to further reduce power consumption while still maintaining functionality as the rate users are used to.

Another important point is that the program to be run on the server is meant to be 'bare-metal' or run without an operating system. By reducing the program size, we not only can gain a small performance benefit, but also reduce the amount of execution time vs running in idle, and further reduce power consumption.

FAQ

Why not use a Raspberry Pi or old computer instead?

There are two main reasons to our decision of not using existing hardware. The larger of the two, would be fine control over the power draw of the system. Since the entire purpose of the design is to provide an overall lower power system, being able to choose and design an efficient power supply and high efficiency components is a requirement.

The second reason focuses on more of the software capability, as we are running the applications bare-metal. Without an operating system, again we get fine tuned control over what is going on, and can optimize for lower power as less excess code will be running. An added benefit of course is a bit more performance, as again no extra code would be running. With an existing system, we would need to have a complete understanding of how to create the drivers for that system. Due to most computers (if not all) not having freely available documentation, especially the Raspberry Pi with large binary blobs for certain drivers, we would not have been able to design a solution in the way we intended.

Why not use an existing ISA?

While using an existing ISA may have reduced development time, using a custom ISA provides us with full understanding of the implementation. Instead of spending time sifting through existing ISAs' (usually poor) documentation and learning what design decisions the creators have made, we can instead put time into developing our own system and then take less time in writing software as we developed the system from the ground up.

The reason why we chose to use the Fusion-Core ISA specifically, is the added bonus of expandability to the ISA. The Fusion-Core ISA allows for custom instruction additions, which are called Co-Processors in the specification, with the added bonus of forwards and backwards binary compatibility. In designs that require high reliability that will be implemented for long periods of time, this amount of customization and compatibility is certainly advantageous.

What is the estimated cost of manufacture?

For one-off prototypes, we have been quoted at around $1.6k, for a 1 month turn-key operation. In the event that we were to manufacture this design in quantities larger than 1,000 , we could reduce materials costs to approximately $120 or less per unit.

About

Senior Design Project

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published