OPC-1 CPU for CPLD

Description

Here's our first effort for the One Page Computing Challenge: in 66 lines of markdown we specify a CPU, in 66 lines of python we write an emulator, another 66 lines of python for a macro assembler, 66 lines of HTML and JavaScript for another emulator, and of course 66 lines of verilog to define the CPU in a CPLD.

Here's the spec, on our minisite: https://revaldinho.github.io/opc/opc1spec.html

There's not much address space as the CPLD is small - with a bigger CPLD we could have a 16 bit address space without any change to the architecture. But then with a bigger CPLD we might be tempted to expand the architecture.

We have some breadboard-friendly CPLD modules so we can add RAM, UART and ROM to make a computer.

One Page Computing Challenge: https://hackaday.io/project/25091

Project Logs

Collapse

OPC-1 is done!
Ed S • 06/18/2017 at 19:51 • 0 comments
The final change to OPC-1: we removed SEC, because we had to fix a bug in the verilog, and then the machine didn't fit the CPLD, so something had to go. No great problem though, because with our macro assembler we can have an SEC like this:
```
MACRO SEC()
lda.i 0x01
lxa
ENDMACRO
```
You can run an emulation of OPC-1 in your browser: see here for a trivial program, or here or here for longer programs. (It's an emulator without many features, because, of course, it fits on one page.)
Our next mini-adventure was to see about an OPC-2 - instead of an accumulator machine, how about a load-store machine? More registers, operations on registers, and the only memory accesses to be loads and stores.
Unfortunately, although we did manage to make a working machine, fitting in the CPLD and again with all sources on one page, it didn't turn out very satisfactory. The CPLD is so small we could only have two registers, and the address space shrunk again to just 10 bits. We could only afford load and store from one of the registers. We managed a conditional jump and a jump-and-link, so again one can manage subroutines. But there was so little room JAL takes two bytes even though the second byte contains no information.
The spec is here, and the in-browser emulator is here.
At this point we've run out of room in the CPLD, and it's time for a new project. Although we might come back to OPC-1 and get it running in a breadboard - so far we've been exploring with synthesis, simulation and emulation.
For one last gasp, we wrote an OPC-3, which is a simple-minded expansion of OPC-1 into a machine with 16 address and 16 bit data. We quite like word-addressed machines, so this is one of those: addresses are not bytes! There's no simple way to access bytes although of course you can always shift and mask. But a benefit of this size of machine is that a value, whether in register or memory or as an operand byte, is always big enough to specify a full address.
OPC-3 is a bit too big for our CPLD and a bit too simple to be impressive. The instructions have lots of unused bits. But it leads us to think of interesting directions for the next project.
The spec is here and the in-browser emulator is here.
OPC-1 gets indirect addressing!
Ed S • 06/16/2017 at 18:00 • 0 comments

Still fitting within the CPLD, and still keeping source, spec and emulator within 66 lines, we managed to add a feature: indirect addressing.
Having reduced the address space from 12 to 11 bits, so we could fit in the CPLD, it turns out we had spare logic capacity (but of course no spare flop capacity) and we also had one bit freed up in the instructions.
So now we have a 5 bit opcode field. We gain a load instruction - LDA - and the store instruction - STA - gains a second addressing mode. In both cases there's an extra level of indirection: the effective address is the value loaded from the address given in the instruction. Because the pointers are fetched using only an 8-bit address, this gives the machine a zero page, like the 6502.
We also gain a set carry instruction, SEC, hoping to make subtraction-by-addition a little easier, as we lack a subtract instruction. And we gained a little by squeezing the carry bit into the link register. We needed that - the design now uses 100% of the Function Blocks in the CPLD.
Here's the updated spec.
OPC-1 gets subroutines!
Ed S • 06/13/2017 at 13:49 • 0 comments
We made an update, adding a link register and three instructions, which give us subroutine capability. There's still no stack!
The cost of this update, in fitting in the CPLD, was one address bit, so we move down from a 12 bit address space to 11 bits.
JSR stores the current PC in the link register and the accumulator - the PC is 11 bits, the accumulator only 8, so we needed a 3 bit link register. RTS copies the link and accumulator into the PC. That's almost enough, but to save a return address we need access to the link register, so LXA exchanges the link register and the accumulator.
In fact here's the spec on the subject:
- Subroutine call and returns are handled by the LXA, JSR and RTS instructions. When a JSR instruction is issued the upper 3 bits of the PC are placed into a link register and the lower 8 bits are stored in the accumulator. On entering the subroutine the accumulator value should be saved and then using the LXA instruction the link register bits can be moved to the accumulator and saved also. On leaving a subroutine the return address needs to be retrieved by the reverse of this process. First get the upper return address bits into the accumulator, then transfer them to the link register using LXA and finally get the lower return address bits into the accumulator. Executing the RTS will now set the PC to the 11 bit value made by concatenating link register with accumulator.
OPC-1 first cut
Ed S • 06/10/2017 at 12:46 • 0 comments

Our aim here was to see if we could fit a useful CPU on a CPLD. We chose the Xilinx 9572 because we've used it before, and there's a breadboard-friendly dev board for it.
At the same time we wanted to see if we could describe a CPU in one page.
The 6502 is nice and simple but is too large for a CPLD and too complex to be described in one page, so we started with that and threw out the stack pointer, the index registers, and almost all the instructions. We're left with an accumulator machine, and we kept just two flags: the carry and the zero.
The first cut has a fixed instruction format of two bytes, which allows for two addressing modes: eight instructions in direct mode with a 12 bit operand, and sixteen instructions in implied/immediate mode with just an 8 bit operand. So we get a 12 bit address space, and a 256 byte zero page. We get LDA, ADD, SUB, and AND with two addressing modes, and STA. NOT, JP, JPC, JPZ and SEC with just one addressing mode.
With this version, not only must stack management be manual, but also subroutines have to be managed manually, perhaps by using a Wheeler Jump. Maybe self-modifying code would be essential. It's certainly a fully capable CPU though.
At this stage we also had an assembler and an emulator, both written in Python. See GitHub.

View all 4 project logs

Discussions

SHAOS wrote 08/18/2018 at 08:15

I think OPC soft-cores could be perfect lab rats for my C++HDL project https://hackaday.io/project/160393-trcm :)

Are you sure? yes | no

SHAOS wrote 08/18/2018 at 17:31

Thank you for liking my little experiment ;)
I'm running your tests in https://github.com/revaldinho/opc/tree/master/opc1

It looks like one test is not there:

FileNotFoundError: [Errno 2] No such file or directory: 'ptrtest.s'

Are you sure? yes | no

Ed S wrote 09/30/2018 at 11:58

Oops, looks like you're right - posted an issue on github...

Are you sure? yes | no

Ed S wrote 09/30/2018 at 16:11

...fixed now!

Are you sure? yes | no

SHAOS wrote 09/30/2018 at 23:49

Thanks! :)

Are you sure? yes | no

Hacker404 wrote 09/18/2017 at 23:08

I like this but I can't follow the hardware setup.

The hardware is a mystery to me as I use VHDL and not Verilog.

I have lots of these Xilinx CPLDs. I am not sure if I have the small breakout that you have but I know I have may larger breakouts. I also have various packages and about 100 XC9636XL s. It would be an interesting challenge to see what could be done in a XC9536XL. I also have some Altera CPLDs like EPM240 EPM570 Cyclone II etc.

Are you sure? yes | no

Bill Rowe wrote 07/11/2017 at 20:20

This is great. Giving up an address bit to add an instruction seems like a wonderful tradeoff. How would I get started with the CPLD?

Are you sure? yes | no

Ed S wrote 07/11/2017 at 20:26

Thanks! For the CPLD, you need a Xilinx programming dongle, which they call a "Platform Adapter" - $50 or so from China - and then the free download software from Xilinx, WebPack ISE.

Are you sure? yes | no

Hacker404 wrote 09/18/2017 at 23:29

There are cheaper alternatives like this -

http://www.ebay.com/itm/311775553890

It's an Altera version with a CPLD on a breakout and a programmer for under $10. With the Altera Version the breakout board can be powered by the programmer from the USB 5 Volt supply. Very convenient.

It's an Altera EPM240 which is much bigger than the Xilinx XC9572XL. It's also 5Volt tolerant like the Xilinx XC9572XL.

Also the Xilinx ISE (WEB) is a pain to install and register because of the complex process to get a registration key even though it's free.

The Altera IDE is called Quartus and installs easily.

The Xilinx ISE will drive you mad with warnings and errors when you start and has a complex system for constraints.

The Altera Quartus is much easier to use and has a pin configuration GUI instead of .ucf files.

I'm not putting the Xilinx ISE package down. It has advantages that I like as well so I use both Xilinx ISE and Altera Quartus. It's just that it's a pain in the ass to start with the Xilinx package first.

Are you sure? yes | no

Ed S wrote 09/20/2017 at 18:34

Altera is a good alternative - but it looks like the chip is only 5v tolerant if you add series resistors. Possibly the breakout board you linked will include those, although it doesn't look like does.

Are you sure? yes | no

Hacker404 wrote 09/20/2017 at 22:30

No, that breakout doesn't have series resistors but I have never had problems mixing the EPM240 or EPM570 with TTL devices. CMOS may be different.

The EPM datasheet assumes that the driving device can drive right up to it's Vcc which for TTL is max 5.5V but in reality modern circuits have a fairly accurate 5.0V supply and TTL Voh is below the EMP's Vih max of 4.0V

For my own designs I also run the 3.3V device at its max Vcc of 3.6V to improve noise margin but all the same I have used several EPM breakout boards from ebay directly with TTL devices and never had a problem.

I did however notice that the EPM has only 100 write cycles for the configuration and user flash and that was surprising.

The EPM is a lot more versatile. I has many options for the IO pins including PCI. There is also some small user FLASH built in.

Do you need 5V compatibility? The Altera Cyclone II chips give a lot better bang for buck than the aging EPM series.

The EMP240 is about 190 macro's compared to the Xilinx XC9572XL's 72 macro's.

This kit for about $16USD is about 3500 macro's but it's not 5V compatible at all.

http://www.ebay.com/itm/162595684147

Still it's a cheap way to cut your teeth with VHDL/Verilog/Schematic entry.

Are you sure? yes | no

OPC-1 CPU for CPLD

Description

Project Logs

Collapse

OPC-1 is done!

OPC-1 gets indirect addressing!

OPC-1 gets subroutines!

OPC-1 first cut

Discussions

Similar Projects

Kobold K2 - RISC TTL Computer

DIP-8 TTL Computer

Kobold - retro TTL computer

ECM-16/TTL homebrew computer

OPC-1 CPU for CPLD

Become a Hackaday.io member

Just one more thing

Description

Project Logs Collapse

Enjoy this project?

Discussions

Become a Hackaday.io Member

Similar Projects

Does this project spark your interest?

Report project as inappropriate

Send message

Remove Member

Project Logs

Collapse