# An Ultra-Wideband-Inspired System-on-Chip for an Optical Bidirectional Transcutaneous Biotelemetry

Andrea De Marcellis, Guido Di Patrizio Stanchieri, Elia Palange, Marco Faccio

Biomedical Electronics and Photonics Integrated Systems Lab, Dept. Industrial and Information Engineering and Economics, University of L'Aquila, L'Aquila 67100, ITALY

Abstract—This paper describes an integrated communication system, implementing a UWB-inspired pulsed coding technique, for an optical transcutaneous biotelemetry. The system consists of both a transmitter and a receiver facilitating a bidirectional link. The transmitter includes a digital data coding circuit and is capable of generating sub-nanosecond current pulses and directly driving an off-chip semiconductor laser diode including all bias and drive circuits. The receiver includes an integrated compact PN-junction photodiode together with signal conditioning, detection and digital data decoding circuits to enable a high bit rate, energy efficient communication. The proposed solution has been implemented in a commercially available 0.35  $\mu m$  CMOS technology provided by AMS. The circuit core occupies a compact silicon footprint of less than 0.13 mm<sup>2</sup> (only 113 transistors and 1 resistor). Post-layout simulations have validated the overall system operation demonstrating the ability to operate at bit rates up to 500 Mbps with pulse widths of 300 ps with a total power efficiency (transmitter + receiver) lower than 74 pJ/bit. This makes the system ideally suited for demanding applications that require high bit rates at extremely low energy levels. One such application is implantable brain machine interfaces requiring high uplink bitrates to transmit recorded data externally through a transcutaneous communication channel.

#### I. INTRODUCTION

Recent progress in implantable neural microsystems have enabled combining high density cortical recording, signal processing, decoding and communication for a variety of scientific and medical applications. Applications for Brain Machine Interfaces (BMIs) in the future will utilise this functionality for actuation and communication in individuals with neurological disease and injury. A key challenge however in implantable device technology for such applications is how to achieve a high bandwidth biotelemetry in an energy efficient manner. These systems need also to operate with reduced size at low supply voltage and reduced power consumption whilst optimising electromagnetic (e.m.) compatibility and signal integrity [1], [2].

There has already been a significant amount of work in implementing a number of techniques mainly based on near and far field wireless data and/or power transfer. This includes carrier-based, narrow band and Ultra-WideBand (UWB) radio frequency (RF) links. These have however predominantly been applied to neuromodulation devices that require the downlink to have a higher bandwidth than the uplink. Downlink data has included control commands and sensory data (into the body), whereas uplink data (out of the body) has been mainly for diagnostic purposes. However for applications requiring high uplink bandwidths (e.g., required for BMIs) these methods face some fundamental limitations due to their relatively high-power consumption and low e.m. compatibility [3]–[6]. Timothy G. Constandinou Centre for Bio-Inspired Technology, Dept. of Electrical and Electronic Engineering, Imperial College London, London SW7 2AZ, UK

Optical biotelemetry links have been proposed that employ modulated or pulsed diode lasers as data transmitters and photodiodes as data receivers. These provide several desirable features such as an improved performance particularly in terms of device size, bit rate, power consumption and e.m. compatibility [2], [7]-[9]. Some improvements have been achieved by increasing the laser power, employing photodiodes with larger sensitive area and/or using modified On-Off Keying (OOK) based modulations. These approaches however tend to increase the laser/emitter response time and the signal-to-noise ratio thus limiting the system bandwidth and the maximum achievable data rate up to 100 Mbps with a power efficiency of 21 pJ/bit [2], [8], [10]. Recently, we have demonstrated an UWB-inspired optical wireless biotelemetry system implemented using commercial off-the-shelf components that is able to overcome these limitations reaching data rate up to 250 Mbps and 24 pJ/bit power efficiency [11].

This work develops this approach further by proposing a full-custom ASIC implementation of the complete bidirectional biotelemetry link. This integrates both the transmitter and receiver including all digital coding/decoding blocks as well as a laser diode driver and a photodiode transimpedance amplifier. The only external component required is the laser diode. The integrated circuit has been designed in AMS 0.35  $\mu$ m CMOS technology (C35B4C3), with core circuit occupying a compact silicon footprint of only 0.13 mm<sup>2</sup> (113 transistors and 1 resistor). Post-layout simulations demonstrate the desired system functionality to achieve data rates up to 500 Mbps at an overall power efficiency of less than 74 pJ/bit.

## **II. SYSTEM OVERVIEW**

The proposed system is illustrated in Fig. 1. The communication architecture implements an UWB-inspired optical pulsed modulation technique allowing for a high bit rate link with low power consumption [7]. The system includes both a transmitter and a receiver within each of the implanted and external (body worn) modules. The transmitter takes, as input signals, the main clock *CLK\_IN* and the bitstream *DATA\_IN* to be coded/transmitted, whilst the receiver provides the recovered clock signal *CLK\_OUT* and the decoded bitstream *DATA\_OUT*.

The transmitter includes a 'digital coding' block to modulate the voltage pulses and a 'laser driver' block for biasing and driving the laser diode, converting the voltage into current pulses. The receiver includes a transimpedance amplifier (TIA) that converts current pulses into voltage pulses and a 'digital decoding' block that takes the received voltage pulses and



Fig. 1. System overview of the UWB-inspired optical communication link implementing a pulsed data coding technique with example timing diagram.

performs the clock and data recovery. Finally, the modulation approach implemented has been tailored for a semiconductor laser diode (LD) to generate the sub-nanosecond light pulses and a fast silicon photodiode (PD) to respond with corresponding current pulses.

A basic timing diagram is also illustrated in Fig. 1. The clock and data signals are combined into a single current pulsebased bitstream. A first pulse (i.e., the clock pulse) is always generated at the beginning of each bit period, independently of the symbol to be transmitted, allowing for the transmission of the clock needed for the transmitter-receiver synchronisation (i.e., clock recovery). Then, after half a period, if the symbol '1' is to be transmitted, a second pulse (i.e., the data pulse) is generated, while for the transmission of the symbol '0' there is not further current pulse (i.e., the current preserves its minimum value).

## **III. CIRCUIT IMPLEMENTATION**

The core system including all components (except laser diode) has been implemented in AMS  $0.35 \,\mu m$  CMOS technology. The design has in particular been optimised for data throughput, power consumption and silicon area.

## A. Transmitter Circuits

The 'digital coding' block is based on simple combinational logic, consisting of  $2 \times \text{NOT}$ ,  $2 \times \text{NAND}$  and  $2 \times \text{rising}$  edge triggered pulse generators, shown in Fig. 2. Starting from the clock input *CLK\_IN*, the 'rising edge delay' block generates voltage pulses with a variable width that is regulated by the control voltage  $V_{CTRL}$ . This is based on three inverter stages formed by the transistor pairs M1-M2, M11-M12 and M14-M15. This operates as follows:

- When  $V_{CTRL}=0V$ , diode-connected transistor M5 defines the maximum resistance in the pull-down network of the M1-M2 inverter stage. In a similar fashion diodeconnected transistor M10 sets the maximum resistance in the pull-up network of the M11-M12 inverter stage. Under this operating condition the maximum delay is achieved.
- By increasing the value of  $V_{CTRL}$ , transistor M3 and current mirror M6-M7-M9 progressively turn on. This, in turn increases the current flowing through transistors M4-M8, resulting in an overall reduction in the delay between the rising edges of the input and the output signals.

• The circuit however is designed to directly respond to a falling edge of the input clock signal. This is achieved through transistor *M18* such that only the input rising edge is delayed by the *'rising edge delay'* block.

Referring now to the overall 'digital coding' block, the input *CLK\_IN* follows two different paths that provide pulse trains with a relative phase delay equal to  $180^{\circ}$  (i.e., half a clock period). These signals are then combined with the input data stream *DATA\_IN* through the NAND gates so as to provide a single output pulse train incorporating both the *CLOCK\_PULSE* and *DATA\_PULSE* components (see Fig. 1).



Fig. 2. Circuit schematic of the 'digital coding' block.

This signal then feeds the laser driver circuit, shown in Fig. 3. This is based on a current mirror stage formed by transistors M4-M5, converting the VOLTAGE\_PULSES into CURRENT\_PULSES to directly drive the LD. Moreover, this allows for the regulation of both the pulsed current amplitude and the DC current level through the two control voltages  $DC_{CTRL}$  and  $AC_{CTRL}$  that act on transistors M2 and M3, respectively.

The LD to be employed will be wire-bonded directly to the ASIC. This will be a high-speed Vertical Cavity Surface Emitting Laser (VCSEL), featuring compact size (even smaller than the ASIC) and fast response (high bandwidth) to pulsed modulations. The required VCSEL specifications are as follows: emitting wavelength  $\lambda$  in the range 800–1000 nm, a maximum output light power less than 2 mW, a beam diameter lower than 500 µm, a threshold current lower than 1 mA, rise and fall times lower than 100 ps (e.g., Philips ULM850-05-TN-N0101L [Online]). The wavelength (800-1000 nm) has been selected specifically to minimise issues related to scattering, propagation and attenuation due to the presence of the skin/tissue [12]. In this spectral region, an optical penetration depth higher than 2.5 mm and an absorption coefficient lower than 1.1cm<sup>-1</sup> are achieved. Moreover, to avoid/reduce channelcrosstalk between the external and implanted devices during simultaneous bidirectional communication, two different LD emitting wavelengths  $\lambda$  of 850 nm and 980 nm have been chosen to be used in the external and implanted transmitters, respectively.

## B. Receiver Circuits

Within the receiver subsystem, the main block is the PD interface circuit, shown in Fig. 4. This converts *CUR*-*RENT\_PULSES* received from the PD and generates the *VOLTAGE\_PULSES* signal. This is based on a single stage transimpedance amplifier (TIA) implemented by the transistors M1-M5, with resistor R1 defining the transimpedance gain. Additional gain is provided through a cascade of three CMOS inverter stages implemented by transistor pairs M6-M7, M8-M9 and M10-M11, generating the output signal *VOLTAGE\_PULSES*. Furthermore, the  $R_{EXT}$  terminal allows for the overall gain to be precisely adjusted through an optional external resistor (i.e., connected in parallel with R1).

The proposed ASIC design additionally includes five different integrated PN-junction photodiode devices that target the following specifications: maximum responsivity>0.5 A/W (at  $\lambda$ =850 nm), junction capacitance<1 pF, rise/fall times<100 ps, reverse bias voltage<3.3 V (e.g., similar to Philips ULMPIN-10-TT-N0101U [Online]). The five photodiode prototypes have been designed to test different PN-junctions (n-well/p-substrate, n+/p-substrate) and geometries (single junction, multiple parallel-connected junctions). The single junction structures tasselate 120× small unit cells aiming to minimise transit time (and thus maximise the bandwidth), with total active area 140×140 µm<sup>2</sup>. All the prototype photodiode structures include an external guard-ring to uniformly biasing the underlying p-substrate.

Resuming, the VOLTAGE\_PULSES signal generated by the PD interface circuit then feeds the 'digital decoding' block that performs the data and clock recovery (i.e., generating the CLK\_OUT and DATA\_OUT outputs). The circuit schematic is shown in Fig. 5. This consists of  $2 \times$  D-type Flip-Flops (FF\_D),  $2 \times$  inverters,  $1 \times$  'rising edge delay' block (described previously) and  $1 \times$  'phase control' block. This operates as follows:

• The VOLTAGE\_PULSES signal drives the clock input CLK of the first FF\_D, with the data input D fixed to logic level 1. In this way, the first incoming CLOCK\_PULSE sets the output of the FF\_D to a high logic level and thus any further DATA\_PULSE cannot affect the FF\_D output.



Fig. 3. Circuit schematic of the '*laser driver*' circuit (laser diode connected to the output node *CURRENT\_PULSES* as shown in Fig. 1).



Fig. 4. Schematic of photodiode interface circuit (photodiode connected to the input node *CURRENT\_PULSES* as shown in Fig. 1).



Fig. 5. Circuit schematic of the 'digital decoding' block.

- The output signal of the FF\_D\_1 is then maintained through the 'rising edge delay' block. After this fixed time delay, the 'rising edge delay' triggers the FF\_D's asynchronous reset pin RST to toggle its output back to the low logic level 0. The system is now ready to accept the next CLOCK\_PULSE.
- Since there is a fixed phase relationship between the *CLOCK\_PULSE* and *DATA\_PULSE* signals, it is essential to adjust through the *'rising edge delay'* block the pulse delay for different data rates such that: *T/2<pulse delay<T*, where T is the clock period (i.e., period between two consecutive *CLOCK\_PULSE* signals).
- Consequently, the output of the first FF\_D provides the recovered clock signal *CLK\_REC* but with a duty cycle higher than 50% (i.e., half a clock period plus the additional pulse delay).
- The second FF\_D then receives the inverted *CLK\_REC* and acquires the data input provided by *DATA\_PULSES*. In order to avoid a metastable state in FF\_D\_2, a '*phase* control' has been included to guarantee the FF\_D\_2 setup and hold times. Referring to Fig. 5, the '*phase* control' block consists of  $4 \times$  inverter stages implemented by transistor pairs *M1–M2*, *M5–M6*, *M7–M8* and *M9–M10*. Also, transistors *M3–M4* are driven by control voltages  $V_{CTRL_U}$  and  $V_{CTRL_D}$  that allow for the pulse width of the received *VOLTAGE\_PULSES* signal to be extended.
- Finally, FF\_D\_2 provides the decoded data *DATA\_OUT* signal with a stable value corresponding to the rising edges of the recovered clock signal *CLK\_OUT*.

## IV. POST-LAYOUT SIMULATION RESULTS

The overall system has been developed using the Cadence Design Systems Virtuoso tool suite using foundry-provided simulation and physical verification models. All the blocks described previously have been integrated into a single integrated circuit design targeting a full-duplex (concurrent up- and downlink transmissions) bitrate of 500 Mbps. The complete system layout including integrated photodiodes and physical bondpads (with ESD protections) is shown in Fig. 6.

The overall size of complete test chip design is  $1.6 \times 2 \text{ mm}^2$ . It should be noted that the core circuits themselves only occupy approximately 0.13 mm<sup>2</sup> (the transmitter: 53 transistors and 1 resistor; the receiver: 60 transistors). The circuit operates from a single 3.3 V supply voltage, requiring a maximum total power consumption (with both transmitter and receiver active) of less than 37 mW at a data rate of 500 Mbps and a pulsed signal width of 300 ps. Fig. 7 shows a post-layout simulation (transient analysis) demonstrating the system operating with a clock signal of 500 MHz (i.e., 500 Mbps data rate). The transmitter uses a pulse width (i.e., for *CURRENT\_PULSES*) of 300 ps with minimum and maximum current levels of approximately 1 mA and 25 mA, respectively (for driving the LD). On the other hand, the receiver takes a *CURRENT\_PULSE* input signal of about 70  $\mu$ A (emulating the expected PD photocurrent based on our previous measurements [7], [11]). It is important to observe also that the recovered clock signal presents a duty-cycle of about 70% which, if required, could be suitably compensated by a duty-cycle correction circuit [13].

These results demonstrate that the transmitted repeated bit sequence  $\{0, 1\}$  is correctly decoded by the receiver and is in synchronisation with the recovered clock signal. Moreover, multiple post-layout simulations have been performed to fully characterise the system at different operating frequencies (i.e., different data rates) in addition to varying pulse widths – to assess the overall system power consumption. Table I summarises the simulated power efficiencies of both the transmitter and receiver blocks for frequencies of 125 MHz, 250 MHz and 500 MHz, and pulse widths of 300 ps and 500 ps for a fixed pulse current of 25 mA.

TABLE I POWER EFFICIENCY OF TRANSMITTER (TX) AND RECEIVER (RX) SUB-SYSTEMS UNDER DIFFERENT OPERATING CONDITIONS.

| Data rate | Pulse width = 300 ps |             | Pulse width = 500 ps |             |
|-----------|----------------------|-------------|----------------------|-------------|
| (Mbps)    | Tx (pJ/bit)          | Rx (pJ/bit) | Tx (pJ/bit)          | Rx (pJ/bit) |
| 125       | 85                   | 121         | 94                   | 121         |
| 250       | 57                   | 67          | 65                   | 67          |
| 500       | 34                   | 40          | 41                   | 40          |

## V. CONCLUSION

This work has reported a CMOS integration of a full-duplex communication system, implementing an UWB-inspired pulse coding technique, for an optical transcutaneous biotelemetry. Post-layout simulations have shown that the developed system is capable of transmitting data rates up to 500 Mbps with a 74 pJ/bit total energy efficiency. This capability (both the data throughput but also power levels) would allow for future



Fig. 6. Complete layout design of the optical biotelemetry system.



Fig. 7. Post-layout simulation results: key signals within the communication system operating at 500 Mbps and transmitting a repeated  $\{0, 1\}$  bitstream.

integration with implantable intracortical recording systems [14]–[16] allowing for transmission of over 2,000 channels of raw data. The full custom integrated circuit is currently in fabrication.

#### REFERENCES

- H. Yuan and B. He, "Brain computer interfaces using sensorimotor rhythms: current state and future perspectives," *IEEE Trans. Biomed. Eng.*, vol. 61, no. 5, pp. 1425–1435, 2014, [Online].
- [2] T. Liu, J. Anders, and M. Ortmanns, "Bidirectional optical transcutaneous telemetric link for brain machine interface," *Electronics Letters*, vol. 51, no. 24, pp. 1969–1971, 2015, [Online].
- [3] H. Miranda et al., "HermesD: a high-rate long-range wireless transmission system for simultaneous multichannel neural recording applications," *IEEE Trans. BioCAS*, vol. 4, no. 3, pp. 181–191, 2010, [Online].
- [4] A. Ebrazeh and P. Mohseni, "30 pJ/b, 67 Mbps, centimeter-to-meter range data telemetry with an IR-UWB wireless link," *IEEE Trans. BioCAS*, vol. 9, no. 3, pp. 362–369, 2015, [Online].
- [5] M. Elzeftawi and L. Theogarajan, "A 10 pJ/bit 135 Mbps IR-UWB transmitter using pulse position modulation and with on-chip LDO regulator in 0.13 μm CMOS for biomedical implants," in *Proc. IEEE BioWireleSS.* IEEE, 2013, pp. 37–39, [Online].
- [6] M. Crepaldi *et al.*, "An ultra-wideband impulse-radio transceiver chipset using synchronized-OOK modulation," *IEEE JSSC*, vol. 46, no. 10, pp. 2284–2299, 2011, [Online].
- [7] A. De Marcellis *et al.*, "A pulsed coding technique based on optical UWB modulation for high data rate low power wireless implantable biotelemetry," *Electronics*, vol. 5, no. 4, p. 69, 2016, [Online].
- [8] T. Liu *et al.*, "In vivo verification of a 100 Mbps transcutaneous optical telemetric link," in *Proc. IEEE BioCAS*. IEEE, 2014, pp. 580–583, [Online].
- [9] D. M. Kuchta *et al.*, "A 56.1 Gb/s NRZ modulated 850 nm VCSELbased optical link," in *Proc. IEEE OFC/NFOEC*. IEEE, 2013, pp. 1–3, [Online].
- [10] K. Chi *et al.*, "Energy-efficient prefix-free codes for wireless nanosensor networks using OOK modulation," *IEEE Trans. Wireless Comms.*, vol. 13, no. 5, pp. 2670–2682, 2014, [Online].
- [11] A. De Marcellis *et al.*, "A 250 Mbps 24 pJ/bit UWB-inspired optical communication system for bioimplants," in *Proc. IEEE BioCAS*. IEEE, 2017, pp. 1–4, [Online].
- [12] A. Bashkatov *et al.*, "Optical properties of human skin, subcutaneous and mucous tissues in the wavelength range from 400 to 2000 nm," *Journal of Physics D: Applied Physics*, vol. 38, no. 15, p. 2543, 2005, [Online].
- [13] A. De Marcellis, M. Faccio, and E. Palange, "A 0.35 um CMOS 200 kHz-2 GHz fully-analogue closed-loop circuit for continuous-time clock duty-cycle correction in integrated digital systems," in *Proc. IEEE ISCAS*. IEEE, 2018, pp. 1–4, [Online].
- [14] Y. Liu *et al.*, "A 64-channel versatile neural recording SoC with activitydependent data throughput," *IEEE Trans. BioCAS*, vol. 11, no. 6, pp. 1344–1355, 2017, [Online].
- [15] J. J. Jun *et al.*, "Fully integrated silicon probes for high-density recording of neural activity," *Nature*, vol. 551, no. 7679, p. 232, 2017, [Online].
- [16] S. Luan *et al.*, "Compact standalone platform for neural recording with real-time spike sorting and data logging," *J. Neural Eng.*, vol. 15, no. 4, 2018, [Online].