Stuff I do

From Scratch: building a Time Domain Reflectometer

2019-11-09T06:24:00.003-08:00

TDR or Time Domain Reflectometry is a method for measuring transmission lines - cables and alike - used for the transmission of high-speed signals.

TDR can be used to check the length of the cable, can detect the presence and the place of a short, break, and other errors.

First, let's take a look at what we're going to build and what we can do with it.

Transmission line

A transmission line is a specific cable or other structure that's used to conduct high-frequency AC current. A transmission line might carry voice, provide AC mains to your home or carry very low voltage radio frequency signals.

We mostly care about radio frequencies of small or moderate power, so "transmisson line" will mostly mean coaxial (unbalanced) cables, mostly 50 Ohm RG58, H155, H1000, etc. The devices in this article can be modified to work with cables of various characteristic impedance and balance. (Worst case you might need to use a transformer.)

Velocity factor

TDR can be used to measuring feed line (cable) length or velocity factor. Velocity factor is a number given in percents or in fractions of one, and it is the ratio of the signal propagation speed in the transmission line to the speed of light in vacuum. A velocity factor of 66% or 0.66 means the signal propagates at 0.66 * 300,000 km/s = 198,000 km/s.

The method

If a signal is sent along a cable, it will (partially) reflect from the end unless it is terminated with the characteristic impedance. Receiving the reflected signal and measuring the time of flight makes possible to calculate the cable length OR the velocity factor. Knowing one is needed to calculate the other.

We're going to use short, sharp pulses. Such pulses can be generated with cheap transistors working in avalanche mode, and can be read using cheap oscilloscopes (relatively speaking). To make the transistors work well in avalanche mode, we're going to need a voltage well in excess of one hundred volts.

The setup

The measurement setup consists of 3 devices:

An oscilloscope. An old analog or a modern (but cheap) digital one will suffice. 100MHz of bandwidth is probably the minimum, 200MHz is recommended. More will not hurt, but faster scopes get expensive quickly. It must either have a 50 Ohm input or must be used with proper termination.
A high voltage generator with about 200V output (it needs to supply little current).
An avalanche transistor pulse generator. Sounds much more complicated than it really is.

The scope

I'm going to use a Hantek DSO5202P scope:

This is how a pulse and it's reflection looks like on the screen:

An Old HP 1745A will work just fine:

I think this is prettier :) Pulse and reflection without termination:

...and with proper termination:

The high voltage

I built the high voltage generator myself. The first version was a simple Zener diode shunted uncontrolled boost converter. This one pulled about 25mA out of a little PP3 9V battery. Googling "9V battery discharge curve" and looking at the first few hits hints at about 20 hours of service time.

The current could be reduced to 13mA by redesigning the circuit. This is almost the half of the original current draw, and the service life is likely more than doubled. The new version is actively controlled: one the output voltage is sufficient to open up a chain of Zener diodes, the duty cycle of the PWM signal driving the booster is reduced.

Here's the final version of the circuit:

This was captured and simulated in LTSpice.

The LTSpice files for this circuit can be found here: https://github.com/netom/high-voltage

The circuit charges C4, a 100nF capacitor. Make sure to use one that is rated more than 200V, I used a 630V model just to be safe.

The high voltage is generated by Q3 and L1. Q3 is driven by a square wave of varying duty cycle between 50% and 0%. The mosfet opens and closes quickly at about 7Khz. L1, the 10mH inductor kicks up the voltage on the drain of Q3 when it closes. The energy of the kick is determined by how long Q3 has been held open, and that is in turn controlled by the duty cycle of the square wave on it's gate.

The driving signal is provided by the 555 integrated circuit. It is wired as an astable multivibrator. The ICs CV pin (control voltage) is modulated by Q2.

Q2 is opened if the voltage on the output (across C4) reaches a voltage where the Zener diode chain - consisting of D3, D4, and D5 - and the BE junction of Q2 is opened. Current flows through R6, D5, D4, D3 and the BE junction of Q2 to the ground. The transistor starts to conduct, and reduces the on the CV pin. (Inside the 555 the CV pin is connected to a simple resistive voltage divider.)

With the voltage on the CV pin dropping, the 555 reduces the duty cycle of the square wave. Q3 conducts for a shorter time, L1 will provide less of a kick, delivers less charge to C4, reducing the voltage.

When the output is not loaded, this control loop keeps CV fairly low, close to the saturation voltage of Q2, about 0.1V. Q2 will pull about 1.8mA out of the CV pin to achieve this. Assuming an "average" 2N3904 transistor, the base current will be about 12uA. R6 will help limit the current through the diodes and the base of Q2, and provides low-pass filtering together with C3.

Q1 provides feedback by liting up or closing off D1. The LED is labelled as "CTRL" on the box, it means "attention, ConTRLol voltage is too high". I choose a random small red LED, the part number on the schematic is just on other random one from LTSpice's library.

This LED turns on whenever the control voltage is large enough to open Q1 thorugh the voltage divider formed by R4 and R5. D1 is therefore OFF, if the output voltage can be maintained with a low duty cycle PWM, and therefore low current. Presenting a large enough load to the box will cause this LED to turn on.

If there is significant current draw, D2 lights up. This makes this power supply an excellent cable tester: connecting an unterminated cable to the output should NOT cause D2 to turn on. A shining D2 means that there's significant current draw from the output, therefore the cable is either shorted, or it's leaking a bit at high voltages (water or dirt in the connector, half-broken shielding, etc).

It is important to place C3 "after" the diodes, to Q2's base. Placing it next to R6 will cause loss of precise voltage control. The screenshot below was recorded on an oscilloscope measuring the voltage between the ground and the conductor between D3 and D4. (Measuring the voltage of the conductor between D4 and D5 yields similar results.) Notice the sawtooth-ish waveform:

The jumps on the waveform are cause by D4 or D5 going into conduction quickly, draining charge from C3. Since high voltage Zener diodes conduct in avalanche mode, they might have a negative resistance region. This is seems to be a less-known fact, I found little information on the subject. (Here is some: https://www.semanticscholar.org/paper/Negative-resistance-characteristics-of-Zener-diode-Zhang-Takaoka/3a041fb37b8744138edda1c9711b9b0d2ca096e2)

So when the diodes just start to conduct, they migh go into negative differential resistance mode, and deliver a quick burst of charge. These bursts are "kicking" on Q2's base, opening the device. The CV pin is yanked low.

Following the kick, the voltage on C3 drops, and conduction stops briefly. The CV pin is allowed to recover. This results in a jolt of power to be delivered onto the output, causing the CTRL LED to turn on frequently, and shine dimly. The output voltage is noisy, and the overall current drain is higher than it should be.

Placing C3 right to the base of Q2 means at one hand less charge for the avalanching Zener diodes to work with, and provides some buffer at the base of Q2 where the avalanche noise arrives. The right placement of C3 therefore reduces the avalanche noise by not being upstream so not providing power to the avalanches, and second by being at the base of Q2 and providing a reservoir of capacitance where the (much smaller) avalanches of charge may arrive without causing much disturbance.

The output of the device is a BNC socket. It is connected across C4, the hot center conductor is connected via a resistor. This can be as large as 100K if the application allows it. A bleed resistor across C4 is also helpful, even if it as large as 100MOhms. I do recommend using a 100K output resistor and a 10-100MOhm bleed resistor across C4.

The device was built in a box made out of single-sided copper clad FR4 sheets. Note that the hot part of the socket is hard to touch, but not impossible to do so:

The ciruit itself was built on a piece of unetched PCB. One side is ground, the other side is +9V. Most componets were soldered onto / over the ground side. This is the first version with simple shunt regulation and discrete transistor astable:

Yes, I did touch hot wires accidentally. Yes, it hurt a lot. It's no fun, don't do it.

The pulse generator

Please excuse the hand-drawn schematic:

I didn't bother to draw it in LTSpice, because LTSpice doesn't simulate avalanche effects.

I'm not going to go into details regarding the operation of this circuit. I just really would like to give an overview and some building instructions so it can easily be reproduced. If you'd like to know everything, take a look at this great article by Kerry D. Wong explaining it all:

http://www.kerrywong.com/2013/05/18/avalanche-pulse-generator-build-using-2n3904/

For an even more detailed description he directs us to this paper:

https://icecube.wisc.edu/~kitamura/NK/Flasher_Board/Useful/research/RSI02253.pdf

For the avalanche transistor we're going to use a simple 2N3904. Other transistors will probably work unless they have a CE breakdown voltage in excess of 150-200V. The collector gets the 200V voltage via a 1MOhm resistor. A small capacitor of 10pF is also connected to the collector and the other terminal of the capacitor goes to ground.

Although the 2N3904 is rated for 40 volts, most pieces will break down near or even well over a hundred volts. Don't be surprised of the first one out of your drawer can withstand more than a hundred and fifty!

The capacitor should be one with very low inductance and low ESR. Small, high-frequency NP0 types are probably fine. I used a silver mica capacitor because I had some laying around.

The base of the transistor should be connected to ground via a 22K resistor. The resistor value was determined by trial and error. This value seemed to achieve the narrowest and tallest pulses (beside of just working at all, of course) out of a handful of other values.

The circuit is working with frequencies in the multiple hundred megahertz range, so keep every connection as short as possible and/or use transmission lines of proper characteristic impedance and terminations.

The final device has three terminals:

One for the high voltage power supply
One for the pulse output to the measured feedline
One for the oscilloscope

I choose to build this circuit into a small, shielded box with three BNC sockets. The "pulse out" sockets' hot pins touch at the middle. The transistor is soldered there with as short a lead as possible. The capacitor and R2 are also soldered with short leads. The lead lengths of R1 are of little significance.

Pack everything real tight to avoid reflections that smear and blunt the nice and sharp peak.

The generator itself is also built into a box made out of single-sided unetched PCB sheets.

The circuit works like this:

C1 is charged through R1 with about 200 volts, and therefore - at the beginning - 200 microamperes. The avalanche breakdown occures at a random voltage somewhat (even substantially!) higher than what the part is rated for.

This throws the transistor wide open, and it dumps the charge in C1 onto the 50 ohm feedline. Since the line is 50 ohm in both directions, the transistor will see a 25 ohm resistance (and some inductance because of the leads and wires of and inside the package). C1 empties quickly, the current stops, the transistor closes. C1 starts to charge again.

When the transistor is closed, it presents high impedance to the feedline, causing virtually no reflections. The signal can "pass under the transistor" unchanged.

The pulses occur in a somewhat random fashion around a characteristic frequency. This frequency is influenced by R1, C1, the input voltage, and the actual breakdown voltage of the transistor.

The time between the pulses can be calculated like this:

t_fire = - log(1 - V_breakdown / V_input) * R1 * C1

With my particular 2N3904 with 160V (!) actual breakdown voltage, 200V supply, 1Mohm resistor, 10pF capacitor:

- ln( (1 - 160 / 200 ) * 1 * 10^6 * 10 * 10^-12 = 16.1 * 10^-6

or 16.1 milliseconds, wich means about a 62 kilohertz pulse train.

The frequency of the output can be changed by changing R1. Larger R1 means lower frequency.

Measurement setup

To measure the cable length or valocity factor, the setup should be similat to this:

The generator is powered by the HV power supply via a short pigtail. Don't use a long one, since these cables act as capacitors, and might shock you when charged up.

The two "pulse out" sockets are connected to the cable under test and the oscilloscope. It doesn't matter which one is which.

My oscilloscope only has a high impedance input, so I needed to terminate the cable with a BNC tee adapter and a 50 ohm terminator.

The first pulse is coming straigh from the generator. The second one is being reflected off the unterminated end of the cable.

Look what happens if we terminate the cable properly:

Shorting the end of the cable causes the reflected pulse to be inverted:

I had a few ElectroBoom moments with this experiment. I can't whole heartedly recommend this exact experiment setup. If you don't know why, you shouldn't build this. :)

Measuring cable length

A piece of RG-58 has a velocity factor of 0.66. The question is, how long is it, if the scope screen looks like this:

The time between the pulses is 164 nanoseconds. Since the velocity factor is 0.66, the speed of signal propagation is 198e6 m/s.

So in 1 second, the signal travels 198*10^6 meters. In 164 nanoseconds, or 164*10^-9 seconds, it travels:

0.66 * 300e6 * 164e-9 / 2 = 16.236 ~= 16 meters and 23 centimeters.

The velocity factor is multiplied by the speed of light to yield the real signal propagation speed. Multiplied by the measured time we get length. Dividing by two is necessary since the signal travels along the cable twice.

Measuring velocity factor

The velocity factor can easily be calculated by measuring the cable length and the time it takes for a pulse to travel along it down and up again.

Using the numbers above, if I have a cable 16 meters and 23 centimeters long, and I measure 164 nanoseconds between pulses, then I can calculate the velocity factor like this:

( 16.23 / 300e6 ) / ( 164e-9 / 2 ) ~= 0.66

First the cable length is divided by the speed of light to yield the time it takes for and electromagnetic signal to travel the given length in empty space. This time is divided by half of the measured time (since the signal travels along it twice).

Installing Kubernetes on my Ubuntu laptop

2019-09-29T02:03:00.001-07:00

Why?

Since the company I'm working with uses Kubernetes in production, I like to have a sandbox handy to try upgrades and just for general experimentation.

Prerequisites

I'm using Ubuntu 19.04 "Disco Dingo".

I'm going to install Kubernetes v1.15.4. The current newest version works a bit differently that makes the latest stable flannel release not to work properly. The procedure below can probably be repeated with now-current (in reader-time :) ) versions as long as they're compatible.

Docker

The docker I choose is version 18.09. Check the supported docker version with your Kubernetes version. Do not skip this step. I found this information in the release notes for v1.15: https://v1-15.docs.kubernetes.io/docs/setup/release/notes/

Flannel

I'm using flannel with the local installation. The latest stable at this time is v0.11.0. I found no information on the compatiblity of different flannel and Kubernetes versions, so I just tried the latest of both and I got lucky the first time.

If I'd start the installation today, I'd be out of luck since flannel v0.11.0 does not work with Kubernetes v1.16.0. The flannel manifest on the master branch works fine, but I'd rather use a stable release, and hence I'd go back to use Kubernetes v1.15.4.

The problem with flannel v0.11.0 and Kubernetes v1.16.0 is that it stopped serving certain deprecated APIs, namely DaemonSet has been moved from extensions/v1beta1 to apps/v1beta2 (see https://kubernetes.io/blog/2019/07/18/api-deprecations-in-1-16/ for more details).

Even after fixing the v0.11.0 flannel manifest, the CNI wouldn't work properly, and "kubectl describe node <nodename>" shows "network plugin is not ready: cni config uninitialized". This is probably an other incompatibility between my particular flannel - Kubernetes versions.

Network setup

I'd like to run the (one-node) cluster on a laptop. This laptop may or may not have network connectivity, and both the public facing interface and IP address will change a lot.

Unfortunately Kubernetes does not like to work with loopback interfaces. I couldn't get Kubernetes

to use the loopback address neither as the advertised API server address nor as the node IP.

So I needed an interface with a static IP regardless of Internet connectivity.

Enter the dummy interface.

A dummy interface on Linux is really just another loopback device: it does the same thing really without actually being a loopback device. This is important, since I could find lines in the Kubernetes code that were meant to actively avoid using the loopback device. Since a "dummy" is not "loopback", Kubernetes will be happy with one, and will need little persuasion to use this dummy interface with a static address.

So let's edit /etc/network/interfaces and add these lines:

auto aaa0
iface aaa0 inet static
address 172.31.0.1/16
pre-up modprobe dummy; ip link add aaa0 type dummy
post-down ip link del aaa0

I'm using a name that will make the interface appear on the top of the list provided by "ifconfig". I don't know if this has any importance in our case. You could probably use any name.

Docker setup

Let's install the appropriate docker-ce version, but first let's make sure no conflicting version is installed:

$ sudo apt remove docker docker-engine docker.io

$ sudo apt install apt-transport-https ca-certificates curl software-properties-common

$ curl -fsSL https://download.docker.com/linux/ubuntu/gpg | sudo apt-key add -

$ sudo apt-add-repository "deb [arch=amd64] https://download.docker.com/linux/ubuntu cosmic stable"

You can view the available version with apt-cache madison:

$ apt-cache madison docker-ce

docker-ce | 5:19.03.2~3-0~ubuntu-cosmic | https://download.docker.com/linux/ubuntu cosmic/stable amd64 Packages

docker-ce | 5:19.03.1~3-0~ubuntu-cosmic | https://download.docker.com/linux/ubuntu cosmic/stable amd64 Packages

docker-ce | 5:19.03.0~3-0~ubuntu-cosmic | https://download.docker.com/linux/ubuntu cosmic/stable amd64 Packages

docker-ce | 5:18.09.9~3-0~ubuntu-cosmic | https://download.docker.com/linux/ubuntu cosmic/stable amd64 Packages

docker-ce | 5:18.09.8~3-0~ubuntu-cosmic | https://download.docker.com/linux/ubuntu cosmic/stable amd64 Packages

docker-ce | 5:18.09.7~3-0~ubuntu-cosmic | https://download.docker.com/linux/ubuntu cosmic/stable amd64 Packages

docker-ce | 5:18.09.6~3-0~ubuntu-cosmic | https://download.docker.com/linux/ubuntu cosmic/stable amd64 Packages

docker-ce | 5:18.09.5~3-0~ubuntu-cosmic | https://download.docker.com/linux/ubuntu cosmic/stable amd64 Packages

docker-ce | 5:18.09.4~3-0~ubuntu-cosmic | https://download.docker.com/linux/ubuntu cosmic/stable amd64 Packages

docker-ce | 5:18.09.3~3-0~ubuntu-cosmic | https://download.docker.com/linux/ubuntu cosmic/stable amd64 Packages

docker-ce | 5:18.09.2~3-0~ubuntu-cosmic | https://download.docker.com/linux/ubuntu cosmic/stable amd64 Packages

docker-ce | 5:18.09.1~3-0~ubuntu-cosmic | https://download.docker.com/linux/ubuntu cosmic/stable amd64 Packages

I'd like to use version 18.09, so I'll issue the following commands:

$ sudo apt install docker-ce=5:18.09.9~3-0~ubuntu-cosmic

$ sudo apt-mark hold docker-ce

Let's create the file /etc/docker/daemon.json with the following content:

{
"exec-opts": ["native.cgroupdriver=systemd"],
"log-driver": "json-file",
"log-opts": {
"max-size": "10m",
"max-file": "10"
},
"storage-driver": "overlay2"
}

The most important part is the cgroup driver set to systemd. I'm going to use overlay2 as the storage driver merely because it works and does not require any further setup. Logging will take place in the JSON format, and log-opts will prevent docker from keeping too much of them.

Since this is my laptop we're talking about here, I don't like docker or Kubernetes to start automatically, I don't need them all the time, especially not when I'm using the laptop while travelling or otherwise being contrained by the battery, so I'll go ahead and prevent docker from starting automatically, and also restart it for the above configuration to take effect:

$ systemctl disable docker
$ service docker restart

Docker is ready and should be running (check it with "service docker status").

Installing Kubernetes

Let's add the appropriate public key to apt's keystore, add the repository, and install the correct version. (Hint: use apt-cache madison to check for available versions).

$ curl -s https://packages.cloud.google.com/apt/doc/apt-key.gpg | sudo apt-key add

$ sudo apt-add-repository "deb http://apt.kubernetes.io/ kubernetes-xenial main"

$ sudo apt install kubelet=1.15.4-00 kubeadm=1.15.4-00 kubectl=1.15.4-00

$ sudo apt-mark hold kubelet kubeadm kubectl
$ sudo systemctl disable kubelet

That's it, we have all the software installed on the local system for Kubernetes to work.

Setting up Kubernetes

I'm going to use kubeadm to take care most installations steps. Let's write out the default configuration first:

$ kubeadm config print init-defaults | sudo tee /etc/kubeadm.conf

Open that file in an editor, and look for the "networking" key. Add the "podSubnet" under it.

networking:
...
podSubnet: 10.244.0.0/16
...

At the end of the file, add the following snippet:

---
apiVersion: kubelet.config.k8s.io/v1beta1
kind: KubeletConfiguration
failSwapOn: false
evictionHard:
imagefs.available: 5%
memory.available: 100Mi
nodefs.available: 5%
nodefs.inodesFree: 5%

This will make sure that Kubelet will start with swap on.

The other thing I'm going to deal with is the node IP. Kubelet will still going to use the public-facing interface (eth0 or wlan0 in my case) and the IP address of it as the node address. Kubelet's documentation mentions the --node-ip option.

Reading kubelet's systemd service unit file revealed that extra options can be passed to kubelet via the KUBELET_EXTRA_ARGS variable. The variable should be defined in /etc/default/kubelet:

KUBELET_EXTRA_ARGS="--node-ip=172.31.0.1"

We can now fire up kubeadm and it will set up our single node cluster.

$ sudo kubeadm init --config=/etc/kubeadm.conf --ignore-preflight-errors=Swap

I need to use the --ignore-preflight-errors=Swap option to tell kubeadm that it is OK to proceed with swap enabled. Kubeadm and Kubelet are two independent components, each of them must be told to ignore swap being turned on.

I'm going to get warnings about the swap, and that docker is disabled, but that's OK.

To be able to use kubectl I need authentication. A configuration file with the appropriate credentials is written to /etc/kubernetes/admin.conf. Copying it to ~/.kube/config enables me to use kubectl:

$ mkdir $HOME/.kube
$ cp /etc/kubernetes/admin.conf $HOME/.kube/config

The cluster might need a minute or two to start up, but after that, I should see some activity:

$ kubectl get -A pods -o wide

There sould be several pods running, and coredns pending.

$ kubectl get nodes

reveals that the node is not ready yet. The problem is that I didn't install any CNI yet. So go ahead and let's install flannel:

$ kubectl apply -f https://raw.githubusercontent.com/coreos/flannel/v0.11.0/Documentation/kube-flannel.yml

The flannel manifest - among other things - contains DaemonSet definitions for kube-flannel-ds. This pod should start up shortly, and coredns should follow in a minute or so.

The cluster is ready for use. Just let's not forget to start it manually when needed:

$ service docker start
$ service kubelet start

The cluster need a minute or two to start, be patient. When not needed, the cluster can be stopped:

$ service kubelet stop
$ service docker stop

The last command will stop any running containers, so do use it to completely shut the cluster down.

Building a radiation detector from scratch 4 - Detector MK II

2019-04-11T03:34:00.003-07:00

Since I'd like to keep up with the head-spinning pace of interesting articles, let me present you this year, :) the

In the last episode I described an ion chamber radiation detector. It works very well: stable and sensitive enough for doing simple experiments.

It can be improved though in several ways. Here follows a series of ideas and a circuit diagram. :) I did not build this yet, but I'll build it "soon". In this decade, or the next. :P

High voltage

The voltage potential of the outer shell of the detector (the tin can) could be raised up to several hundred volts to provide more sensitivity.

Teralab did wonderful experiments: http://www.teralab.co.uk/Experiments/Ion_Chamber/Ion_Chamber_Page1.htm.

According to their results, it's well worth increasing the voltage between the transistor's base and the outer shell up to 200V, after that we get diminishing returns.

Raising the voltage could be done in several ways in practice. One could by a DC-DC converter module used to charge flash capactiors, or build a (rather large) charge pump, homebrew a simple boost converter, drive a PCB mains transformer in reverse, etc. Just make sure the voltage source is stable enough at least in the short term. Any noise from the supply will be picked up by the very sensitive detector circuit.

Isolation and safety

Working with voltages in excess of a few dozen volts poses a shock hazard. Careful measures should be taken to minimize the danger of shock during work and use.

It's best to put everyting into a solid metal housing, and don't let any leads or pins out that has dangerous voltages on it.

Because of these, I decided to use optocouplers as the output on a separate battery, and also to separate the high voltage power supply. This makes it possible to build a modular solution, whereby you can choose a separate high voltage (or not so high voltage) power supply that is designed and built separately.

One can use the internal battery also simply by shorting out the high voltage power supply connector. Since under no condition should any significant current flow through that connector, a simple 50 Ohm off-the-shelf termination resistance can be used also.

Below is the schematic of the Mark II Ion Chamber Radiation Detector:

Single-ended output

A simple operational amplifier circuit might be used to deliver a single-ended voltage output. The OPA could use it's own power suply, or utilize the optocoupler's battery. Therefore it is recommended to create three output pins / sockets / leads: "output" (left resistor top lead), "reference" (right resistor top lead), and "Vcc" (optocoupler Vcc).

Resistors should be choosen so the voltages respect the OPA's capabilities. The "SingleOut" output will be our single-ended output. Since this circuit can only output positive voltages, "SingleOut" will peg to zero if "Output" goes below "Reference".

This should't happen in theory, since there always will be a slight background radiation, and therefore some current in the chamber. If the circuit starts to drift however, it might happen, and ruin measurements in case of an unattended setup. Since the last version of the chamber proved to be surprisingly stable, I believe this will hardly be a problem.

Data logging

There are a lot of possibilities for recording and/or transmitting voltage data. The single-ended output is easily interfaced with and arbitrary device capable of measuring and displaying / logging / transmitting voltages. This can be an analog uamp meter with a suitable resistor in series, a digital voltage meter module, a DVM, and arduino with an SD-card or Bluetooth module, etc.

I'm going to use the MK II with a DVM and probably try to hook it up to an Arduino Nano, to interface it with a computer throug USB and log data, and might even set up the Arduino to log the data to an SD card.

Next time we'll take a look at the built detector and the measurement results.

Contacting the International Space Station

2018-01-26T07:07:00.003-08:00

The International Space Station (ISS) is a huge (>400 tonnes mass, >900m3 pressurized volume) habitable satellite on low earth orbit. It orbits the Earth at about 400km high with a speed of 7.67km/s (27,600 km/h), and goes around it in about every 93 minutes.

It's permanently crewed, can support up to 6 astronauts, and provides a platform for many scientific experiments. Besides astrobiology, astronomy, materials science, and many other fields, it hosts experiments for the ARISS - "Amateur Radio on the International Space Station" project.

ARISS experiments give opportunities to observe or even contact the station with simple amateur radio equipment.

Activities and Equipment

There are various activities onboard that can be monitored, or taken part of with relatively simple equipment. No large parabolic antenna and automatic tracking is needed.

At the time of writing this post, the ISS is 403 Km high. The figure above shows how it's distance can be calculated when it just appears on the horizon. At that time, it is merely 2300 Km away. This is the maximum distance we need to cover.

The calculation goes like this: D is where we stand, E is the ISS, C is a point directly under it, A is the centre of the Earth. Earth's radius is about 6371 kilometres. The AD radius is perpendicular to the plane tangent to the Earth's surface at point D. E is being on the horizon, ED is perpendicular to AD, so AED is a right angled triangle, the distance we need to calculate is ED. AD is known, AC and EC, therefore AE is also known. Pythagoras will do the rest.

In practice, you only want to try and contact / listen to the station when it is at least 10-15° up, and it is somewhat closer.

That does sound like a great distance. However anyone who participates in amateur radio knows that such a distance can easily be covered with a couple, or couple of dozen watts of power and cheap antennas even on the shortwave bands and with skywave propagation.

Since the ISS is high above, there's nothing between us to shield the signals. The ARISS experiments are usually working on the 2 meters (145 MHz) and 70cm (433 MHz) bands. It is quite practical to build or buy good directional antennas for these bands. A Yagi-style one with a reflector and two directors yields excellent results on 2 meters, a few more on 70 centimetres is probably a good idea.

The mode of operation on these bands is almost always FM, so a simple handheld transceiver (or scanner) will do the job just fine. I use my Kenwood TH-D72E. This is a more pricey HT, and it's a very good one, but a cheap one would also do OK.

All in all, what you really need is a handheld radio or scanner, a portable (not too large) Yagi antenna for the band, and you're good to go.

...but what can you do with it?

Contact with ground crew

Astronauts sometimes use simple FM radio to keep contact with the ground crew in Russia on 143.625MHz.

An example of receiving these signals can be seen here (and may videos on YouTube):

School contacts

The astronauts onboard often participate in events called "school contacts". During such an event a radio link is established between a school and the ISS. If the contact is voice only, then it is often made at 145.800 MHz, and can easily be listened to with the setup described above.

Note that during the event only the ISS can be heard, the school will very likely be out of range - to be more precise - be obstructed by buildings, hills, or the curvature of the earth.

An example of receiving the ISS during a school contact at 145.800 MHz:

SSTV

On special occasions the ISS transmits images at 145.800 MHz using FM and SSTV, often in the PD180 mode.

An SSTV image is just a burst of sound of almost musical quality. To decode it and turn the sound into an image, the radio must be connected to a computer or smart phone, and a decoder program must be running.

I use a SignaLink USB device (http://www.tigertronics.com/slusbmain.htm) to connect my handheld Kenwood TH-D72E radio to my Android phone with an USB-OTG adapter.

I use QSSTV on the PC and an app called Robot36 on my Android phone for SSTV.

Space Comms on YouTube has a nice video of receiving an SSTV transmission :

Here's the image I managed to capture during the last activity:

APRS

APRS is a method of transmitting small data packets over radio, most of the time this means 2 meters, and FM.

APRS nodes may form a network where each node might repeat certain packets, so they jump from node to node and can cover great distances despite each node being a simple, low-power transceiver.

It is most often used to transmit position and some textual information. This can be observed on https://aprs.fi/, a site showing currently active APRS nodes, and lots of information about the network.

There is an APRS repeater (or digipeater) onboard the ISS, operating at 145.825 MHz simplex (this is both the uplink and downlink frequency).

Sending APRS packets via the ISS counts as a two-way contact, and you can get a proper QSL card because of that:

I also made a video of the contact. I used the radio's internal APRS capabilities.

Two-way voice contacts (proper QSO-s)

Sometimes, rarely, it is possible to make real, two-way contact with someone onboard the ISS.

Astronauts of course have a LOT of work to be done on the station, and they have very little free time. As far as I know, the last astronaut to operate the HAM station on board was Colonel Doug Wheelock in 2010. KF7ETX David's dedication were rewarded with great luck as he managed to make a QSO with Col. Wheelock. See:

Usually during such contacts the uplink and downlink frequency is different to prevent interference. In this case uplink was 144.490 MHz and downlink was the usual 145.800 MHz.

Good luck chasing the 'station and 73!

Building WSJT-X 1.7.0-rc1 from sources on Ubuntu 16.04.1 Xenial

2016-10-27T04:10:00.004-07:00

WSJT-X is a great desktop application for communicating via radio using various weak signal modes. I use it on the HF bands to contact amateur radio operators literally all over the world.

The new 1.7.0-rc1 alpha release features a new algorithm for decoding, and it is quite a bit more effective than the previous 1.6.x versions. One of the most important parts is decoding Reed-Solomon codes. The previous versions used a proprietary decoder, this one uses a new, open-source method.

Since I work a lot in JT65 and JT9 on short wave, building the new version definitely worth the struggle.

I've set up an LXC container to start fresh. Repeating the commands below will get you a fresh and crispy wsjtx binary (and others), along with the documentation.

Let's start with creating a new empty directory. I use a directory named "install" to download and build various packages.

$ mkdir install
$ cd install

We'll put everything there. Before we start, let's install packages necessary for building hamlib and wsjt-x. This command will download and install quite a lot of packages. Go grab a coffee.

$ sudo apt install man git subversion ca-certificates build-essential autoconf automake libtool texinfo gfortran asciidoctor libfftw3-dev pkg-config qtbase5-dev qtmultimedia5-dev libqt5serialport5-dev asciidoc

Install a recent version of hamlib from github:

$ git clone https://github.com/N0NB/hamlib.git
$ cd hamlib
$ git branch for-wsjtx
$ git checkout for-wsjtx
$ git reset --hard f778fe1677bffed68d52f04a440e887f249da56b
$ ./autogen.sh
$ ./configure
$ make
$ sudo make install
$ cd ..

Fetch WSJTX 1.7.0rc1 sources from SourceForge:

$ svn co svn://svn.code.sf.net/p/wsjt/wsjt/tags/wsjtx-1.7.0-rc1

Create a directory for the release and build the software:

$ cd wsjtx-1.7.0-rc1
$ mkdir release
$ cd release
$ cmake ..
$ sudo cmake --build . --target install

The binaries will be installed into /usr/local/bin. This is normally in the $PATH environment variable, so wsjtx can be launched with:

$ wsjtx

Building a radiation detector from scratch 3 - a stabilized detector

2016-10-14T07:59:00.001-07:00

In this chapter I'm going to show you how to build an atmospheric ion chamber for detecting radiation. The detector will have little drift, can be set to zero, and can be used to compare the radioactivity of samples, measure weak sources, or follow the decay of short half-life sources for hours or days even.

These features all require that the detector to be stable, and the readings don't drift around with temperature.

Temperature drift is very likely the worst offender to our previous ion chamber design, so we're going to deal with that one this time.

An improved circuit

The next design will deal with the temperature drift by simply duplicating the amplifier, and use only one for detection. The other one will not be connected to anything, and will only provide the zero-point current, and temperature-dependent error.

We will simply subtract the output of the duplicated circuit from the detector circuit to give us a reading without the baseline current and the temperature error.

Since no one will be able to build two EXACTLY matching amplifiers, we'll provide a way to cancel any small residual error, and match the reading to zero at background radiation levels.

In the previous design we've used a DVM's voltage metering function to read really small currents. This can't be done simply enough with two matching amplifiers, so we're going to use 3 transistors per amplifier, a total of 6 transistors.

The image above is a snapshot from an electronics simulation application called LTSpice. It's freely available from Linear Technology: http://www.linear.com/designtools/software/.

This application helped a lot in improving the circuit, especially the response to changing input (e.g. fluctuations in radiation). It also doubles as a general schematic capture tool.

The circuit is basically a Siamese Darlington on steroids. Q1-Q2-Q3 forms a "Darlington triplet". This triplet connects to the ion chamber probe. Q4-Q5-Q6 forms a reference stage.

Now, there is a catch to Darlington pairs and triplets. Since the hFE of a transistor at very low currents is smaller than at the current it was designed for, simply connecting two transistors won't result in a gain that is exactly the product of the hFE values of the two transistors measured independently.

A three-stage "Darlington" like this will have much less amplification then the hFE specified in the datasheet on the third. Unfortunately the simulator program I've used - called LTSpice - doesn't seem to account for this effect. I've built the circuit using the simulated results nevertheless, since I had no other source of information at that time.

I've tried quite a few variations and the above version gave the best results. I've tried placing the resistors between the emitters and the ground, but it gave a very slow (simulated) response. The circuit described here gave reasonably good step response.

The graph below shows the voltage across a collector resistor while sending a 1pA (picoAmpere) current impulse that is 0.1 seconds long. It can be seen that the circuit is somewhat slow to respond, but considering the ions' flight time in the chamber this actually isn't a problem.

Because the transitors don't amplify all that much at picoampere currents, the magnitude of the response is actually an order of magnitude lower. Also because of this effect the voltage drop on the resistors will be much lower, since the currents through them will be less.

Since this device won't be used to detect single events, a fast response is not absolutely necessary. It would be great to detect single alpha particles from weak sources or single muons from cosmic radiation, and I still believe it's possible given a proper data logger and some kind of display mechanism. Maybe I can conjure up some solution for this in a future article.

Selecting transistors

Just like in the previous design, choose a bipolar NPN transistor type that has a high "hFE" or "beta" parameter, e.g. it will amplify currents pretty much. Think about 400-800. These types of transistor are cheap and popular. I'll use BC547C-s.

What is even more important is that we have to build two very similar amplifiers. To achieve this, we must select 3 pairs of transistors with very similar hFEs.

Do not try to match all 6 transistors. It'll take a LOT of work. Do your best and select 3 pairs. Each pair should be as close to each other as possible, but you don't need to match across pairs. Buy 30 or 50 transistors, so you'll have as many options as possible. Use a DVM with a hFE measurement function and measure your transistors. Pair up the similar ones. Do try and select pairs that are very similar.

Assemble the circuit so the transistors in a pair will occupy the same position in different amplifiers (one in the detector, and one in the dupe).

Without proper matching the baseline error will be huge, and even after zeroing the reading with a trimmer resistor there will be severe temperature drift.

Building the detector

The circuit was built on the non-coppered side of a small piece of PC board. The components were fixed with super glue.

Make sure the glue does not touch any lead as it might conduct slightly, and utterly ruin the readings.


Final assembly, top view

A battery or power supply can be connected to the tin can (+) and the wire connecting the emitters (-). The DVM should be set to millivolts and connected to the terminals on different resistors where they meet the collectors of their respective transistor (on the bottom leads on this picture).

The transistor pairs were glued together so they stay close together, and therefore will exhibit very similar hFE no matter the ambient temperature. Pay attention to the transistor leads, as this arrangement the pairs are not mirrored, but rather rotated. See these photos and try spotting the difference on each side:

Final assembly, right side

Final assembly, left side

The resistors were soldered so they can easily be replaced. After taking some measurements I've decided to replace the resistors to a 10K fixed and a 15K trimmer instead of using 6.8K.

I've also decided to use two 9V batteries in series as this improved both the response magnitude somewhat.

I've simply poked a hole on the bottom of the tin can, and glued the PC board to it. I've soldered a 2.5cm / 1 inch stiff wire protruding from the board inside the chamber.

Most tin cans are coated with an epoxy resin in the inside, so I needed to grind that away with a piece of sandpaper. Don't make the mistake I did and do this before assembly. ;)

The inside of the chamber with the sanded walls and elongated base lead

The chamber must be shielded and grounded to prevent static electricity to interfere with the readings. If you're lucky and can get a really thin aluminium foil, it won't block even alpha radiation completely. According to my experiments the foil I use blocks around 75% of alpha radiation, which fine I'd say, because alpha is very energetic and it's very easy to detect with this sensitive circuit.

You can use hair-thin wires and solder it on the chamber forming a mesh. This will effectively shield static too, but won't prevent dust, humidity and UV light to get into it. All of these can generate false readings.

Turning it on for the first time

Connect a power supply (or battery) and measure the current uptake with a DVM. It should be around 100uA. Anything between 20uA-200mA is possible and acceptable, and this depends on transistor types and also the specimen used.

Double check that the chamber is shielded and is grounded. Let it sit for two minutes.

Measure the voltage between the appropriate leads of the resistors and ground. You should get two nearly identical values, and preferably around half the voltage of your power supply. A voltage drop of a few tenths of volts will provide a good range and decent sensitivity.

If the voltage is too high, use larger resistors, if too small, use smaller ones. If the voltage is too dissimilar (more than a few dozen millivolts) it either means your transistors are not properly paired up, or something conducting got onto one of the leads. This can be flux, fingerprint or glue. You might be able to remove such stains with isopropyl- or ethyl alcohol, acetone or petroleum ether.

Measure the voltage between the appropriate leads of the two resistors. Now you're practically measuring the difference of the voltage on the resistors. This should be reasonably close to zero, and you should be able to set it to near zero with a trimmer.

You'll never get a perfect zero, and the circuit will drift around a bit, but not nearly as much as a simple one would do. A few millivolts of drift is perfectly fine. I got about 3mV drift in the course of.... like... ever. :) I consider this circuit to be rock solid. :)

It's also worth mentioning that although the circuit doesn't drift much, large jumps of a couple (dozen) millivolts can be observed a few times every minute. This can be from Radon decay that is present in the air naturally. This source of error can be cancelled if the chamber is sealed and left alone so most of the Radon decays away.

Even after such sealing and "ageing" the chamber might be a bit "jumpy" and large spikes might be reported by the DVM. This can be because of gamma or beta rays are hitting the transistor junctions, or because of cosmic background radiation. Muons coming from above can easily hit the camber and cause a nice trail of ions causing these spikes.

It'd be interesting to build several chambers plus a coincidence detector, and having a very own cosmic ray observatory. Truly fascinating!

Measuring samples

This detector provides ample sensitivity. It can detect anything from my small collections of radioactive materials. Thoriated tungsten welding rods, thoriated lantern mantles give very nice readings even through aluminium foil.

Alpha sources are more easily measured through a thin wire mesh with large holes. Alpha particles cannot penetrate more than a few centimetres / inches deep into air, and almost impossible for them to go through any solids or even liquid films. A thin aluminium foil is almost impenetrable for them.

The ion chamber is the most sensitive to alpha and muons. Beta rays also leave thin traces of ions after them, so they can be detected too, but most of their energy will be absorbed outside of the camber, or in the chamber's walls, and not in the air. This energy is lost and invisible for our chamber. Gamma rays hurtle right through the chamber leaving almost no sign of them. They are hard to detect, and only large fluxes of gamma photons can be detected.

My entire collections of materials packed closely together and inside a thick plastic box causes about 10mV of deflection. (Alpha is utterly blocked, a few beta electrons might escape, but most of the gamma radiation is glowing right through the walls of the plastic box.)

Weak samples can be placed inside the chamber which is sealed afterwards. Readings can be monitored over a length of time and an average reading may be calculated.

Be careful not to contaminate the chamber, and clean it thoroughly after placing anything inside.

Future improvements

The chamber could be supplied with high voltage to gather ions more efficiently. Proper data logger circuit may be added. Several chambers could be paralleled and the outputs could be fed into a coincidence detector to build a muon observatory. :)

We'll be back at these soon.

CUDA accelerated linear algebra with Python and Theano

2016-07-11T22:35:00.004-07:00

Theano is a Python module that enables one to construct mathematical expressions with matrices and/or tensors (basically more than 2 dimensional "matrices").

These expressions are than can be evaluated using Python, but Theano can translate the expression into a C program and compile it to binary. This way it can achieve respectable performance.

But wait, there's more! Theano can build the program so certain - or all - parts of it run on a GPU. Yes, on your video card. Modern cards can do calculations in a way that makes them especially fit for doing linear algebra and similar operations. In "similar" I mean the execution of simple operation on lots of data in parallel. A GPU-s can be several (tens or hundreds of) times better, than your CPU.

I'm going to show you how to exploit an NVIDIA GPU, using Python.
Dependencies

On Linux - Ubuntu Xenial (16.04 LTS) - every good story starts with an "apt-get install". Our case is no exception.

First, we'll need to add so:

$ echo "deb http://developer.download.nvidia.com/compute/cuda/repos/ubuntu1504/x86_64 /" > sudo tee /etc/apt/sources.list.d/cuda.list

$ sudo apt-get update

$ sudo apt-get install python-numpy python-scipy python-dev python-pip python-nose g++ libopenblas-dev nvidia-cuda-toolkit gfortran

It's a good idea to install the cuDNN libraries. These are routines for neural networks, but might be useful in other applications too. Theano can check cuDNN availability and make good use of it.

To get cuDNN, you need to register at NVIDIA, and download the package manually. Both can be done here: https://developer.nvidia.com/cudnn.

If all is well, you can download two .deb file for libcudnn5 and libcudnn5-dev (libcudnn5_5.0.5-1+cuda7.5_amd64.deb and libcudnn5-dev_5.0.5-1+cuda7.5_amd64.deb in my case, and at this time). Let's install them:

$ sudo dpkg -i libcudnn5_5.0.5-1+cuda7.5_amd64.deb

$ sudo dpkg -i libcudnn5-dev_5.0.5-1+cuda7.5_amd64.deb

You need to reboot your system to proceed. This is needed so the CUDA enabled drivers and libs will be loaded.

Let's install Theano itself:

$ sudo pip -I install Theano

I've included the -I option so if you already have Theano installed for some reason, it will be re-installed. Internet wisdom says this might solve compilation problems.

Theano depends on numpy and scipy, these will be installed to. To not clutter your system packages all up, it's recommended to use a virtualenv. I might, or might not re-write this tutorial to do that.

On Ubuntu 15.04, this concludes the installation. However on 16.04, Theano must be patched.

Edit /usr/local/lib/python2.7/dist-packages/theano/sandbox/cuda/nvcc_compiler.py and add the following line between os.chdir(location) and p = subprocess.Popen(cmd, ...), near line 360:

cmd.append('-D_FORCE_INLINES')

This magic is needed for Theano to work with gcc 5.4.0, the default gcc on Ubuntu 16.04. Without this, you'll get an error message:

...
/usr/include/string.h:652:42: error: ‘memcpy’ was not declared in this scope
...

shortly after that Theano will tell you that "CUDA is installed, but device gpu is not available". Bummer.

Well, this monkey-patch fixes this. Just make sure you re-patch Theano if you upgrade it and get the error message again.

Great, you can run the test program at http://deeplearning.net/software/theano/tutorial/using_gpu.html to see if Theano can really use the gpu:

$ THEANO_FLAGS=mode=FAST_RUN,device=gpu,floatX=float32 python theanotest.py
Using gpu device 0: GeForce GTX 970 (CNMeM is disabled, cuDNN 5005)
[GpuElemwise{exp,no_inplace}(<CudaNdarrayType(float32, vector)>), HostFromGpu(GpuElemwise{exp,no_inplace}.0)]
Looping 1000 times took 0.531866 seconds
Result is [ 1.23178029 1.61879349 1.52278066 ..., 2.20771813 2.29967761
1.62323296]
Used the gpu

Great!

Happy computing!

Building a radiation detector from scratch 2 - the first detector

2016-07-11T17:16:00.000-07:00

The detector will consist of a tin can, two transistors and a DVM. I will also show how to improve and stabilize the basic circuit by adding an extra transistor and an identical "dummy" circuit to cancel leakage current and temperature dependence.
The circuit will be same as the one on this video: https://www.youtube.com/watch?v=96bybrgI6V0

An ion chamber is "just" a very sensitive resistance (or conductivity) measuring device. It measures the resistance of a gas, in our case, thin air.

Air is not a terribly conductive medium. It's resistance is somewhere between 1.3*10¹⁶ and 3.3*10¹⁶ ohmmeter (http://chemistry.about.com/od/moleculescompounds/a/Table-Of-Electrical-Resistivity-And-Conductivity.htm). 10 volts only causes 8*10^-14 to 3*10^-14 Amperes, or 8-3 femtoamperes. We need a device capable of measuring such tiny currents. Actually since we'd like to detect radiation, we will be dealing with much higher currents, in the picoampere (10^-12) range. With hot samples, currents can go up to tens of nanoamperes (n*10*10^-9). (http://www.teralab.co.uk/Experiments/Ion_Chamber/Ion_Chamber_Page1.htm)

A DVM, or digital voltmeter can measure milliamperes, or 10^-3 Amperes. We still need to get about nine orders of magnitude gain.

We can quicly get an other 4 by cheating, and using the DVM's voltage measuring function to actually measure tiny currents. The DVM has an intrinsic internal resistance, often 10 megaohms, or 10⁷ ohms. If the DVM can read millivolts, that means it will read 1mV if we send 10^-10 amperes through it. That's not terribly smaller than our target of 10^-12 - 10^-15 amperes!

We will jump the gap with two appropriately connected transistors. Meet the Darlington pair.

To build a Darlington pair, connect two transistors together by their collectors, and connect one emitter to the other's base. The circuit with the two connected collectors, the unconnected emitter and the unconnected base leads forms a device similar to a single transistor, only it's DC current amplification constant equals to the two transistors' same parameters multiplied.

To build a good Darlington pair for our purposes, I recommend using cheap, generic bipolar transistors with large "hfe" or "beta" values. The BC548-C device for example is a good candidate, it has a hfe around 800. That is, a typical BC548-C transitor will allow a collector-emitter current 800 times larger to pass if you sent a small base-emitter current through it. Connect two of these puppies into a Darlington chain, and you get 160000, or one hundred and sixty thousand times the current out than the current in.

Fix the darlington pair near the hole on the bottom of a tin can. Solder the collectors to the can. Pass the base lead through the hole. You probably want to solder a stiff wire a few cm long to the base lead. It should be at the center of the tin can NOT TOUCHING IT AT ANY TIME.

Connect a 9V battery's positive lead to the can. Between the emitter lead and the negative battery lead should go your DVM in voltage measurement mode.

There you go, your first ion chamber!

After hooking it up, wait a minute or so to let the DVM reading to stabilize. Carefully move a piece of radioactive material near the tin can. The radioactive radiation will generate ions that will be driven towards either the can or the naked Darlington base. A tiny-tiny current will flow. The transistors will amplify this current, which will pass through the DVM. The high internal resistance of the DVM will cause the still-small current to cause a significant voltage value to be shown. Congratulations, you've just measured some radiation. Cool.

Could it be even better?

You could connect 2-3 9V batteries in series, to improve performance. You should get sharper, larger readings with more batteries. Don't add too much though, after maybe 5 batteries you risk shocking yourself!

Drift and zero reading

Transistors are nasty devices. First of all, they are not linear, meaning they do much more than just multiply their base current with a constant (although that's a good approximation for a number of applications). Their parameters are also dependent on their temperature. This simple detector doubles as a temperature sensor. Which is not good at all. Readings will jump around wildly even if you touch the them for a few seconds. Background radiation levels are also hard to establish and/or compare, since the transistors have a significant leakage current, they allow a small current to flow across their C-E leads even if no current is flowing across their base.

The leakage current AND the temperature dependence be greatly decreased by comparing the current in two identical circuits, one that measures radiation, the other measures nothing. When the temperature changes, the current also changes in both circuits in a very similar manner. The only difference is the radiation, and that can be measured by subtracting the leakage current.

The next circuit will work like this:

Build two very similar amplifiers, that are thermally coupled, meaning they are very close together, connected by a medium that can transfer heat well, so their temperature will hardly differ.
Make the current from each amplifier flow through a pair of identical resistors connected to ground.
Measure the voltage between the hot (non-grounded) lead of the resistors
The change in difference will be proportional to the change of conductivity in the ion chamber.

The resulting circuit will be surprisingly stable. With an extra part or two, it can also be zeroed, e.g. it can be adjusted to show (near) zero at normal background. It is sensitive enough to be used with fairly low activity samples such as thoriated tungsten welding rods, rainwater, and to absolutely go nuts around decent specimens of radioactive minerals.

This device can be used to indicate and compare the radioactivity of samples, and even to determine the type of radiation emitted by the them. Specimen with very short half lives can be monitored to see the exponential drop in activity (like in the case of fresh rainwater).

It is perfect for home and school experiments, or for hunting radioactive material.

Unfortunately the device is probably pretty much non-linear, and most likely would take a complicated extra microcontroller circuit that would correct this error, and a lengthy process of calibration. However, once done, the controller could take the temperature, humidity and driving voltage into account to make a decent guess and bring this device close to a real radiation measurement equipment.

There are other places of improvement too. The voltage applied across the ion chamber could be raised by more than an order of magnitude, to something like 400V. This change would make the device faster (respond with sharper pulses), and more sensitive, since ions and their electrons would be ripped apart by the large voltage potential, preventing recombination, and pulling them quickly to the electrode and the wall.

I'm considering to do both changes to the current design.

Musings on the ADIF file format

2016-07-11T16:51:00.002-07:00

The ADIF file format is a simple way to store and organize Amateur Radio (or HAM Radio) log data.

It is a way amateur radio operators most frequently store their records on the contacts they make (probably after paper).

(For fellow HAMs, my callsign is HA5FTL, see my QRZ.com page for a description of my station and yours truly, and for contact information, if you wish to express your objections and/or approval of one or several of my points made in this article.)

I have to start with a disclaimer.

The Disclaimer

I have to premise that I respect the devotion and countless hours of hard and precise work of my fellow Amateur Radio Operators who proposed, devised, specified the ADIF ADI/ADX file formats, and kept it up-to date throughout all these years.

I do not claim their work was in vain, or it's product is really unsuitable or too inferior to be the practical and enabling tool it really is.

It's just that it could be much better.

So let me try to explain my expressed anger with this file format, and do not take it personally when I bash on it.

Because I will do that a lot.

Why do that? - An introdction for non-HAM programmers

Amateur radio operators are folks who are allowed to operate radio transceiver (a device that can receive AND transmit radio frequency energy) manufactured, modified (or hacked together) for being used by them for a great variety of purposes in greatly many ways. Not only that, they are allowed to operate transceivers that cover a HUGE portion of the whole radio frequency spectrum, down from a few kilohertz (well into audio frequencies), up to "daylight", very, very high frequencies.

The exact frequency bands, usable power, and the way radio waves are transmitted all subject to strict regulations. (They cannot jam television or broadcast radio, military & defence frequencies, etc. basically common sense most of the time). An Amateur Radio License is special, because it grants freedom no commercial radio license can ever do.

With great power comes big screw-up potential. HAMs have to pass not-so-easy exams to be given licenses, but after that - depending on the type of license - they can even build their own radio receiver and/or transmitter (which can be surprisingly simple, (or mindbogglingly difficult) so it's a great source of fun & experience). The radio frequency spectrum is a very tight resource, a lot of player hope to grab a slice. Governments sell these slices at high prices, except for hams: they are given a lot of frequencies essentially free. All they have to do is to tightly follow the rules and cooperate with the authorities if requested.

For governments to be able to monitor their HAMs (the operators) closely, all of them are REQUIRED to keep a record of all contacts (sometimes even attempts) made. The details vary across countries and circumstances these contacts are made or attempted.

Operating an amateur radio station is a wonderfully diverse hobby, with wonderfully and surprisingly diverse bunch of people participating in it. From bright-eyed young girls to old, fat granddads, from musicians to royalties a lot of people can and do find pleasure in random or scheduled radio contacts, special events, local contests, or world championships.
They buy, modify or build their equipment, and talk, use Morse code, or digital data transfer modes, operate a network parallel to the Internet, make contacts to and through the International Space Station, they use repeaters, satellites, stratospheric balloons, or using the ionosphere or even the Moon to bounce their signals toward lands beyond horizon.

To engage in the hobby is truly an amazing experience. It is like having several interesting hobbies all at once. One can always find new, interesting things to achieve or discover.

All these different people with different fields of interests are required to keep records of their contacts, and there would be several good reasons to do so even if we wouldn't care about the regulations. (We do, and very much so.)

To make a lot of contacts, or interesting ones is a great achievement. Prestigious awards are given for those making contacts with the greatest number of operators in distant countries, on islands, or summits, or contacting a station operating only for a short period of time.

Such awards can be applied to by handing in the records of the contacts: the logs, and most often by providing additional piece of evidence (such as logs, or written notes (QSL cards) from the contacted stations).

So what is this ADIF thing?

ADIF stands for Amateur Interface Data Format, and it's the most popular way to send and often to store data related to HAM activities. In practice this is almost always means information about contacts made.

The Good...

- ADIF is a standard.

When it comes to exchanging data, any kind of standard is better than chaos.(NASA was taught a very expensive lesson on this).

- ADIF is plain text

You can open it in your favourite editor, and make changes. This IS a huge plus, as no program is perfect - you can resort to a generic editor should you ever need a transformation your logger application does not support. "Use text, because that is an universal interface" - stands in the UNIX philosophy, for very good reasons.

ADIF is really just a collection of records that describe something, most often contacts between stations. A records has fields, that has a name and MIGHT contain data. A record is basically a set of key-value pairs. Pretty nice.

For me the list pretty much ends here.

...the Bad and the Ugly

I'm going to try to explain this in a way that I hope brings my seemingly subjective spleen-spitting closer to be an objective and constructive criticism, and hopefully point towards a good solution. I'm going to be passionate, because this is what really drives me at this hour.
To be able to explain why am I so overpoweringly repelled by the standard, I'll try to assemble a short list of facts and explanations.

- ADIF is a standard. And it is a BAD one.

This sounds like circular reasoning: it's bad, because it's bad AND popular. I'll nevertheless keep this point, because popularity makes it even worse from a perspective I'm going to take later. For now, let's state that if something is bad, and it's the only path ahead, you have to stumble your way along it.

The solution for this is either to vastly improve the current one, or to throw it out all together, and use a different format - probably an application of a general format - that is accepted widely. Yes, I'm thinking about XML. No, the ADX file format described in the ADIF specification is not XML. (More on that later.)

- ADIF steals programmer's time

That's that particular perspective I'll take now. From this it can be seen how an insufferable piece of software or standard can turn into something objectively ineffective.

ADIF is BAD, because working with the format is a huge cognitive load. It's hard. It has quite a few catches, AND lacks widespread, well-tested programming libraries that work out-of-the-box. If you're writing a program that eats or spits ADIF, you're likely to be forced to roll your own parser or serializer.

Yes, there are a few github projects, there are blogs on how to parse the darn thing in PHP and other languages, but it's not at all "import antigravity". Dealing with ADIF takes time, and it takes a lot of time.

If one uses their time to write ADIF juggling routines, they will have less time and energy for being creative, developing useful features or good-looking and convenient user interfaces.

It also prevents small "fire-and-forget" projects to pop up, because no one can write an ADIF library in 30 minutes, and call it anything near complete or usable (trust me, you can't). There are no "do one thing, do it well" (another piece of UNIX philosophy) tools and shims with narrow, well-defined and tested functionality.

The reason for it is the same as for there are no good libraries really. And this reason is the following:

- ADIF is Hard and Rigid

This is the reason why there's little to no available "canned" software for dealing with ADIF.

The ADIF specification is a big document. Fully understanding and implementing it takes a lot of time. It defines a plethora of data types. A lot of these data types are enumerations, a list of possible values (words) that can stand at particular places in the file or data stream.

One problem with this, is that it rigidly fixes a number of things that should be flexible. A program or library closely adhering to the ADIF specification SHOULD NOT accept a piece of data if it doesn't fit into the given data type.

Let's take the MODE field for example.

Of course any general program or library MUST follow the standard, or it's loses the meaning for it's existence. This means the proper validation of fields, and forbidding any new modes, contests, countries or subdivisions, bands, propagation modes, QSL mediums, etc. to be quickly adopted by the community.

If you develop a new digital mode for example, there will be a significant resistance before it's acceptance, because any existing logger software will decline log entries with your new digital mode. It is currently impossible to properly log contacts in the FSQ digital data transmission mode, because FSQ is not a member of the Mode Enumeration, and log entries containing "FSQ" in the MODE field are declined by logger programs and sites. This clearly holds back FSQ and similarly new digital modes from being accepted. The less stations you can work in a mode, the less likely you'll use it. The less likely a mode is being chosen a station, it's even less likely to be chosen by others. The highly non-linear dynamics of the spread of information transfer modes greatly magnify any such resistance.

One solution could be to modify the standard to be more easily extended and state this possibility clearly in the specification. The effort required to write a log entry with a new digital mode is by no means prohibitive, but others HAVE TO accept it, for even such a small effort as writing a log with a text editor have any point at all. Unfortunately the standard does not make possible, or at least doesn't allow it explicitly.

One could say that ADIF parser implementations are to blame, and we could try to convince the ARRL to make LoTW more flexible and please accept records about contacts in FSQ (or other new) modes. But sadly the ARRL and LoTW (for example) NEEDS the precise mode information to be able to give credits for contacts, because the credits are give per mode: you can get a credit for a phone (SSB, FM, AM), CW (Morse code) or digital contact.

How unfortunate that ADIF lacks tags that describe general facts about contacts, such as whether the contact was made using analogue or digital mode, whether it used frequency shift keying/modulation, amplitude/on-off keying/modulation, phase shift tinkering, uses single or multiple tones, whether the carrier is partially or entirely suppressed, etc.

I bet a sufficiently general and widely usable set of fields could be added so they are in practice (or even in theory) can completely describe a transmission mode, and the current mode enumeration would be a mere convenience, a collection of abbreviations for a set of mode-descriptions with a tendency to appear together in practice.

In case of DXCC entries, countries, subdivisions, SOTA summits, IOTA islands and similar list of possible values there are authoritative information sources, such as the ARRL's DXCC entity list, numeric-, alpha-2 and -3 ISO-3166 country codes, and the official sites of the SOTA and IOTA awards. Such lists take up most space in the ADIF specification. They shouldn’t. There are places these pieces of data should and do live. Mirroring these and similar list are causing inconsistency between the specification, the authoritative sources and/or reality, and keeps new members of these lists from finding their way into the logs of fellow HAMs.

In my opinion, these lists should be replaced with references to the authoritative information sources and the possibility to add new members should be left wide open. Where necessary, a set of fields with describing general properties should be added, to allow passing information (not just data) on yet-unknown situations.

To cut it short: ADIF is hard, because including the enumerations are tiresome and error-prone, and it is rigid be cause these MUST be included by any serious implementation.

The lack of general, widely used libraries seem to be a piece of empirical evidence for this.

- ADIF doesn't natively support Unicode (or at least does it poorly)

The ADIF format is really two file formats. An older "ADI" file format and a newer "ADX". The former was probably modelled after HTML, the latter after XML (ADX IS NOT XML despite what the documentation says, more on the actual formats later).

The spirit of ADI is to use only ASCII characters. I don't know if there are any programs actually enforcing this constraint, but there can be, and if you want a truly compatible implementation, you better not include characters with codes above 127. You have to stuck with the English alphabet, numbers and punctuations/control characters.

Of course the specification can be interpreted so that any byte is allowed as data. (Restricting tag names to be ASCII strings is actually a good idea given ADI is a textual format. It is the most compatible encoding, usable with text presentation/editor software using ASCII, Latin-X, UTF-8 and several other encoding methods / code tables.) If any byte-string of proper length would be EXPLICITLY allowed after tags in ADI files, this problem would quickly be resolved.

Adding an <ENCODING:5>UTF-8 or similar tag into the ADI header would make this misery disappear at once. Encoding is only important, if data is displayed. For data transfer, it's enough to know how many bytes you have, and what are those bytes. An ADI tag contains every piece of information for ADIF to be a truly flexible, binary Amateur Data Interchange Format.

Because of the the *_INTL fields for storing UTF-8 encoded versions of a few fields are unnecessary (and are strangely only part of the ADX file format, which really wants to be XML, and says it's UTF-8 encoded).

Actually there is a recent data transfer and storage format that bears uncanny resemblance to ADI. It is BSON, the "Binary JSON", an efficient binary data format used by the popular document-orinted database server MongoDB. BSON's "records" are called "documents". Documents have fields, with name, data type, and actual data. The length of the data is also stored in fields (but since it is entirely binary, and has no delimiters at all, the length of the field name is also stored, and no extra data is allowed between fields or documents). So this name-length-type-data based principle can work well in practice, "only" the details have to be gotten right.

Of course one can say that the standard is old, however UTF-8 is a great way to store Unicode text data backwardly compatible with ASCII. UTF-8 is ASCII, if you stick to code points below 128.

- ADIF stores the type of the data explicitly

Now it's time to dwell into the actual file format. A piece of data stored in ADIF looks like this:

<TAG:11:S>actual data blah blah lots of extra
characters that do not matter.

There is some metadata (data about data) and the actual data. Metadata is most often stored in tags, similarly to HTML. Except there are no closing tags. This is probably to save space, as most data is represented as short strings comparable to the length of text describing the tag containing the metadata. Since the length of the data is known beforehand it need not to be interpreted in any way, and no escaping or similar trick is necessary. (As it was said above, ADIF can store opaque binary data.)

Between < and > characters there MUST be a tag name, like CALL if the data is a call sign, or RST_SENT if the data is the signal report sent to the other station. If the tag does not store any actual information, than the name alone suffices. (Such tags are the end of header and end of record tags: <EOH> and <EOR> respectively).

If the tag do contain data, the length of the data represented as an ASCII character string MUST be given after the tag name separated from it by a colon (:) character, and written in decimal ASCII digits, in base ten. After the length there MAY be a character - again, separated by a colon from the length - that denote they type of the data.

This feature makes it simple to quickly write a parser that just "explodes" the ADIF string to (<TAG>, data) pairs. Just look for the first <, then read the tag name, length, and possibly the data type, than expect a >, and read the right number of bytes. Repeat until the end of the file.

Unfortunately ADIF, as described here, cannot be cut up into tokens with the usual way (using regular expressions), as - strictly speaking - the language is not regular. This looks like a minor annoyance not really deserving it's own point. The proof would probably be possible by proving that recognizing this language would be "just as hard" as recognizing the language containing only words with the same number of a-s and b-s in it. The problem with this is that it is impossible to write a fast tokenizer with the popular tools (like lex of flex) that can provide high-level tokens and therefore require little logic above the token level.

Low-level tokens like the <, :, and > characters, length, data type and tag names can be recognized, but additional logic is needed to read the number of bytes. This small design feature prevents the use of popular parser generators, and promote DIY implementations that can be buggy and hard to maintain.

Even if you can write this software quickly, you still have to deal with data types.

This data type may be string, if the character is S, a date if it's D, time if it's T, and so on. The standard describes quite a few data types and their format.

This is problem, because for many tags, there is really only one sensible data type to use. Extra effort needed, to validate the type marker.

This sounds like nothing serious, but if you don't work for Microsoft and aim for full interoperability, you HAVE TO do this. In practice, no one really uses the type marker for frequently used, standardized tags. The data type should always be obvious from the field name, and this is how the type of the data is communicated and determined most of the time in practice.

- Awkward data types

There are data types that are non-scalar: lists, even lists of key-value pairs (such as CreditList). Handling these types requires manual labour. A file format describing structured data should provide a small set of simple solution covering every possibility. Tags, and members of an instance of the CreditList both essentially describe key-value pairs. We have two solutions for a single problem. This leads to an unnecessary large lines-of-code metric. Not good.

XML of course could solve this, as it provides the tools for describing nested data.

What can be done?

Every problem I can think of can be efficiently solved by using a format that is a proper, well-designed XML application.

There are good XML parser libraries for any programming language I can think of. These are ready to be used, and are essentially free of bugs (or their bugs and quirks are well-known).

Upon these libraries general-purpose ADIF/XML parsers could be built with relatively little effort. These libraries than can be re-used by application developers, liberating them from the burden of writing their own ADIF parser.

Such generic libraries would work correctly with application- and user-defined fields living in their own namespaces. File format validation could be programmed in a few lines of code given the XML schemas of the core document and the extensions.

Since the effort to actually implement such libraries would be reduced, providing reference implementation and test suits wouldn't be such an outlandish expectation.

ADX is not entirely an XML application.
Okay, but ADX is XML. Or is it? Well, not so much.

For one, namespaces are ignored, killing extensibility. Namespaces are the perfect solution for incorporating application- or user-specific data. The whole <USERDEF ...> and <APP> thing is unnecessary.

A true, extensible, simple XML application would solve all of these problems. ADX is close, let's hope we keep heading that direction.

Folks, let's use XML like it's meant to be. Please.

Ubuntu: running services inside a chroot

2014-07-29T02:57:00.002-07:00

So I got a task the other day: install php 5.5.9 onto a fairly old (10.04.4 LTS "lucid") Ubuntu machine, without breaking the already existing prehistoric php, installed from the default repository.

Also, compiling from the sources was out of question, since my colleges wanted to keep the update procedure as simple as possible.

So how do I install a 14.04.1 LTS "trusty" package onto lucid without wreaking havoc among the packages?

I don't.

First I thought I might install lxc or similar light-weight virtualization solution, but for a single service it seemed to be an overkill. When I saw that the node already using almos all phisical memory alredy, it was completely out of question.

So I decided to build a minimal chroot environment with debootstrap. Unfortunately debootstrap on lucid won't build you a proper (or any) trusty chroot. I tried to install the trusty deboostrap .deb file manually with dpkg, but of course it was unsuccessful. Apparently even the .deb file format is somewhat different between the two versions of the distribution, the package manager never managed to even decompress the archive.

So I just decompressed it on my laptop (running trusty), and compied debootstrap over the lucid machine.

This was a complete waste of time, since the debootstrap script downloaded the .deb files OK, but - guess what - it could not decompress and install them properly. Surprise, surprise.

My last chance was to build the chroot env on my desktop, and use that on the server (fortunately both are amd64).

Here is the debootstrap command I used:

debootstrap --variant=buildd trusty /jails/php559 http://archive.ubuntu.com/ubuntu/

This will download and install a very basic Ubuntu Trusty into /jails/php559.

One quirk is that you need to mount the /proc filesystem before doing any work in the chroot environment:

chroot /jails/php559 mount -t proc proc /proc

After this, you need to install packages like this:

chroot /jails/php559 apt-get install <package name>

As you've probably figured out, running anything inside the new environment can be achieved with

chroot /jails/php559 <command>

Once you installed anything you needed, you can start / stop services inside the new environment. One thing to note is that those services and applications still use the kernel the node booted, so you can't run too exotic stuff inside a chroot, like a very ancient, or too new software, or software compiled for a different architecture. In those cases you need some kind of virtualization.

One problem remains: how to start service inside a chroot?

I wrote a very skinny init script that resides on the host node's /etc/init.d directory and blindly executes service <program name> <command> inside the chroot:

#! /bin/sh
### BEGIN INIT INFO
# Provides:          jail-php559-apache2
# Required-Start:    $remote_fs $syslog
# Required-Stop:     $remote_fs $syslog
# Default-Start:     2 3 4 5
# Default-Stop:      0 1 6
# Short-Description: apache2 running in jail
# Description:       Start / stop the apache2 instance with php 5.5.9
#                    in an appropriate Trusty chroot jail.
### END INIT INFO

# Author: Fabian Tamas Laszlo 

JAIL=/jails/php559
JAILRUN="chroot $JAIL"
SERVICE=apache2

case "$1" in
  start)
 # Ensure that /proc is mounted inside the chroot
 $JAILRUN mount -t proc proc /proc
 $JAILRUN service apache2 start
 ;;
  *)
 # Delegate any command to service command inside jail
 $JAILRUN service $SERVICE $1
 ;;
esac

Don't forget to run

update-rc.d jail-php559-apache2 defaults

after installing the init script.

Building a radiation detector from scratch 1 - the basics.

2014-06-28T08:20:00.002-07:00

Before you begin

If you follow this series of articles and maybe do some research on your own, you'll be able to build a simple radiation detection device and perform some simple, interesting experiments.

I chose to build a simple, but surprisingly effective device called an ionization chamber or ion chamber, which is just a very sensitive current sensor, or resistance measuring device, the difference being only the point of view.

A great source of information on ion chambers is: http://www.techlib.com/science/ion.html

Ion chambers are not Geiger counters, however they work according to the same basic principles, as will later be explained.

You will need to be able use soldering iron, and perform current and voltage measurement. If you don't know how to do these things, it's better to ask someone to show you. Please only perform the tasks written here if you know what you are doing. Soldering requires high temperature. We will work with high voltages at some point, and of course, there will be some radioactivity involved. Any of these things alone can be dangerous, so exercise caution and use common sense. Hurt yourself, blame yourself.

To really build from scratch, one needs to understand the basic principles on which the device operates. I'll describe these basics in two articles, this one will be on nuclear radiation itself, the second will be on basic electronics.

This series of articles should be considered loose notes on these subjects, and will only contain material vital for our purposes that is, building radiation detectors. If you don't understand something, do your research, or ask in the comments.

Bill of materials in advance

I tried to keep the circuits and the overall project as simple as possible, using the most common resources available. Below are an approximate bill of materials. This might change as we go along, but will certainly be enough for the first one or two experiments.

Transistors. I used BC547C types, because of their high gain, and their price, which is next to nothing. You will need 3 in the basic circuit or 6 for the more advanced one, but I recommend you to buy at least 50, since we will need to match a certain parameter of them to be able to get any reasonable stability. You can probably use any NPN bipolar transistors with current gain around 500 in a TO-92 package. We will connect three of these to achieve something around 1 to 2 hundred million current gain.
Resistors. The actual value you might want to use will be between 1 KOhm (kilo ohm) and 100 KOhm (or 1K and 100K). 10K is a safe bet. These are also very cheap, you might want to buy 10 each from the following values: 3.3K, 4.7K, 6.8K, 10K, 15K, 22K, 33K. Ask for through-the-hole, 1/4 or 1/2 watts, metal film types. These are probably the cheapest too. Don't buy SMD (surface mounted) types, unless you really know what are you're up to :)
If you want to build the high precision version, you might also want to buy a trimmer potentiometer, a 10K will probably be all right.
A digital multimeter - Make sure it can measure millivolts too. One with a short circuit indicator will help a lot. Probably this one will be the most expensive item in this list. If you want to build a really cool looking device, buy a 100uA (microAmper) analog current meter too, and use it with a 1K-10K resistor in series, as later will be demonstrated.
A 78L12 integrated circuit device. It should look exactly like the transistors above (it has a TO-92 package), and needed to keep the voltage fairly constant around 12 volts.
Two 1uF (microFarad) capacitors, aluminium electrolytic, 50V, needed for the 78L12voltage stabilizer.
I'm planning to build a high voltage power supply, but I'm not completely sure about the exact design. It probably will be a Dickson charge pump driven by an astable multivibrator.

For that, you'll need two more from the transistors above, they don't even have to be matched.
You'll also need 20 diodes, 1n4148 will be OK.
20 100p (picoFarad) capacitors, at least 50 volts.
one 10nF capacitor, 400V
This version might need a "guard ring", a 1-2cm long coaxial cable will suffice. The exact type of cable doesn't matter much.

A soldering iron. Use low power type suitable for the components above. A 30W iron is a good choice. Choose one with a small, pointy tip (need not to be needle sharp though).
Solder. Use thin, flux core solder.
Empty tins cans, two of the same shape, circular, flat. Try to get ones with 8-12cm diameter. The ones I used contained tuna. When buying the cans, use magnets to check their material. You need cans that stick to the magnet (steel cans). Aluminium cans cannot be soldered (easily), and must be drilled and equipped with screws, washers & pins to create reliable electrical contacts. As long as you can get steel cans, don't bother with aluminium ones.
Some fine sandpaper for sanding the tin cans, since they are likely to be coated with a thin layer of plastic foil or paint. This need to be removed.
A sheet of thin aluminium foil (the thinner the better)
Some electrical tape / duct tape
Some wires
2 9V Batteries
2 Battery clips
Some source of radiation for testing - you might already have some ;)

What is radiation?

The word "radiation" can mean a lot of things depending on the context. It is used in many scientific fields and pseudo-scientific settings. For our purposes the word radiation will mean nuclear radiation, the kind of energy emanation that happens as a consequence of nuclear processes, or those coming from the nuclei of atoms.

We will take a close look the three most common types of nuclear radiation. These three share an important propery: they are all a kind of ionizing radiation.

The makeup of matter

To make any sense of the stuff above, you have to know a thing or two about the structure of matter, about how the everyday stuff you see is built from basic building blocks.

If you've been through elementary school, you probably know that stuff around you and even inside you, is made out of tiny particles, called atoms. Though once thought to be the final, indivisible constitutents matter, atoms too are built from even smaller particles.
First of all, every atom has an outer cloud of electrons loosely jiggling around the small, and very dense nucleus. The nucleus itself can be divided into particles called nucleons of two types: protons and neutrons.

All the chemistry and electronics happens in the outer electron cloud of atoms. Nuclear decay and other nuclear processes such as fusion and fission happens inside the nucleus or between atomic nuclei.

Alchemy FAIL

One very important difference between the chemical and nuclear processes is the amount of energy involved. The electrons are really just very loosely around the nucleus relatively to how densely the latter is packed. This means that the amount of energy in the nucleus is much, much higher than in the electron cloud.

This means that no chemical process can ever influence nuclear processes. This is why alchemists have never succeeded in creating gold by mixing any number of materials together. The energy levels in a beaker are simply much, much too low to possibly be able to influence the nucleus. Given that gold is an element, and has a unique nucleus, it would require changing one kind of nucleus into an other one. Neither ancient alchemy nor modern chemistry had or has any chance doing so.

So it is utterly impossible to influence nuclear events with chemical ones, it is quite possible and natural the other way around. Nuclear processes do effect chemistry and electronics. This is what we will exploit in our nuclear radiation detector.

Electronvolts

OK, we've talked much about energy levels, but never mentioned numbers, just "large" and "small". So, what counts a "large" and "small" energy at the atomic and subatomic level?

First, we need a good unit for measuring energy. The SI unit is joule, which is useful for measuring energies in everyday life. But this unit is too big. A joule is the amount of energy that needed to move something with 1 newton force through a 1 meter path, or the energy you get when a coulomb charge passes through a resistor with 1 volt through it's pins. One would need to accelerate particles to relativistic speeds to achieve such energy levels.

There is a nice, small, non-SI unit of energy that can be used for our purpose. This is the electronvolt, or eV. One eV is very small compared to joule. 1 eV = 1.602177*10^-19 J

One electronvolt is the amount of inertial energy an electron has once it has been accelerated with one volt. If someone would build an electron tube and connect a single volt to it's accelerator plate, the electrons arriving at the plate would have one electronvolt inertial energy.

Visible light comes in packets of energy called photons, with energies between 1.6 eV and 3.4 eV. Red photons have lower, blue photons have higher energy.

When burning hydrogen, the amount energy released is 286 KJ/mol. For one molecule that is 286 / (6*10^-23) = 4.77*10^-22 KJ = 4.77*10^-19 J = 4.77 eV. Such energies are typical in chemistry.

The energy of radiation particles are greater by a factor of hundreds of thousands to millions.

So what nuclear radiation really is?

When an atomic nucleus decay, it turns into an other kind of nucleus. Yes, what alchemy failed to do, happens in nature spontaneously all the time. Unfortunately gold won't form just like that from any cheap stuff, so this fact won't make us magically super rich overnight (or at all). However, the byproduct of nuclear decay is a big burst of energy - nuclear radiation. There are three types of nuclear radiation we will consider here.

Alpha radiation

is a stream of very high speed helium nuclei, consisting of two protons and two neutrons. The inertial energy of these nuclei typically in the ballpark with that of an electron accelerated with a voltage of five million volts, or to put it differently, is around five mega electronvolts or 5 MeVs. This is about two million times higher than the energy of a single photon of visible light (1.6 eV to 3.4 eV).

Alpha radiation can be very harmful if the source gets into the body, but quite harmless otherwise, since alpha particles are stopped by a single sheet of paper, the dead skin cells on your hand, or a few centimetres of air. Alpha particles can only penetrate very thin materials, such as the gold lead in the famous Geiger-Marsden experiment (also called the Rutherford gold foil experiment).

These effects are all because the alpha particles are heavy, charged, and interact quite readily with any kind of ordinary matter, so they deposit their energy into pretty much the first thing they bump into. They can easily rip the electrons from the electron clouds of atoms, and can even fuse with light nuclei (such as aluminium). The products of such fusion are often unstable, and themselves undergo radioactive decay, they can emit neutrons, protons, electrons or gamma rays (see below).

Beta radiation

Beta radiation is a stream of high speed electrons. They are produced together with a so-called anti electron-neutrino (wich is extremely hard to detect), and a proton when a neutron decays.

The typical energy of a gamma particle is around 1MeV, but it can be as low as a few KeVs, or as high as a few tens of MeVs.

Beta rays tend to penetrate deeper into matter than alpha rays, so they are harder to stop. A few millimeters of aluminum, or a centimetre or two of clear plexiglas or polystyrene can stop most of the gamma particles.

One dangerous property of beta radiation is that it can produce secondary brake radiation, or Bremsstrahlung. When a fast electron hits something, it decelerates and some of it's energy is emitted as a high energy photon. This is how X-Ray machines work: they shoot electrons at a metal plate, and the collision produces X-Rays.

The heavier atoms the obstacle the electron hits has the more Bremsstrahlung is produced, so beta radiation is better stopped with materials containing light atoms: aluminium and plastics (carbon, oxygen, nitrogen, hydrogen) are good choices.

As we already mentioned, beta radiation also a kind of ionizing radiation. These high-speed electrons can knock off other electrons from around atomic nuclei, much like alpha radiation, but with much less vigour.

Gamma radiation

Gamma radiation is electromagnetic radiation. It is the same kind of energy than light, or radio waves, only quite a bit more powerful. The typical gamma ray has a few hundred KeVs of energy, but can be much lower or higher than that. This is no way a definition, only a rule of thumb.

Gamma rays interact more reluctantly with matter than alpha or beta rays, but (or because of that) they penetrate deeper. One needs several tens of centimetres of lead to "stop" gamma rays. Even with such shielding, gamma rays cannot be stopped 100%. You can get arbitrarily close to that, but never truly reach it, there’s always a tiny chance of one or two of them slipping through. In practice, a few large lead bricks can protect even highly sensitive equipment from being affected too much by natural background radiation.

Gamma rays can interact with the electrons in atoms by giving them some of their energy probably knocking them off from their orbits. This phenomenon is called Compton scattering or the photoelectric effect.

There are materials that can change their electronic properties when hit by ordinary, visible light because of similar effects. Such materials are used in digital cameras for example.

Other kinds of radiations

There are as many type of radiation as particles - one could even accelerate mercury nuclei and call it "mercury radiation". However, in nature, and for our purposes, these three kind of radiation is all we care about, but there are some other cases that I find worth mentioning.

I already wrote about neutron and proton radiation, but these are relatively rare. Also beta radiation can consist of positrons (anti-electrons), but they will quickly annihilate with a regular electron producing (in the most common case) two photons with an energy of about 511 KeVs.

There are some other exotic particles that do happen in nature, mostly because of cosmic radiation. The most common particle radiation at and around sea levels is the muon. This particle can put out quite a show in spark chamber (look one up at YouTube).

Detecting radiation

All detectors rely on the ionizing nature of nuclear radiation. Our solution will measure the conductivity of air. When air subjected to radiation, some of the air molecules will be stripped of an electron or two. The electrons and the resulting ions will drift away from each other if there's an electric field around.

The conductivity of air is very, very low (it doesn't conduct electricity very well). Even high voltage wires hanging on poles won't cause much current to flow in the air.

However, air can become quite conductive when ionized. Very high voltages can rip electrons off from the air molecules. Such high voltages than accelerate these electrons so quickly than when they encounter an other air molecule, they themselves can knock off more electrons from it. An avalahce effect takes place. This happens during thunderstorms, or most of the time when an arc forms.

Ions can form in the air if there are energetic particles passing through it. If we measure the conductivity of air, we can guess the amount of radiation passing through it.

Ion chamber

We will build a metal chamber, and stick a wire inside it, than apply some voltage between the chamber and the wire probe. If we can measure the tiny current flowing between the probe and the chamber walls, we can see how much ions are in the chambers. Since the currents are in the femto- to picoampere range, we need to amplify the current a few million times.

How can we do that? See the next article.

How does iptables hashlimit module work?

2014-06-10T05:21:00.000-07:00

*UPDATED*

Hashlimit is an iptables module that allows one to define rules that in effect will limit traffic speed (bytes / time unit) or frequency (connections / time unit) per target or origin ports / IPs. The inner workings of this module and / or how to make it work correctly remains a mystery for many.

Hashlimit is also close friends with the limit module, only much more powerful, capable of expressing rate limiting per source IP (for example) in a single rule.

A hashlimit rule doesn't work like the most people I met imagine. The common false picture of iptables hashlimit is that it throttles connections by "slowing down" things somehow. Why would anyone think that? It's just an iptables rule after all: match, or no match.

But hashlimit doesn't work like that at all. Instead of applying friction to the wheels slowing the wehicle down, it just blows the whole wheel up. Instead of "slowing things down", hashlimit drops packets (in the most common use cases).

Well... a haslimit rule actually either mathches a packet or doesn't, just like any other iptables rule. If a hashlimit rule matches a packet, it means that the packet is below (--hashlimit-upto) or above (--hashlimit-above) a certain rate (bytes / timeframe or frequency / timeframe).

You can, of course, create a rule that -j DROP packets that are --hashlimit-above 10/sec effectively prohibiting traffic faster than 10 packets per second.

This means it isn't inherently connection-oriented either. Yes, it can be used together with connection tracking, but it's not mandatory. TCP connection rate can be controlled with hashlimit by simply matching on the SYN flag.

Ok, so hashlimit can only DROP, or ACCEPT a packet, or do the usual iptably things like directing it to a separate chain where it gets logged and then dropped etc. But...

How can this even work?

I mean how can this smoothly limit traffic, not causing connection resets and other weird end-user-deterrent errors?

There seem to be a contradiction between the theory and practice. If hashlimit rules simply just drop packets then why the end user only feel maybe some slowdown? Why there are no error messages saying that the connection was terminated? (Or at least why such thing never happens if the connection limit is higher than a few / min?)

The reason for this is because almost everybody uses TCP. On the web, folks use TCP exclusively. And those who don't, probably do the same thing as TCP:

The TCP protocol sends acknowledgements, so the sending party can keep track of what have arrived and what was probably-lost on the network, and can re-send data if needed.

The typical use-case for hashlimit is to drop the first TCP segment (the one with the SYN flag set) if there were too many in the near past, effectively limiting connection rate. The sending party keeps retransmitting connection requests for a while. The exact timeout and the number of retries are operating system (and configuration) dependent.

An unfortunate consequence of this is if you need to tune your hashlimit tight, you will get dropped packets all the time.

I saw system administrators carefully analysing the syslog, and trying to tune hashlimit rules so no packet drop log entries appear when traffic is "normal", but still trying to hold off simple DOS attacks (like a stuck F5 key).

Needless to say, such efforts are a waste of time. No matter how hard you try, there will be packet drops with hashlimit, and it's OK. TCP can handle that.

Of course if you have to tune limits that tight, you also gonna have a bad time with NAT users too. If your application can't handle at least a few hundred req/s per thread, then in general you gonna have a bad time, and hashlimit won't do much for you. I had an old client once, whose application started coughing at five requests per second. Hashlimit wasn't quite the solution they needed. Such an application will crash an burn as soon as it is introduced to the general public.

Experimenting with hashlimit

*UPDATE* I received a great comment from Tomasz P. Szynalski's. Apparently, I had a mistake in the examples. Quoting what I think is the most important:

The rule "iptables -I INPUT -m hashlimit -m tcp -p tcp --dport 80 --hashlimit-above 20/sec --hashlimit-mode srcip --hashlimit-name http -m state --state NEW -j DROP" is not doing what you think it's doing.

Iptables modules are executed in the order they are given in the rule. Because in the above rule, "hashlimit" comes first, it will process EVERY packet, not just new TCP packets, and every packet will count towards the 20/sec limit.

At the beginning, there is no difference, because all packets are for NEW connections, but later on, if you have an open connection that keeps sending packets, those packets will keep depleting the hashlimit counters. So you may not be able to open a new TCP connection, even though the hashlimit rate says you should be able to.

Instead, you want to FIRST check if a packet is TCP, then check if it's for a NEW connection, and only then send it to hashlimit to be counted.

Thank you sir, I stand corrected. On with the examples:

Let's take a look at the documentation: http://ipset.netfilter.org/iptables-extensions.man.html#lbAW and set up a hashlimit rule.

For example, we can create a rule that will DROP packets that are NEW, coming to the http server (at port 80), and are coming too fast. In this case too fast means 20 in every second. We will enforce this limit by source IP address (hence the --hashlimit-mode srcip). The hash table name will be "http", so we can use this very hash table in other rules too if we need.

iptables -I INPUT -m tcp -p tcp --dport 80 -m state --state NEW -m hashlimit --hashlimit-above 20/sec --hashlimit-mode srcip --hashlimit-name http -j DROP

Verify that the rule is active by issuing iptables -L.

Chain INPUT (policy ACCEPT)
target     prot opt source               destination         
DROP       tcp  --  anywhere             anywhere             tcp dpt:http state NEW limit: above 20/sec burst 5 mode srcip


Chain FORWARD (policy ACCEPT)
target     prot opt source               destination         

Chain OUTPUT (policy ACCEPT)
target     prot opt source               destination

(Yes, it is. Also, please note that this will limit outbound traffic too, since SYN,ACK response packets are considered NEW.)

Burst?

As you can see, there is a "burst 5" note in the output above. This is the default "burst" value for every hashlimit rule. This has things to do with how the whole limiting stuff is implemented.

When a packet comes in and hits the hashlimit rule, and it also matches all the other criteria for that rule, the hashlimit engine kicks in, and tries to find an entry in a hash table for that packet. In our case, it will hash the source IP address, and store a counter at that hash value, so our rule will maintain a counter for every source IP address.

That counter decreased upon each hit. If the arrival of a packet tries to push the counter below zero, it means that we have just hit the limit. In our case it means that the packet must be dropped. If there isn't any counter stored for the current hash, one will be created and initialized to 1, so the fresh packet will be granted access, and the counter will be set to zero immediately.

The counter is also incremented in certain intervals, and the interval depends on the limit. In our case, the limit is 20/sec, so the counters are incremented by one in every 1/20th of a second, until it reaches the burst value. A counter can never be greater than the burst value.

Actually this description simplifies things a bit, but not too much. The truth is somewhat uglier than that. See the Linux kernel itself for the gory details: http://lxr.free-electrons.com/source/net/netfilter/xt_hashlimit.c#L382

Testing with ab

Let's run ab (Apache Benchmark) to see what we've achieved. You need to have an apache server running locally. The default installation and the default "It works!" page will do just fine.

ab -c 1 -n 10 http://localhost/

This is ApacheBench, Version 2.3 <$Revision: 1430300 $>
Copyright 1996 Adam Twiss, Zeus Technology Ltd, http://www.zeustech.net/
Licensed to The Apache Software Foundation, http://www.apache.org/

Benchmarking localhost (be patient).....done


Server Software:        Apache/2.4.6
Server Hostname:        localhost
Server Port:            80

Document Path:          /
Document Length:        11 bytes

Concurrency Level:      1
Time taken for tests:   9.001 seconds
Complete requests:      10
Failed requests:        0
Write errors:           0
Total transferred:      2550 bytes
HTML transferred:       110 bytes
Requests per second:    1.11 [#/sec] (mean)
Time per request:       900.064 [ms] (mean)
Time per request:       900.064 [ms] (mean, across all concurrent requests)
Transfer rate:          0.28 [Kbytes/sec] received

Connection Times (ms)
              min  mean[+/-sd] median   max
Connect:        0  899 315.9    999     999
Processing:     1    1   0.2      1       1
Waiting:        0    1   0.2      0       1
Total:          1  900 315.8   1000    1000
ERROR: The median and mean for the waiting time are more than twice the standard
       deviation apart. These results are NOT reliable.

Percentage of the requests served within a certain time (ms)
  50%   1000
  66%   1000
  75%   1000
  80%   1000
  90%   1000
  95%   1000
  98%   1000
  99%   1000
 100%   1000 (longest request)

But WHY??? 1.11 requests / second is totally not 20 requests / second. What isn't working properly?

TCP. And it's working just fine.

One can try and attack the local web server on multiple threads, hoping that the effect of the TCP retransmission timeout will disappear once there are many parallel connections. Let's try this one:

ab -c 10 -n 10 http://localhost/

So all ten requests are made at once. Still, we see only 5 requests per second. How can that be?

This is because ab will try to make requests on all threads at once, and there's no chance for the counter to be incremented by even one. As soon as you issue the command, almost all SYN packet will be dropped, and the threads will wait for their chance to retransmit. When they get their chance, the counter will be at the the burst value, and when the threads send their retransmissions, <burst> number of threads will be successful.

One way to circumvent this is to either reduce the retransmission timeout, or wait a little between each requests. Unfortunately, retransmission behaviour currently can't be set per-socket under Linux. There is a patch for that however, so this feature might make it's way into the mainline code later.

Also please note that if we turn "keepalive" on, then nothing will hold ab back:

ab -c 1 -n 10 -k http://localhost/

I got a nice 4500 reuqests / second with this. Fortunately DOSers rarely use keepalive, and it can also be turned off at the server side. Of course one should carefully consider (and measure) the effect of keepalive settings, it can have a quite heavy impact.

Testing with curl

The following piece of bash script can be used to experiment with hashlimit rules, and their effect on the traffic:

while true; do curl http://localhost/ &> /dev/null; echo -n "#"; sleep 0.05; done

The 0.05 seconds sleep will correspond to 20 reqs/s (or a little less, because curl needs some time to run too). If you try to decrease this, eventually you will see that the flow of # characters is being interrupted for a short period of time every now and then.

If you sleep to little, the second SYN packet will be dropped, and curl will have to wait to retransmit, limiting the connection rate down to 1/<retransmission timeout> reqs / sec in case of a single thread. This causes the above script to print one # character in every second.

Conclusion

TCP is a very sturdy protocol and can be configured to fit many environment. You can use it between two processes on the same core, or you can put a device to an orbit around planet Mars and expect a connection to stay alive despite the 6 - 42 minutes "ping" and probably heavy packet loss.

Even the desktop-default TCP stack can deal with the packet loss caused by hashlimit rules, and this is the way hashlimit rules supposed to do their work, so tuning for no packet drops is an unnecessary and futile effort.

Despite the fact both TCP and hashlimit rules operate along quite simple principles, their interaction can produce fairly complex behaviour. The behaviour of a system with very simple rules can be arbitrarily complex. Just think about how simple rules define the Mandelbrot set.

Because of this, you should always be aware of how the tools you use actually work. No metaphors or graphic images-in-mind will be able to help you pinpoint the source of a complex (mis)behaviour like an ab test producing 1-5 reqs/s under a hashlimit 20/sec rule. You need to know the system works exactly, nothing else will work.

Firefox and testing concurrent web application

2014-04-20T00:04:00.003-07:00

TL;DR:

firefox won't start a request for an URL if a request is already being executed for that same URL.

If you open http://example.com/foo, and it takes several seconds to complete, when you simultaneously open a new tab and enter the very same URL, the latter request WILL NOT BE SENT until the former has completed.

If you ever need to make parallel requests for the same URL make sure you put some random query string in them, so the browser won't serialize requests for them.

I was testing a concurrent web application written in Haskell, Yesod. I love Haskell and the Yesod framework, this combination gave me the best web development experience so far. I especially loved how easy it was to develop a simple web-based chat application by using TVars and TChans - variables and "channels" - basically message queues - to store state in a Yesod web application and pass messages between threads.

I stumbled into a strange bug, and it took a good two hours of my life.

For those who aren't familiar with Yesod, just imageine a usual MVC-based web application. My problem was to block the execution of a thread executing a certain controller, and continue it when an other thread executing an other controller sends a message down a pipe (a TChan to be more precise). This makes long polling possible.

This is easy to achieve in Yesod, since it is multithreaded by default, and it isn't particularily hard to set up the TChan and use it from the Handlers (controllers).

But when I tried to test the application, the strangest thing would happen.

I opened up a browser (Firefox) and entered the URL that should been blocked, and it did: the loading indicator spun indefinitely until the fairly long timeout has been expired.

When I simultaneously opened the other URL for the message sending controller, the receiver URL loading completed immediately, and the message appeared on the screen.

So far, so good.

So let's open two receivers, and thest the broadcast capabilities of TChans!

I opened up two receiver tabs, both started to load. Then I opened up a sender, and ONLY ONE RECEIVER got the message.

I spent about two hours debugging, including digging into the Yesod codebase trying to figure out what was going on.

Finally I realized that the two receiver executions are completely serialized, and this could not happen inside Yesod: testing with curl and ab showed that calls to the same Handler are correctly being served in parallel.

Then it hit me. What if Firefox blocks the execution of requests for the SAME URL for - I don't know - preventing simple (intentional or unintentional) DOS scripts from running.

Aaaand I was right. Adding a "?asdasd=asdasd" after one of the receiver URLs fixed the problem, the requests was made parallel and the broadcast feature worked like a charm.

How to setup a raspberry PI as an IPv6 router with a SIXXS tunnel

2014-04-08T14:52:00.004-07:00

IPv6?

Internet Protocol version 6 is a network protocol that will soon replace the current Internet Protocol version 4 - the protocol that runs the Internet, and has those familiar four-number addresses like

183.43.221.13

With IPv6 are coming a plethora of changes, probably the most end-user-alarming one will be the change in the address format demonstrated above.

The four-times-one-byte address will be replaced by a eight-times-two-bytes address that looks like some fancy password from a bad computer movie.

The new addresses will take the form of

4367:9987:a01b:0000:0000:0007:cafe:babe

According to the simplification rules of IPv6 addresses, this can be written as

4367:9987:a01b::7:cafe:babe

Since with a double colon you can jump through all of the all-zero address parts, and leading zeros can also be omitted.

What will the Romans ever do for us?

We did not only sucked almost all the oil out of the Earth's crust, but we really almost used up all of the IPv4 addresses.

Now look around you and take a mental note of the feeling of right now, because when your grandchildren will ask you how it felt when all of the IPv4 addresses were used up, you will have to describe this exact feeling.

Our oil supplies will be enough for quite a few decades, but out IPv4 addresses won't last an other one. Our pool of IPv4 addresses is quickly running dry.

IPv6 offers us a mindboggling 2¹²⁸ addresses. That's right, since the original IPv4 address pool size were a mere 2⁶⁴, the new pool size will be the old pool size squared, so every device that has an IPv4 address today could get as many IPv6 addresses as there are IPv4 addresses on the face of the Earth.

And that is a lot. (Not counting with NAT of course ;) )

If the IETF would make this happen again (squaring the address pool), then a new IP address could be assigned to every thousandth or so atom in the known universe.

Besides the exuberant addresses, IPv6 brings to us an other few things play with, such as:

Direct connectivity, no NATs. Again, for there are a plenty of available addresses.
Nice zero-configuration LANs, no router or other coordination is needed.
Multicast - send traffic to many hosts at once.
Anycast - send traffic to the closest host holding the given anycast address.
Simpler message format, faster routing, faster Internet.
Built-in security
Better QoS - better handling and controlling of independent data streams between hosts, making possible to throttle and prioritize traffic more efficiently.
Better mobility support - switching between networks is much easier.
Better administration: zero configuration, network renumbering and easy multihoming (using more than one internet connections).
Smooth transition from IPv4

This last bullet point made possible by being able to "map" all the current IPv4 addresses into a tiny little fraction of the available IPv6 addresses.

This with the availability of IPv6 tunnel services (like SIXXS) can ship you into the future without causing severe (or any) seasickness.

SIXXS?

SIXXS is an IPv6 deployment & tunnel broker. Anyone can register and get an IPv6 address range the size of the current IPv4 address space. I for example own three of them at the moment.

If you register at https://www.sixxs.net/ you can get an IPv6 address range and a possibility to build a tunnel through the old Internet into the new one.

Tunnelling works by directing all non-local IPv6 traffic to a distant machine, with IPv6 packets encapsulated in IPv4 ones. You might not get the speed benefits, but you will get all the others, with the priceless feeling of being an early adopter of a wonderful new technology.

You can then browse the Web and use almost all your usual applications over IPv6, and can access the growing number of resources only available on the new net - on hosts with only IPv6 addresses.

Your SIXXS account will be credited with a few ISKs when you open it. A handful of ISKs are need to request your tunnel (and therefore address (range)). ISKs accumulate by keeping your tunnels up and running - something you probably planning to do anyway.

If you would like to experiment with your home devices, you need an IPv6 - and SIXXS tunnel - capable router.

Bummer.

Not many routers can set up tunnels to SIXXS PoPs out of the box, so we need to do some geeky stuff to convert your household to IPv6.

Raspberry PI?

You probably know - and chances you own - a small single board computer called a Raspberry PI. This little device has a slow-but-OK ARM CPU, some OK amount of memory for a very low price. If you don't have one, make sure you buy two, for about a hundred USDs it's a fair deal.

Google for your local supply of PIs.

Raspberry PI setup - aiccu

Raspberry PI can run Linux, make sure you have the current official version of Raspbian running on it.

After you have requested your tunnel - and it got approved in a few hours (or days if you are unlucky), you can set up your PI to connect to the new Internet.

Make sure you request a dynamic, NAT traversing AYIYA tunnel, since it is almos certain that you're behind a NAT.

The client program you need to install is aiccu. Make that happen by issuing the following command:

sudo apt-get install aiccu

This will install the software and the dependencies. The aiccu install script will ask for your user name and password, and if you have more than one tunnel, you will have to select the appropriate one here.

Aiccu will start up, and there you go:

root@minotaur:~# ping6 google.com
PING google.com(bud02s04-in-x06.1e100.net) 56 data bytes
64 bytes from bud02s04-in-x06.1e100.net: icmp_seq=1 ttl=57 time=4.99 ms
64 bytes from bud02s04-in-x06.1e100.net: icmp_seq=2 ttl=57 time=7.76 ms
64 bytes from bud02s04-in-x06.1e100.net: icmp_seq=3 ttl=57 time=6.24 ms
64 bytes from bud02s04-in-x06.1e100.net: icmp_seq=4 ttl=57 time=3.86 ms
^C
--- google.com ping statistics ---
4 packets transmitted, 4 received, 0% packet loss, time 3004ms
rtt min/avg/max/mdev = 3.865/5.719/7.767/1.452 ms

You can see into the future. Minotaur is my Raspberry PI, the command I used is ping6, the target machine was google.com.

But what about the other devices on the network? They can't. Your RPI has to act as a router between them and the rest of the IPv6 Internet.

Fortunately this can easily be done. Just install radvd, a router advertisement daemon that will periodically yell at your local devices, telling them about the great opportunity of becoming a test subject for your experiment:

sudo apt-get install radvd

Radvd needs a configuration file at /etc/radvd.conf:

interface eth0 {
    AdvSendAdvert on;
    prefix 2a01:368:e000:8074::/64 {
        AdvOnLink on;
        AdvAutonomous on;
        AdvRouterAddr on;
    };
};

Now hold on for a while, that is my ip address prefix, you have to substitute that for your own. Take a look at your SIXXS User Home page under the title "Subnets", and look for "Subnet Prefix".

If you start radvd after this, well, it might not want to start, complaining about IPv6 forwarding being turned off.

Raspberry PI setup - sysctl

Allow IPv6 forwarding by creating the configuration file /etc/sysctl.d/local.conf containing:

net.ipv6.conf.all.forwarding=1

Raspberry PI setup - interfaces

Linux won't forward IPv6 packets from an interface if there is no IPv6 address of that interface is known. So we will have to set one up.

Edit /etc/network/interfaces and add

iface eth0 inet6 static
 pre-up modprobe ipv6
 address 2a01:368:e000:8074::101
 netmask 64

to it. Please replace the IPv6 address to one with your SIXXS-provided prefix. You can make one up by writing a suitable number after your prefix.

Raspberry PI setup - modules

The sysctl part only works if the ipv6 kernel module is loaded before sysctl tries to set up forwarding. This can be done by editing the file /etc/modules and inserting a single line into it:

ipv6

This way the proper kernel driver will be loaded at every boot.

After all of this, you need to reboot your PI. Make sure aiccu and radvd is running.

Set up your other devices.

Hah! I was joking. You don't have to. Actually I had to reconnect to my wifi with my Samsung Galaxy S4 to notice the change, but my desktop computer picked up the router configuration without me having to intervene.

The IPV6 test site http://test-ipv6.com/ scored 10/10.

Amateur (HAM) radio: analyzing historical data from reversebeacon.net

2013-10-26T07:00:00.002-07:00

What is amateur radio?

Amateur radio (or HAM radio) is a great hobby, or sport - it depends on how seriously do you take it, and what activities you choose to engage in precisely.

Basically HAM radio is a pastime where people around the world buy and/or build radio transceivers, antennas and lots of other gear, and make contacts with each other using the frequencies and modes that complies with rather strict regulations.

Amateur radio operators also participate in emergency communication, since no infrastructure is required, and with the proper tools, they can talk over thousands of kilometres.

Despite the aforementioned strictness of radio regulations, there are multitude of ways you can "do HAM radio".

If you enjoy chatting, you can buy a cheap hand-held device and check into a "net" on a local repeater. If you like "hunting DX" - making long-distance contacts on different continents (or covering an unusually large distance at the given circumstances), than you might want to invest into a good short wave radio and a suitable antenna.

You can also build your own radio, antenna and necessary equipment AND hunt DX with that!

What is reverse beacon?

Reverse beacon (http://reversebeacon.net/) is a service that gathers information from hundreds of radio receivers around the world. These radio receivers listen on various bands, constantly trying to decode the communication.

The network currently mostly listens to "CW" - continuous wave communication or other words Morse code.

Yepp, Morse code is still in use, actually this method is very popular in the amateur radio community. It is particularly easy to build Morse receivers and transmitters, so a lot of the DIY folks speak Morse.

Also, Morse code can be decoded automatically, so reverse beacon can intercept amateur radio Morse code communication.

The intercepted communication can be searched and displayed on the website, even in real time. Cool.

A very nice feature of the RBN is that it makes the historical data available for us. Free. Thank you folks!

RBN stores various interesting info about the intercepted messages. Who sent it, who heard it and when; how loud was the signal and how fast the pace of the dits and dahs were.

Today I've counted 221.582.054 rows in the database. That's a big pile of data to chew on. Let me tell you how I do it.

Choosing a database engine

Considering the size of the data set, you may call this "big data". You could shove it into mysql or postgresql, or something similar, but you'd need to build proper indexes, or just need to be very patient. Loading the data also might take long, hours maybe.

My choice is InfiniDB.

InfiniDB is a columnar database. There is a free, and an enterprise version. The free is more than fine for us.

It stores in a column continuous-, instead of row continuous fashion. This (and many other design features) is to greatly reduce IO: reading 220 million rows from the disk is no joke in case you have to do a full table scan. InfiniDB never do that.

Instead it does a full column scan, so it won't read data you don't need: in mysql, a simple select <column> from <table> would read or skip through all data in the table, including columns you don't need (well, this is more complicated, but for now this is enough).

You also don't need to create indexes. Actually, the data is structured so that you could say that in InfiniDB data is the index. Whatever query you throw on InfiniDB, it will be kinda fast. (See the concept guide for details. Also, you might want to read a bit about how InfiniDB compares to other, row-based databases.)

Unless you want to do some crazy SQL magic, which just won't work, or will be very slow, since InfiniDB does not support SQL entirely, only a subset of it.

Installing InfiniDB

You can get the installer after registering and logging into infinidb.org.

The installation procedure is simple, read the getting started guide.

Basically what you have to do is to uncompress the downloaded archive under /usr/local. It will create a directory /usr/local/Calpont. After that, run /usr/local/Calpont/bin/install-infinidb.sh as root.

The InfiniDB isntaller is made for servers, so you have to adjust certain parameters to be able to run InfiniDB correctly.

Open up /usr/local/Calpont/etc/Calpont.xml, and look for a line like this:

<NumBlocksPct>66</NumBlocksPct>

The number between the XML tags is a percentage. InfiniDB will acquire this percentage of the totaly physical system RAM for it's own block cache. I've seen this value set by the installer as high as 80%, and it's almost always unacceptable on a desktop computer. Change this to whatever you think is OK.

Also, please try to set this as high as you can easily tolerate, since the block cache can (and will) make queries faster by reducing, or eliminating disk IO.

Fortunately I own a computer with 32 gigabytes of RAM, and the total InfiniDB data directory is about 27 gigabyte, so I can store most of the reverse beacon data in my RAM. After a few queries, things speed up quite a bit, and further queries don't even make my HDD LED blink.

Since we will use a single table for the RBN data (wich is kinda suboptimal, but hey), we don't really need to care about and optimize join behaviour, but InfiniDB tuning is quite an interesting topic, so if you too care, read the tuning guide.

Getting the historical data

First of all, InfiniDB is "just" a mysql storage engine, so by installing inifidb, you install a mysql instance as well. Make sure to stop any running mysql databases before attempt to install infinidb. (Also, you can configure either InfiniDB or your existing mysql to run on a different than the default 3306 port, by editing the proper my.cnf file.)

You can download raw data from http://www.reversebeacon.net/raw_data/. It not so entertaining to download and process almost two thousand files by hand, so I've written a python script to do the job. Please be polite, and don't comment the out the line that waits between downloads - I don't want anyone to slam the server.

#!/usr/bin/env python
# -*- coding: utf8 -*-

import os
import time
import urllib2
import datetime

base_url = "http://www.reversebeacon.net/raw_data/dl.php?f="

today = datetime.date.today()
delta = datetime.timedelta(1)

i = datetime.date(2009, 2, 21)

while i <= today:
    datestr = "%d%02d%02d" % (i.year, i.month, i.day)
    fname = "RBNDATA/%s.zip" % datestr
    url = base_url + datestr

    if os.path.isfile(fname):
        print "%s exists." % fname
        i += delta
        continue

    print "Downloading %s..." % url

    try:
        content = urllib2.urlopen(url)
        f = open(fname, "w")
        f.write(content.read())
        f.close()
    except urllib2.HTTPError:
        print "Error downloading %s" % url

    i += delta
    time.sleep(5)

Put this script into an empty directory as "download_rbndata.py" and also create a directory named "RBNDATA" next to it. Cd into the script's directory, and run it with python or pypy (issue "python download_rbndata.py", or "pypy download_rbndata.py" command, or just give it an executable flag and fire ./download_rbndata.py).

The download will take about 3 hours. Be patient. Once you've downloaded the files, you can fire the command again, and it will not download the files you already have, so you can gradually gather your own pile of RBN data without re-downloading everything over and over.

If you set up InfiniDB properly, you need one more trick to get a DB shell.

Open up a bash shell and do

. /usr/local/Calpont/bin/calpontAlias

Notice the space between the dot and the rest of the line!

This script will set up command aliases, so - for example - you can get an infinidb shell, by issuing:

idbmysql

After this, a familiar mysql shell will greet you. If you don't want to type the calpontAlias command every time you open a shell, make sure you include it in your .bashrc file. (This step is optional)

Also, you might want to put /usr/local/Calpont/bin into your PATH as well. (This is not vital either.)

Converting and improving on quality

We will use InfiniDB's cpimport tool to import the data into the database.

But first we have to take care of a few problems:

The format of the files are slightly different in different time intervals. Some have headers, some don't, there are files with more columns than others, and the last row contains no data.

Also, cpimport can't read ZIPs, so we have to uncompress the whole thing and make it into a single csv file.

All these things are taken care of by the following python script. Save this as "build_import.py", and put next to the file and RBNDATA directory described above.

#!/usr/bin/env python
# -*- coding: utf8 -*-

import os
import glob
import zipfile

basedir = "RBNDATA"
importfile = "rbndata.csv"

zips = glob.glob(basedir + "/*.zip")
zips.sort()

impf = open(importfile, "w")

for z in zips:
    f = zipfile.ZipFile(z, "r")
    content = None
    csvname = f.namelist()[0]
    print csvname
    csv = f.open(csvname, "r")
    for line in csv:
        if line[:8] == 'callsign':
            print "(dropping header)"
            continue
        if line[:1] == '(':
            print "(dropping last line)"
            continue
        extra = 12 - line.count(',')
        impf.write(line.rstrip() + (',')*extra + "\n")
    csv.close()
    f.close()

impf.close()

Run this just like the other script. It will read the ZIP files and create a huge csv (more than 16 gigabytes). This can be imported into InfiniDB.

Importing the data

Make sure your InfiniDB instance is up and running (use /etc/init.d/infinidb start if it isn't).

Start an idbmysql shell, and run the following sql:

CREATE DATABASE rbndata;

use rbndata;

CREATE TABLE rbndata (
  callsign VARCHAR(16),
  de_pfx VARCHAR(6),
  de_cont VARCHAR(4),
  freq DECIMAL(12,4),
  band VARCHAR(6),
  dx VARCHAR(16),
  dx_pfx VARCHAR(6),
  dx_cont VARCHAR(4),
  mode VARCHAR(6),
  db INT,
  logged_at DATETIME,
  speed INT,
  tx_mode VARCHAR(6)
) ENGINE=InfiniDB DEFAULT CHARSET=latin1;

Now you can use cpimport to do the job:

$ sudo /usr/local/Calpont/bin/cpimport rbndata rbndata rbndata.csv -s ,

Of course, you have to stand in the directory where the huge rbndata.csv file is, which was created by out python scripts.

Also notice the comma after the -s switch. This is how we set the field separator, so it's absolute mandatory.

Also, as the saying goes, "this might take a few minutes". Indeed, it only took a few, my computer chugged this bottle of bytes down in an amazing 510.5 seconds, swallowing about 430 thousand rows per second. Wow.

You can verify the bulk load by issuing the following command at the idbmysql shell:

mysql> select count(*) from rbndata;
+-----------+
| count(*)  |
+-----------+
| 221582054 |
+-----------+
1 row in set (2.42 sec)

This might be a lot slower in your case, since at this time, your block cache is cold, and you have to read a lot from the disk at this time. My cache was hot, and InfiniDB only had to hit RAM to calculate the rows in the table.

Your results also may vary, since after the publication of this article many more contacts were logged by the RBN.

Some interesting queries

Now we have an InfiniDB up and running, have our data inside it, so let's the fun begin!

We already see how we calculate the total number of rows, which is simple and boring.

Let's see the different callsigns!

select count(distinct dx) from rbndata;

There is an astonishing 690,843 different callsigns in the database. Of course a callsign might be there as a (let's call it) a "simple" one, like mine: HA5FTL, or the operator might have worked outside the shack, on a field, so HA5FTL/P -like callsigns are also there, so a single station or operator might even appear 3-4 different way.

Counting just the simple cases gives us:

select count(distinct dx) from rbndata where dx not like '%/%';

614,185 different callsigns.

Let see portables:

select count(distinct dx) from rbndata where dx like '%/P';
16,409

Mobiles:
select count(distinct dx) from rbndata where dx like '%/M';
4783

Maritime mobiles:
select count(distinct dx) from rbndata where dx like '%/MM';
873

Air mobiles:
select count(distinct dx) from rbndata where dx like '%/AM';
34

Ok, so how much automatic receiver stations are (were) in the RBN?
select count(distinct callsign) from rbndata;
895 is the result. That's a lot of stations!

Since listening stations come and go, let's see how many were working in august:
select count(distinct callsign) from rbndata where logged_at between '2013-08-01' and '2013-09-01';
Result is 197. Notice the speed: this query run under one second on my machine.

Cool isn't it? And it gets way cooler than that.

See how much loggers on each continent:

select de_cont, count(distinct callsign) as loggers from rbndata group by de_cont order by loggers desc;

And the result is:

EU: 502
NA: 284
AS: 53
SA: 33
OC: 19
AF: 5

So Europe has the largest number of logger stations, and Africa has disappointingly few. Also, Asia and South America could use some volunteers, and the people of Oceania are also have to pull themselves together ;)

You can do the same with the DXes, or the call signs that were recoded:

select dx_cont, count(distinct dx) as dxes from rbndata group by dx_cont order by dxes desc;

EU: 376185
NA: 178105
AS: 83266
AF: 19333
SA: 17755
OC: 16859

No surprise, EU leads the list, North America follows. But what's interesting is Africa's position. This is probably because Africa is a popular target of DX expeditions: trips to places that rarely see HAM radio activity due to the lack local amateurs, or even human beings (like small bare-rock islands in the middle of nowhere). Africa also have a HAM radio life of it's own, in contrast to what you might think first.

(Hm, I miss Antarctica. Don't laugh, there IS some HAM radio activity there, I even made contact with RI1ANF, a station at Bellinghausen Base, King George Island, which technically belongs to Antarctica).

We can play with the bands too. Let's see what bands are the most popular (or at least the most popular in RBN's database):

select band, count(distinct dx) as c from rbndata group by band order by c;

20m: 378063
40m: 372545
80m: 203150
15m: 173935
30m: 120276
10m: 93178
17m: 82932
160m: 63508
12m: 36151
6m: 25389
2m: 2595
60m: 1200
4m: 162
472kHz: 112
70cm: 66
137kHz: 1

Aaaand the great trio leads: indeed, 20, 40 and 80 meters are very popular, and you can always find something on 30 meters, and when the propagation is good, great DXes keep popping up on 15 meters.

If you lack antenna space, I'd definitely recommend you to build something (a wire antenna, or an aluminium stick) that does not too bad at 15 and 20 meters, and you just might be able to work with local stations on 40 meters up to maybe 1-2 thousand kilometres, if you're lucky (and that stations has a proper antenna).

An other good question is: how each of these band perform at great distances? unfortunately I don't have the exact location of these stations, however I could buy access to qrz.com's huge database, and grab location information there.

Let's define a "DX log entry" as an entry that have different data in the dx_cont and de_cont field, e.g. the logger station and the calling stations were on a different continent:

select band, count(*) as c from rbndata where de_cont <> dx_cont group by band order by c desc;

This will give us the following list:

20m: 25547420
40m: 15628909
15m: 12397729
10m: 4076702
80m: 3660948
17m: 2848269
30m: 2701406
12m: 1012281
160m: 927274
6m: 42104
60m: 5734
2m: 895
4m: 119
472kHz: 10
70cm: 4

So if you want to work DX, you will probably have luck on 20 meters (also don't forget that the data is skewd, since the distribution of logger stations is not uniform).

We can also ask the question: which band is more like a "DX band", and wich is more of a "local band"? Let's see:

select band, sum(if(de_cont <> dx_cont,1,0))/count(*) as c from rbndata group by band order by c desc;

12m: 0.6003
10m: 0.5405
15m: 0.5262
17m: 0.4434
20m: 0.3697
40m: 0.2380
30m: 0.2296
60m: 0.1714
80m: 0.1433
160m: 0.1001
4m: 0.0474
6m: 0.0445
2m: 0.0238
70cm: 0.0071
472kHz: 0.0036
137kHz: 0.0000

So what we've got here? The ration of DX/same continent contacts is the higher on 12 meters. We can speculate that you'll do more DX on 12 meters than not. (Of course, never forget that the data is heavily biased. We should examine the distribution of listening stations through bands and continents and apply some clever de-biasing before ever stating something like that!)

Despite the bias, the figure above are somewhat reflect reality: the higher HF bands are more like DX bands, and the lowers are generally easier to use for local communications. This is because the middle (and sometimes the upper) HF frequencies are more likely to be reflected from the upper atmosphere. The low frequencies are absorbed, the high frequencies are leaving the Earth into space. (Yes I know, this is much more complicated, and the reflectivity of the ionosphere is a science of it's own. Also it's noteworthy that it is a very cool piece of science that potentially can greatly benefit from the data gathered by the RBN.)

I have a humble station with an Elecraft K2 as my main radio along with a few work-in-progress DIY rigs. My antenna is also just a 4 metres long stick of aluminium tube sticking out from my window.

How such amateur station can perform?

Let's see:

select dx, count(*) from rbndata where dx LIKE 'HA5FTL%' group by dx;

HA5FTL: 716
HA5FTL/P: 32

Well, that's not much. Partly because of my low-level signal, partly because I rarely "call CQ", so my callsign is mostly only heard when I answer to a general call. Weak CQ-s are rarely answered, so I listen and answer calls myself mostly.

But how the continents can hear me?

select dx, de_cont, count(*) as c from rbndata where dx like 'HA5FTL%' group by dx, de_cont order by dx, c desc;

+----------+---------+-----+
| dx       | de_cont | c   |
+----------+---------+-----+
| HA5FTL   | EU      | 654 |
| HA5FTL   | NA      |  46 |
| HA5FTL   | AS      |  14 |
| HA5FTL   | AF      |   1 |
| HA5FTL   | SA      |   1 |
| HA5FTL/P | EU      |  31 |
| HA5FTL/P | AS      |   1 |
+----------+---------+-----+
7 rows in set (6.99 sec)

So you only have a good chance to hear me if you're living in Europe. Also if you own a decent antenna, we might be able to make contact if you're living in Canada's or the USA's east coast. Asian and African HAMs will have a hard time hear me at all. (Also, this low number of logs can be attributed to the low number of loggers at AF and AS, but let's just be realistic: my shack is far from ideal).

Let's see my favourite bands (beside Infected Mushroom and Shpongle :P):

select dx, band, count(*) as c from rbndata where dx like 'HA5FTL%' group by dx, band order by dx, c desc;

+----------+------+-----+
| dx       | band | c   |
+----------+------+-----+
| HA5FTL   | 40m  | 288 |
| HA5FTL   | 20m  | 255 |
| HA5FTL   | 15m  | 100 |
| HA5FTL   | 17m  |  31 |
| HA5FTL   | 80m  |  23 |
| HA5FTL   | 10m  |  10 |
| HA5FTL   | 30m  |   9 |
| HA5FTL/P | 40m  |  26 |
| HA5FTL/P | 10m  |   3 |
| HA5FTL/P | 20m  |   3 |
+----------+------+-----+
10 rows in set (7.04 sec)

Indeed, I mostly work on 40 and 20, occasionally on 15 meters, rarely on others.

We also could see the "DX ratio" on each band:

select dx, band, sum(if(dx_cont <> de_cont,1,0))/count(*) dx_ratio, count(*) calls_heared from rbndata where dx like 'HA5FTL%' group by dx, band order by dx, dx_ratio desc;

I've included the calls_heared column so we can see how precise the dx_ratio might be:

+----------+------+----------+--------------+
| dx       | band | dx_ratio | calls_heared |
+----------+------+----------+--------------+
| HA5FTL   | 15m  |   0.2200 |          100 |
| HA5FTL   | 10m  |   0.2000 |           10 |
| HA5FTL   | 20m  |   0.1216 |          255 |
| HA5FTL   | 17m  |   0.0968 |           31 |
| HA5FTL   | 40m  |   0.0139 |          288 |
| HA5FTL   | 80m  |   0.0000 |           23 |
| HA5FTL   | 30m  |   0.0000 |            9 |
| HA5FTL/P | 20m  |   0.3333 |            3 |
| HA5FTL/P | 40m  |   0.0000 |           26 |
| HA5FTL/P | 10m  |   0.0000 |            3 |
+----------+------+----------+--------------+

It looks like it's worth getting on 15 meters if I want to work DX from home, and If the /P 20 meters dataset size would be bigger at the same ratio, I'd say that 20 meters with a random wire on a field is good for DX, unfortunatel 3 logged contacts is waaay too low to mean anything.

I must note this point that all of these queries run under half a minute, and mostly under 10 seconds. On 200 million rows with group by-s and stuff.

Well I think you all get the picture now. If you have some interesting queries, or even questions, just write a comment below.

Programming the Bluegiga BLE112 Bluetooth 4.0 module with Linux

2013-05-13T08:02:00.003-07:00

Why?

BLE112 is a Blutooth 4.0 or Blutooth Low Energy (BLE for short) module that contains a microcontroller packed with an awsome firmware that lets you write Bluetooth 4.0 applications in a very user-friendly language called BGScript.

WARNING!!! if you follow the guide below, you're gonna brick the chip. The recent versions of BLE* firmwares require a license key that built into the chip, and cc-tool wipes it, effectively disabling the radio on the device. The solution is to use the "BLE Update Utility" from BlueGiga. I couldn't start it on older Windows installations nor on Wine. Work in progress. If you manage to start it in Wine, please share the knowledge. Thank you!

You can get more info on the module at:

http://www.bluegiga.com/BLE112_Bluetooth_Smart_module

If you want to use the module, you need a programmer tool from Texas Instruments called cc-debugger:

http://www.ti.com/tool/cc-debugger

And you need to wire up the chip to the computer. You can read about that at the following two links:

http://blog.bluetooth-smart.com/2012/09/11/programming-the-ble112-with-c-code-using-iar/

http://blog.bluetooth-smart.com/2012/09/16/programming-the-ble112-using-bgscript/

The programming software can be downloaded from bluegiga's techforum. These are a collection of windows applications that let you build the program and there's also a nice tool called blegui2.exe that lets you debug you solution.

To try your bluetooth module, you need a bluetooth 4.0 enabled hardware or a bluetooth 4.0 dongle such as BLED112 from Bluegiga:

http://www.bluegiga.com/BLED112_Bluetooth_smart_dongle

This plugs into an USB port. There is a proper driver that works with Windows. I had no problem to set up the software and hardware on my home machine that runs Windows 8. When I switched to Linux, problems popped up.

First of all, I noticed that when I plugged in the dongle, it kept disconnecting, and connecting again. The following output is from dmesg:

[ 5755.444220] cdc_acm 3-1:1.0: ttyACM3: USB ACM device
[ 5755.984120] usb 3-1: USB disconnect, device number 43
[ 5756.524111] usb 3-1: new full-speed USB device number 44 using uhci_hcd
[ 5756.698098] usb 3-1: New USB device found, idVendor=2458, idProduct=0001
[ 5756.698102] usb 3-1: New USB device strings: Mfr=1, Product=2, SerialNumber=3
[ 5756.698104] usb 3-1: Product: Low Energy Dongle
[ 5756.698106] usb 3-1: Manufacturer: Bluegiga
[ 5756.698108] usb 3-1: SerialNumber: 1
[ 5756.706137] cdc_acm 3-1:1.0: ttyACM3: USB ACM device
[ 5757.224143] usb 3-1: USB disconnect, device number 44
[ 5757.744071] usb 3-1: new full-speed USB device number 45 using uhci_hcd
[ 5757.912164] usb 3-1: New USB device found, idVendor=2458, idProduct=0001
[ 5757.912175] usb 3-1: New USB device strings: Mfr=1, Product=2, SerialNumber=3
[ 5757.912182] usb 3-1: Product: Low Energy Dongle
[ 5757.912188] usb 3-1: Manufacturer: Bluegiga
[ 5757.912194] usb 3-1: SerialNumber: 1
[ 5757.920274] cdc_acm 3-1:1.0: ttyACM3: USB ACM device
[ 5758.464106] usb 3-1: USB disconnect, device number 45
[ 5758.988078] usb 3-1: new full-speed USB device number 46 using uhci_hcd
[ 5759.160089] usb 3-1: New USB device found, idVendor=2458, idProduct=0001
[ 5759.160100] usb 3-1: New USB device strings: Mfr=1, Product=2, SerialNumber=3
[ 5759.160107] usb 3-1: Product: Low Energy Dongle
[ 5759.160113] usb 3-1: Manufacturer: Bluegiga
[ 5759.160119] usb 3-1: SerialNumber: 1
[ 5759.168192] cdc_acm 3-1:1.0: ttyACM3: USB ACM device
[ 5759.704175] usb 3-1: USB disconnect, device number 46
[ 5760.220064] usb 3-1: new full-speed USB device number 47 using uhci_hcd
[ 5760.592121] usb 3-1: New USB device found, idVendor=2458, idProduct=0001
[ 5760.592126] usb 3-1: New USB device strings: Mfr=1, Product=2, SerialNumber=3
[ 5760.592128] usb 3-1: Product: Low Energy Dongle
[ 5760.592130] usb 3-1: Manufacturer: Bluegiga
[ 5760.592132] usb 3-1: SerialNumber: 1

I dug through the Internet for a good few days after I found out that this dongle resets itself if it receives a malformed command. I suspected that there is some process that tries to communicate with it and keeps it resetting over and over.

The solution was to add the device to the udev blacklist. Create a new file under /lib/udev/rules.d (mine is named 77-mm-my.rules), and add the following content:

# BLuegiga BLED112
ATTRS{idVendor}=="2458", ATTRS{idProduct}=="0001", ENV{ID_MM_DEVICE_IGNORE}="1"

The name of the file is important. Use my idea about the name if you don't know udev very well.

After this, restart modem-manager by issuing "sudo killall modem-manager". After this, modem-manager should not interfere with the device.

This trick can save a lot of hardware from being bugged to death by modem-manager. One example is my USB AVR programmer.

After this, ALL bluegiga programs kinda work as expected with wine. No freezes or segfaults! The compliler and builder can be run exactly as you would do from under Windows. Just pay attention to the path / drive mapping where appropriate.

The big surprise is that blegui2.exe is also works like a charm.

The only problem is with wine itself. It does not seem to offer out-of-the-box serial port support.

I managed to find a solution to this problem too:

http://ubuntuforums.org/showthread.php?t=1523814

TL;DR: you have to download a registry file, and integrate it to wine's registry:

$ wget -O comport.reg http://bugs2.winehq.org/attachment.cgi?id=10210
$ regedit comport.reg

After this, wine can show Linux serial devices a Windows comX ports.

To configure the mapping, you have to create symlinks under ~/.wine/dosdevices/comX which point to devices under /dev/ttyXXX. In my case:

$ cd ~/.wine/dosdevices
$ ln -s /dev/ttyACM3 com1

It's actually quite a dull process to look up the device in dmesg and create the symlink myself. Udev can do this! Let's modify the udev rule to create a device symlink to our dongle, and point wine's com1 symlink to that symlink (yo dawg... :) ):

# BLuegiga BLED112
ATTRS{idVendor}=="2458", ATTRS{idProduct}=="0001", ENV{ID_MM_DEVICE_IGNORE}="1", SYMLINK="bluegiga/bled112"

And:

$ cd ~/.wine/dosdevices
$ ln -s /dev/bluegiga/bled112 com1

This way every time we plug out bled112 dongle into the computer, the correct device will be mapped under wine's com1.

So with everything working like that, the only problem is how to write the resulting .hex file onto out ble112 device?

The solution is cc-tool: http://sourceforge.net/projects/cctool/

This tool isn't included in the distributions I know, so you have to download the sources and build it yourself. It's quite easy to do, it follows the usual configure-make-make install process. Just make sure you install the boost and libusb development libraries. If you use Ubuntu, you can do that by issuing the following command:

$ apt-get install libboost-all-dev libusb-1.0-0-dev

After this, you can download and install the cc-tool package. Open up a terminal window, and do the following:

$ wget http://heanet.dl.sourceforge.net/project/cctool/cc-tool-0.26-src.tgz
$ tar -xf cc-tool-0.26-src.tgz
$ cd cc-tool
$ ./configure
checking for a BSD-compatible install... /usr/bin/install -c
checking whether build environment is sane... yes
...
checking for LIBUSB... yes
configure: creating ./config.status
config.status: creating Makefile
config.status: executing libtool commands
$ make
  CXX    src/main.o
...
  CXXLD  cc-tool
$ sudo make install
make[1]: Entering directory `/home/netom/install/cc-tool'
...
make[1]: Leaving directory `/home/netom/install/cc-tool'
$ sudo cc-tool
  CC Debugger device not found

You have to run the cc-tool program as root to be able to access USB. The last line is an error message. It means that the cc-debugger is not connected. When the device is connected, cc-tool recognizes it:

$ sudo cc-tool
  Programmer: CC Debugger
  Target: CC2540
  No actions specified

You can get some info on the target with the -t switch:

$ sudo cc-tool -t
  Programmer: CC Debugger
  Target: CC2540
  Device info: 
   Name: CC Debugger
   Debugger ID: 8561
   Version: 0x05CC
   Revision: 0x0034

  Target info: 
   Name: CC2540
   Revision: 0x20
   Internal ID: 0x8D
   ID: 0x2540
   Flash size: 128 KB
   Flash page size: 2
   RAM size: 8 KB
   Lock data size: 16 B

For more info, just enter cc-tool --help.
Okay. Now everythin should work fine. Let's build the bkble112 thermometer demo. Cd into the ble/example/bkble112 director, and issue the following commands. I'm going to use absolute paths on my own machine. Replace them with your paths.

$ cd ble/example/dkble112
$ wine /home/netom/install/ble/bin/bgbuild.exe project.bgproj
baudm:216 baude:10 rate:57617
UART channel:0
 baudrate   :57600
 actual     :57617
 error%     :0.0295139
 alternate f:2
ports:14336
C:/users/netom/Temp/qt_temp.crifX8
0
C:/users/netom/Temp/qt_temp.dflNl8

RAM Memory
-------------------------------------------------
Core RAM end                    @ 0x00cc5    3269
Top of RAM                      @ 0x01f00    7936
RAM left for data               = 0x0123b    4667
Attribute RAM                   - 0x00007       7
Connections                   1 - 0x00194     404
RAM for packet buffers      109 - 0x0109b    4251

Flash Memory
-------------------------------------------------
Core flash reserved             @ 0x18000   98304
Top of flash                    @ 0x1f800  129024
Flash left for data             = 0x07800   30720
Common configuration            - 0x00070     112
16 bit UUIDs                    - 0x00022      34
128 bit UUIDs                   - 0x00000       0
Attribute database              - 0x00084     132
Constant attributes data        - 0x0007d     125
USB descriptor                  - 0x000c6     198
BGScript                        - 0x001f1     497
Flash for PS Store           14 - 0x07000   28672
$ sudo cc-tool -few out.hex -v
  Programmer: CC Debugger
  Target: CC2540
  Erasing flash...
  Completed       
  Writing flash (128 KB)...
  Completed (1.25 s.)
  Verifying flash...
  Completed (0.54 s.)

If you notice random errors, try programming without the -f (fast) switch.
You can build your own projects similarly to this one.
One more trick: if you are tired entering full paths and wine commands, create and alias

$ echo 'alias bgbuild="wine <install dir>/ble/bin/bgbuild.exe"' >> ~/.bashrc

After this, you only have to type bgbuild project.xml to build your project.

Speeding up bitcoin-qt on Linux

2013-04-02T22:13:00.001-07:00

The problem

After freshly installing bitcoin-qt - the de-facto bitcoin client - to a computer, it starts downloading "blocks" from the network. The last few (tens of) thousands of blocks are especially slow, and the client is using the disk heavily.

Examining the disk usage closely with iostat showed that the disk utilization was 100%, but at a very low data rate. My disk can read and write about 150 MBytes/second if utilized properly, but bitcoin-qt could only write about one megabytes per second.

(EDIT: thanks for the donation, you rock! :) )

The cause

Bitcoin-qt is very nicely written program that uses the guarantees granted by modern file systems to improve data integrity. This means that if you pull the plug in the middle of disk operations, your filesystem will still be able to come back to life after reboot, and will do so quite quickly.

The program does this by calling the fsync() system call a lot. This ensures that when it returns, every byte sent to the file handle will be on the magnetic surface of your disk. Well, it seems to be the case in the default Ubuntu desktop configuration at least.

The ext4 file system - what I use - sends so-called barrier commands to the disk when it fsync() is issued (well, on any journal commit). The barrier ensures that everything is on the disk before it starts to process commands after the barrier. By the time the barrier command returns, everything is on the spindles.

The solution

The block chain download is just painfully slow. Power outages are rare and I have backups. Data integrity therefore doesn't seem like a major concern to me, at least for a few hours while the block chain is downloading. If I could just disable barriers, that might improve performance.

The examples below assume that you keep your block chain on the / partition. If you have a separate partition for /home, you probably have to use that. Anyway if you start tweaking your file system like this, I expect you to know how to figure this out. ;)

This is easy to test, barrier writes can be disabled on-line:

sudo mount -o remount,nobarrier /

This disables barrier writes nicely totally risking the data integrity of your filesystem, unless you're using battery-backed RAID. The performance improvement of the client was drastic. The disk utilization dropped to about 50%, and my desktop programs (like firefox) was responsive again.

Do this only while you're initially downloading blocks. This can be dangerous on the average desktop computer. Always keep backups.

When the client is finished downloading blocks, return to normal operation:

sudo mount -o remount,barrier /

Donate

Oh, of course, you can always send me a few BTCs ;)

Bitcoin: 19uhBv4n8R7aUcJSMsD6vkaLotfXwcqavY

Litecoin: LdiZ2hwCsSh71CUK9zaac5iUS1KnQ5pS6D

Finding a Hamiltonian path - a randomized aproach

2013-03-29T02:24:00.002-07:00

The problem

There is an international programming contest in Hungary held in every year. I like to participate, the problems are very entertaining.

One of the problems in 2003 was an idealized DNA sequence assembly based on short reads. The sequences of course were generated by a computer (they weren't actually sequenced DNA data), and the input was very "clean":

There were given N reads, each L long.
The sequences overlapping exactly at five bases (five characters)
There were no read errors, or any kind of noise

That's it. It felt tempting to build a graph of that data.

Doing it in Python, the adjacency dictionaries (one forward, one reversed) was built in a couple dozen seconds for the largest files. (Python dictionaries are hash maps, they're pretty fast, and scale well.)

Possible solutions

Question is, how do I build a path in such a graph that contains all nodes exactly once?

Such a path is called a Hamiltonian path, and finding one in an arbitrary graph is an NP-complete problem. This means that it is hopeless to come up with an efficient algorithm that finds such paths quickly for small graphs, and only reasonably slower in bigger graphs.

Since the biggest input for this problem contained 10.000 nodes, it looked totally hopeless to exhaustively search through all possible paths, it would take a very-very long time (sun-would-turn-into-a-red-giant long).

Backtrack

...is the name of the process that would be still running under a red sun (worst case). It would also take insanely lot of memory to work.

However, if the graphs would have only a very few (like 1, or two) connections from a node towards other nodes (few branches along the possible paths), then it would work OK for even a few thousand nodes.

Building every possible Hamiltonian path

The idea is to start from a set of paths that contain only one node. Create a set and put a one-long path for every node.

In each iteration:

Try to extend the paths in every way, and build a new set from the extensions.
If it cannot be extended, it won't be in the new set.
When the new set is ready, discard the old one.

Do this until you will have a path that is Hamiltonian.

The problem with this is that the number of paths will be very large after 6-8 steps, and will eat many gigabytes of memory. Handling such a data structure is also very slow.

Discarding paths

Since, we only need a single path, we can try to get rid some of the paths.

I found out that if I randomly throw away paths so that only a few thousand remain after every iteration, I can solve the problems under one thousand nodes.

This is because there are many solutions, and we have a reasonable chance to find one even if we throw some candidates away.

Random path extension

The algorithm above is really just a complicated way of building random paths. Random path building can be done more quickly - therefore it's faster. We start from a random node, and add a random neighbour. If we can't add more, we check if the path is Hamiltonian. If it isn't, we start again.

This algorithm below can solve the problem for every input in reasonable time.

The problem is, that sometimes it takes two minutes to solve the hardest input, and sometimes it takes half an hour. Good thing is that it's very easy to parallelize: just run the same program on every processor core you have. One of them will find the solution quicker than the others.

It is surprising how a simple algorithm like this can solve such a hard problem.

Never underestimate the power of simple randomized algorithms!

ERROR: "Problem with InfiniDB process PrimProc, should be a single version running"

2013-01-17T02:18:00.002-08:00

InfiniDB

Infinibd is a column-oriented database (actually a MySQL storage engine, or to be more precise, a MySQL distribution with this special storage engine). It can handle analytical / reporting queries very well.

ERROR

I installed it on my laptop to form a nice data warehouse / reporting development environment together with Pentaho's tools. It did OK for a while, but it seized to work about a week ago.

Upon startup, the /etc/init.d/infinidb script gave this error message:

Problem with InfiniDB process PrimProc, should be a single version running

DEBUG

Since I saw no PrimProc in the process list, I thought the error message might be a little off. I found the place where it counted various processes in the process list, including PrimProc. I echo-ed the result, and I saw the there were 0 instances running. Another example for a confusing error message.

I took a look into the file /vat/log/Calpont/crit.log and I saw this:

Jan 7 16:01:14 sphynx PrimProc[16613]: 14.857119 |0|0|0| C 28 CAL0045: FATAL ERROR: PrimProc has allocated too much memory! PrimProc is restarting.

PrimProc is InfiniDB's Primitive Processor. It executes the smallest units of called primitives.

FIX

The amount of memory dedicated to the data buffer cache utilized by PrimProc is defined in /usr/local/Calpont/etc/Calpont.xml. It is called NumBlocksPct. The value of the variable is in percent of the phisical memory, and defaults to a very high value, and it can prevent PrimProc from starting, or it can cause heavy swapping. Since I do not use swap, PrimProc can't start with xorg, firefox, openoffice, thunderbird, skype, ... running.

The solution was to set NumBlocksPct to a lower value, in my case 20 is plenty.

Implementing a Hilbert (90 degree shift) filter in Python

2013-01-17T01:55:00.001-08:00

Why?

A digital 90° phase shift filter is an important building block of the so-called Software Defined Radios (SDRs).

And SDR is a radio that has (relatively) minimal hardware, and most of the features are implemented in software. There are no fancy buttons and displays, but there is a UI application that controls the box. The box is a direct conversion receiver. It just converts part of the radio frequency spectrum to the audio spectrum (the reality is a bit more complicated, but you got the idea).

An SDR usually provides two signals that are almost the same, except one signal's frequency components are shifted 90 degrees in one direction.

Given these two audio signals, the software can do anything a conventional receiver can achieve with bulky components or expensive integrated circuits. Plus, the software can be changed easily. Actually this is the greatest thing in SDRs.

The software can demodulate AM signals by LW-MW-SW broadcast stations, or FM usually encountered in the URH spectrum. It can make amateur SSB and CW signals readable, or can decode digital transmissions like DAB. Anything you can imagine.

SSB reception

I'm mainly interested in amateur (HAM) radio, and so I'd like to receive SSB transmissions. To do that, I have to 90° phase-shift one of the incoming audio signals from the SDR box, and sum with the other. The result is an SSB receiver.

FIR filters

I use the numpy.filter Python package to process audio signals. The firwin and firwin2 function are very useful for designing all sorts of FIR filters, but I could not find a built-in function that can readily be used to shift all frequencies by 90 degrees.

A filter is represented by its coefficients. The simplest FIR filter one can imagine is the averaging filter. You take N samples, add them, and divide the result by N. If you encounter a new sample, you just drop one sample from one end of the line, and put the new sample to the other end. You add them and divide by N again. Simple. The general FIR filter is a weighted averaging filter. The coefficients are the weights.

The coefficients are also the filter's impulse response. That is, if you filter a signal that has all zero samples except one, you get all zeroes and the coefficients somewhere in the middle. Example:

Signal:
0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 ...

Coefficients:
0.3 0.4 0.3

Response:

0 0 0 0 0 0 0 0 0 0.3 0.4 0.3 0 ...

As you can see, the response contains the coefficients.

This is important because the impulse response can be calculated from the desired frequency response by the inverse discrete Fourier Transform.

90° phase shift

In our case, this leads us to the code below:

import numpy.fft
import matplotlib.pyplot as plt

coeffs = numpy.fft.fftshift(
    numpy.fft.ifft([0]+[1]*100+[0]*100)
)

plt.plot(numpy.imag(coeffs))
plt.plot(numpy.real(coeffs))
plt.show()

The variable coeffs holds the filter coefficients. It can be used with the numpy.signal.lfilter function to process data.

The code works by calculating the inverse discrete Fourier Transform of a strange frequency response. It reads like this: "pass every negative frequencies, and supress all of the positive frequencies". This series has a complex iDFT. We are interested in the imaginary part of this inverse. The fftshift function rearranges the result. The plt.* function calls shows us the result.

Note that the filter coefficients are complex. An audio signal from and SDR receiver can be processed by taking the left and rigth channel as the real and imaginary parts of a complex signal, and filtering that - also complex - signal. The filtered signal will also be complex, and the imaginary part will be shifted by 90° (at every frequency).

If the signal to be filtered is real-valued, then the resulting filtered signal will still be a complex one: a real, and a 90°-shifted imaginary.

Converting a dual-boot MBR partitioned disk to GPT

2013-01-09T03:12:00.001-08:00

Background story

I always liked computer games. My first PC was a Pentium 100 MHz machine with very little RAM, which was dedicated almost entirely to a RAM disk. My first precious PC was lacking a hard drive. Every time I wanted to use it, I had to boot up from a floppy and copy software from a few other floppies to the RAM disk. One of my favourite games was UFO: Enemy Unknown. It took a lot of time to boot and copy everything in place, so my machine was on most of the time, and I played a lot on it :) This kinda shows my commitment to PC games.

The famous game was recently rebooted by Firaxis as "XCOM: Enemy Unkown". The game was kinda sluggish on my current setup, so I decided to buy new hardware.

The new setup

Asrock X79 Extreme6 GB motherboard (with UEFI)
INTEL Core i7-3820 3.60GHz LGA-2011 processor
2 Corsair Vengeance 16GB quad kits, 2400 MHz
2 SEAGATE Barracuda ST3000DM001 (3000GB, 7200rpm 64MB cache, SATA3)
New PSU, new chassis, display, graphics card, you name it.

After long debugging, involving a Memtest86+ bug it turned out that two identical RAM kits might not work together on full speed, and one kit is flawed, anyway, RMA in progress.

Always keep two copies. And a few backups too.

I always take backup seriously. Programming and fiddling with hardware and software quickly became a profession of mine. I cannot afford to lose my data.

The old setup had one 2TB fixed HDD and a 640GB HDD that was lying in the drawer, holding backups. Important git repositories was also on the Company's dev server, and in S3 too.

The drawback of this solution is downtime. If the fixed disk (holding the OS-es) would have failed, I would have to reinstall everything.

This is why I decided to buy two identical hard disks. My intention was to keep two copies of the whole system(s), so if one fails, I can always plug in the other. No precious time wasted. A new disk and on overnight dd can restore redundancy.

Moving the data to the new disks

Actually "overnight dd" part was when I discovered that something is terribly wrong with the new rig, and the three day nightmare of memtest86+ debugging and data copying started.

Finally, I managed to copy the 2TB old disk to both 3TB disks. Both could be booted up.

I have two operating systems: an Ubuntu Precise for fun & profit, and a Windows 8 for fun only (yes, games). Both systems could be booted up at this point. I use Grub 2.

Too bad the old disks used old MBR partition tables. This kind of partition table can't handle disks larger than about 2 Terabytes, so at this point I had absolutely no use of the 1 extra terabyte at the end of the disk. What a waste of money.

Converting a disk to GPT

To do this, I booted from one HDD, and left the other one in an eSATA rack.

The conversion can be done with the gdisk (for GPT fdisk) software from ubuntu. It can be installed like this:

apt-get install gdisk

To convert an MS-DOS partition table to GPT, just launch gdisk, and hit "w" to write the new, converted GPT to the disk. The data will remain untouched, however, the disk will become unbootable.

I sorted the partitions so that their entries in the partition table were the same as they appear on the disk, so the partition numbering was messed up beyond recognition. The data is still there, but the OSes could not boot up due to several equally forbidding reasons.

I used gparted tool to resize and move the partitions. Windows got almos 1TB, and Linux got the rest.

Converting a disk from MBR to GPT _and_ having Windows to boot up without a reinstall is pretty much impossible with my little knowledge on Windows. The terrible documentation and limited range of tools makes the situation even worse, so I decided to completely reinstall Windows 8 Pro. I've lost no data, since the other disk, and the old disks held the old Windows partition, and I only used it to play with stuff downloaded from Steam, so everything "important" was also in the cloud.

Installing Windows on GPT with UEFI

Windows can only be installed on GPT if it is booted up in UEFI mode.

I realized that the Windows 8 Pro update disk I've bought is only willing to boot up in legacy (a.k.a. BIOS mode). Darn.

The problem is that the CD is constructed in such a way, that it is almost possible to boot up in UEFI mode. The files are there, but the CD's filesystem does not allow UEFI to boot from it. If it would be FAT, it could boot. The solution is to copy the entrire Windows 8 Pro install cd onto a FAT formatted flasdrive. After doing so, Windows started happily in UEFI mode, and even run faster from the quick little thumbdrive.

Windows need a few extra partitions on GPT to be happy. Actually UEFI itself requires a FAT32 partition to be able to boot (this is why the thumbdrive was needed in the first place).

In the Windows installer, I removed the Windows partition, and told Windows to install itself on the resulting "free space". It figured out the situation, and created the UEFI system partition (the stuff for booting wit the FAT filesystem), a partition for recovery, one MSR for it's private business, and a data partition for the actual filesystem mortals use for their everyday business.

Installation went smoothly, Windows finally booted up form the disk, in UEFI. Now was time to fix Ubuntu.

Installing GRUB 2 in UEFI mode

In Ubuntu, I had to create a /boot/efi mount point, and mount the UEFI system partition, uninstall the old grub2, and install grub-efi.

To install grub-efi, one has to be boot the system in UEFI mode. What?!

This chicken-and-egg problem can be solved with a recent Ubuntu desktop live CD by setting the UEFI setup to boot the DVD in UEFI mode. Yepp, Ubuntu's install CD can boot up in UEFI mode no problem, Windows can't. +1 for Ubuntu again.

To install grub, one has to mount the / partition under somewhere, like /mnt. After chroot-ing into this partition, /proc, /dev, /dev/pts, /sys and the aformentioned /boot partition have to be mounted with mount <mount point> commands. After this, a grub-install writes the necessary files onto the UEFI boot partition, and update-grub writes grub.cfg.

Chainloading the Windows UEFI boot loader from Grub-EFI

To be able to do this, I followed the advice at http://askubuntu.com/questions/193144/dual-boot-uefi-windows-7-and-ubuntu-12-04-both-64-bits-w7-entry-doesnt-appea. Basicly:

grub-probe --target=fs_uuid /boot/efi/efi/Microsoft/Boot/bootmgfw.efi

This will output an UUID. Take a note of this.
Add this to /etc/grub.d/40_custom:

menuentry "Windows 8" {
    search --fs-uuid --no-floppy --set=root UUID
   chainloader (${root})/efi/Microsoft/Boot/bootmgfw.efi
}

And run update-grub after this.

Fixing the boot order

Windows and Ubuntu create entries in the motherboard's NVRAM for their boot managers. You can select the default operation system at the UEFI setup (by quickly pressing something like DEL, or F1, or F2 when the system starts).

However, whenever Windows starts, it makes itself the default operating system in that list. So after every Windows session, Windows will start again by default on the next boot. Nice huh?

No, it isn't. Let's solve this.

According to Microsoft's support forum, it's impossible to make Windows to accept itself other than the first place on the boot list.

The solution is to create a batch file that runs at every bootup, and makes GRUB2 the default entry in the NVRAM.

This can be done by opening up the group policy editor (search this in Metro UI), and navigation to the startup scripts section.

The .bat file should look like this:

bcdedit /set {fwbootmgr} DEFAULT {appropriate UUID}

Where "appropriate UUID" is the UUID in GRUB, and can be read from the list that "bcedit /enum firmware".

So Windows will make itself default, just to undo int a bit later when this file runs. Not a pretty solution, but it works.

ERROR: Pentaho Data Integration (Kettle) process runs twice

2013-01-09T02:27:00.001-08:00

The Problem

I work a lot with Pentaho Data Integration a.k.a Kettle toolkit. For those who don't known: Kettle allows you to build processes with a GUI that can be run in the IDE or from the command line, and reads data, converts and transforms it, then spits it out. It can deal with a lot of databases and various file formats, can invoke shell scripts, can run JavaScript snippets, and perform various conversions and transformations. Very handy when you have to load large, broken CSV files into relational databases just to mention an example.

I recently re-designed one of our processes (or "jobs" in Kettle) when something really strange showed up during testing.

Part of the process run twice. It seemed like I'd duplicated the whole process from some point.

The IDE is basicly allows you to build a graph, the nodes are the process steps, the edges are telling what to do when a node was finished.

Clearly, beside error handling edges, I only draw a single edge from any node. The thing I saw could only happened If I would drew two edges between some of the two nodes. The IDE don't even allow such thing to happen. Or do it?

I tried to disable the edges (called "hops" in Kettle job terminology), to catch where the process goes two ways.

I found out that disabling one of the hops did NOT cause the job to stop at that point - which is the thing it should have done. Instead, the script run correctly, executing it's tail only once.

The Solution

Something was clearly off the rails. In the IDE, I disabled one hop, but the command line tool still thought that there is an enabled hop between the two steps. When I enabled the hop, the command line tool probably saw TWO hops between the same nodes - a thing that is impossible to (intentionally) achieve in the IDE (which is called Spoon btw). I was suspecting file corruption, and/or serious command line tool bug.

Kettle keeps the job description in XML files, therefore they can be opened and easily modified in a simple text editor. So this is exactly what I did: I fired up Geany, and opened the offending job file. This is what I saw:

...
    <hop>
      <from>Set up dimension tables in stage</from>
      <to>transform_fixlogs_to_stage</to>
      <from_nr>0</from_nr>
      <to_nr>0</to_nr>
      <enabled>Y</enabled>
      <evaluation>Y</evaluation>
      <unconditional>N</unconditional>
    </hop>
    <hop>
      <from>Set up dimension tables in stage</from>
      <to>transform_fixlogs_to_stage</to>
      <from_nr>0</from_nr>
      <to_nr>0</to_nr>
      <enabled>Y</enabled>
      <evaluation>Y</evaluation>
      <unconditional>N</unconditional>
    </hop>
...

Yep, there are TWO hops between the same nodes.

Removing one of the hops with the text editor solved the problem, the job now runs correctly.

Bash script that cannot be run more than once at the same time

2012-07-27T02:24:00.001-07:00

People dealing with cron, and bash scripts that might take a bit longer than they're supposed to often encounter the following behaviour.

Suppose that you launch something from cron in say every hour. The stuff usually completes in 10 minutes, but sometimes, when the load peaks, or network clogs, the process is running much slower. After an hour an other one is launched, further hogging the resources of the machine and possibly messing up data.

The solution for this is to pay attention not to run twice of course (and also to fix the underlying problem that causing the slowdowns).

Craig Andrews posted an almos working shell script in his blog (http://candrews.integralblue.com/2009/02/one-instance-at-a-time-with-pid-file-in-bash/)

I hereby shamelessly re-post the snippet. My only excuse is that I've fixed the typo causing the output of kill to be written into a file named 1. Well, I also added a few comments and hints. Anyway. Here it is:

#!/bin/bash

# Replace this with a meaningful, and unique filename
# You probably need root privileges to write /var/run.
# You can use any filename and path here for testing or
# for whatever reason.
pidfile=/var/run/your_solution_name.pid
if [ -e $pidfile ]; then
pid=`cat $pidfile`
if kill -0 2>&1 > /dev/null $pid; then
echo "Already running"
exit 1
else
rm -f $pidfile
fi
fi
echo $$ > $pidfile

# Do your stuff here.
# For testing purposes, we're just gonna
# sleep for 10 seconds. Try opening two
# terminal windows and launch the script
# in both one at the same time.
sleep 10

rm $pidfile

How NOT to write SLOW programs with python and numpy

2012-07-17T05:55:00.001-07:00

~~How to write fast programs with numpy and python~~

At first, I wanted to write a post about "how to write fast programs with python and numpy". After writing a few test cases it quickly turned out that the fast, numpy versions of tests are not only fast because of numpy, but because they use numpy "the right way". Let's see what does it mean.

The problem

I was writing a program that read a sound file, subtracted the right channel from the left (or the other way around, it doesn't matter), and played the result back as a single mono audio stream.

The point was to generate karaoke track out of a "normal" one by stripping the vocals. Since the vocals are often present equally and in-phase on both channels, after the subtraction no vocals was heard. Most of the time. This method has a drawback of ruining much of the other sounds in a track, but at that point I could live with that.

The slow solution

# -*- coding: utf-8 -*-

import wave
import pyaudio
import numpy

f = wave.open('e.wav')

samples = numpy.fromstring(f.readframes(
    f.getnframes()), dtype=numpy.int16)

print "Removing voice..."
S = numpy.empty(len(samples)/2, dtype=numpy.int16)
for i in xrange(len(samples)/2):
    S[i] = samples[2*i]/2 - samples[2*i+1]/2

f.close()

print "Playing..."

p = pyaudio.PyAudio()

stream = p.open(
    output_device_index = 0, format = pyaudio.paInt16,
    channels = 1, rate = 44100, input = False,
    output = True, frames_per_buffer = 4410
)

for i in xrange(len(samples)/44100):
    stream.write(S[i*4410:(i+1)*4410].tostring())

stream.close()

The fast solution

The infinitely faster solution only differed in a few lines after the line that prints "Removing voice...":

print "Removing voice..."
S = samples[0::2]/2 - samples[1::2]/2

print "Playing..."

Digging deeper

Using the built-in compile() function, and the wonderful dis module, let's examine what's happening in both cases.

Slow case:

>>> c = compile("for i in xrange(len(samples)): S[i] = samples[2*i]/2 - samples[2*i+1]/2", "dummy", "exec")
>>> dis.dis(c)
  1           0 SETUP_LOOP              68 (to 71)
              3 LOAD_NAME                0 (xrange)
              6 LOAD_NAME                1 (len)
              9 LOAD_NAME                2 (samples)
             12 CALL_FUNCTION            1
             15 CALL_FUNCTION            1
             18 GET_ITER            
        >>   19 FOR_ITER                48 (to 70)
             22 STORE_NAME               3 (i)
             25 LOAD_NAME                2 (samples)
             28 LOAD_CONST               0 (2)
             31 LOAD_NAME                3 (i)
             34 BINARY_MULTIPLY     
             35 BINARY_SUBSCR       
             36 LOAD_CONST               0 (2)
             39 BINARY_DIVIDE       
             40 LOAD_NAME                2 (samples)
             43 LOAD_CONST               0 (2)
             46 LOAD_NAME                3 (i)
             49 BINARY_MULTIPLY     
             50 LOAD_CONST               1 (1)
             53 BINARY_ADD          
             54 BINARY_SUBSCR       
             55 LOAD_CONST               0 (2)
             58 BINARY_DIVIDE       
             59 BINARY_SUBTRACT     
             60 LOAD_NAME                4 (S)
             63 LOAD_NAME                3 (i)
             66 STORE_SUBSCR        
             67 JUMP_ABSOLUTE           19
        >>   70 POP_BLOCK           
        >>   71 LOAD_CONST               2 (None)
             74 RETURN_VALUE

Fast case:

>>> import dis
>>> c = compile("S = samples[2::0]/2 - samples[2::1]/2", "dummy", "exec")
>>> dis.dis(c)
  1           0 LOAD_NAME                0 (samples)
              3 LOAD_CONST               0 (2)
              6 LOAD_CONST               1 (None)
              9 LOAD_CONST               2 (0)
             12 BUILD_SLICE              3
             15 BINARY_SUBSCR       
             16 LOAD_CONST               0 (2)
             19 BINARY_DIVIDE       
             20 LOAD_NAME                0 (samples)
             23 LOAD_CONST               0 (2)
             26 LOAD_CONST               1 (None)
             29 LOAD_CONST               3 (1)
             32 BUILD_SLICE              3
             35 BINARY_SUBSCR       
             36 LOAD_CONST               0 (2)
             39 BINARY_DIVIDE       
             40 BINARY_SUBTRACT     
             41 STORE_NAME               1 (S)
             44 LOAD_CONST               1 (None)
             47 RETURN_VALUE

There are differences that are quite obvious at first sight. The slow version has a loop that executes a bunch of operations like BINARY_ADD, BINARY_DIVIDE, etc. in each iteration, so there's O(n) python-related stuff to execute.

In the fast case, there are a fixed amount code to run, so we only have O(1) python overhead. The lion's share is written in C. Although it's easy to write slow code in C, numpy is certainly not slow.

The conclusion is that if you're programming with numpy, you're better avoid iterating over numpy arrays and doing stuff on them. Numpy has a great many built-in functions that cover everything came into my mind since I've started working with it. Reinventing the wheel is highly counterproductive.

Thrift / c_glib and Cassandra

2012-01-24T14:01:00.000-08:00

Thrift

Thrift is apache's tool. It can generate client / server codes based on a file written in it's own descriptor language.

At first I was thrilled how easy it'll be to write a Cassandra client with it: "you just have to generate the C files, #include them, call a few functions and it's done".

Yeah. Like anything in the world works like that. And this particular thing is no exception.

Thrift comes with a documentation that is... wait! It doesn't really comes with any documentation at all. The stuff that's in the package and / or scattered on the Net in the form of blog posts and bug reports is outdated and only can be used to prevent the enemy from using this great weapon.

My last expression isn't a sarcastic one, thrift would be great if I could wield it correctly.

Cassandra

Cassandra is noSQL database. It doesn't really matter now how it works exactly, it's enough if you know that one can store and fetch data with it, and can connect to it over the network.

Coincidentally, it uses Thrift to describe it's interface, so people of different sex, religion and programming language can generate their own interface libraries. First, I tried to put together a client in C++ based on this article. Cassandra, Thirft and gcc evolved somewhat since 2010, and / or I might be using an exotic combination of software (Ubuntu Oneiric), or the Gods might be angry at me for some strange reason, or I may be simply too dumb to follow a bit outdated tutorial solving a few problems along the way; anyway I could not get the code compiled.

C and GLib

I have much more experience with C than C++, so I decided to throw the C++ code away my co-worker has been writing, and start from scratch with C. I was prepared to read and interpret the Thrift interface descriptor file with my already melting brain, and write the C code myself.

As I started to work I discovered that Thrift CAN generate C interface libraries. It is a bit incomplete in 0.8.0, since it does not generate the server skeleton file; it didn't really matter for me.

I cd'd into Cassandra's interface directory and issued thrift -gen c_glib cassandra.thrift command, just to find the generated sources under gen-c_glib directory.

The sources was clean and readable despite the fact that they were auto-generated.

I even found a small example, and it compiled OK.

I had to replace a few lines, to work with Cassandra instead of the calculator example. The following is the re-write for Cassandra, with connecting to a server on localhost on the default port, and executing a query that fetches a value from keyspace "example", column family "examplecf", with key "foo", from the "bar" column. Error handling might be incomplete.

Warning: I don't know a thing a about glib, and I suspect that the code below is NOT the way to use it. It works here though. I maybe will improve it in the distant future.

#include 
#include 

#include "gen-c_glib/cassandra.h"
#include "protocol/thrift_protocol.h"
#include "protocol/thrift_binary_protocol.h"
#include "transport/thrift_framed_transport.h"
#include "transport/thrift_transport.h"
#include "transport/thrift_socket.h"

#include "gen-c_glib/cassandra.h"

int main(int argc, char** argv) {
  ThriftSocket *tsocket;
  ThriftTransport *transport;
  ThriftProtocol *protocol;
  CassandraClient *client;
  CassandraIf *service;
  InvalidRequestException *ire = NULL;
  NotFoundException *nfe = NULL;
  UnavailableException *ue = NULL;
  TimedOutException *te = NULL;
  ColumnOrSuperColumn *result;
  GError *error = NULL;

  GByteArray column = {
    .data = (unsigned char *)"bar",
    .len  = 3
  };

  ColumnPath *cp = NULL;
  
  GByteArray key = {
    .data = (unsigned char *)"foo",
    .len  = 3
  };
 
  g_type_init();

  tsocket = THRIFT_SOCKET(
    g_object_new(
      THRIFT_TYPE_SOCKET, "hostname",
      "localhost", "port", 9160, 0
    )
  );
  transport = THRIFT_TRANSPORT(
    g_object_new(
      THRIFT_TYPE_FRAMED_TRANSPORT, "transport", tsocket, 0
    )
  );
  protocol = THRIFT_PROTOCOL(
    g_object_new(
      THRIFT_TYPE_BINARY_PROTOCOL, "transport", transport, 0
    )
  );
  client = CASSANDRA_CLIENT(
    g_object_new(
      TYPE_CASSANDRA_CLIENT, "input_protocol",
      protocol, "output_protocol", protocol, 0
    )
  );
  service = CASSANDRA_IF(client);

  if (
    !thrift_transport_open(transport, 0) ||
    !thrift_transport_is_open(transport)
  ) {
          printf("Could not connect to server\n");
          return 1;
  }
  printf("Connected to cassandra at localhost:9160\n");

  cassandra_client_set_keyspace(
    service, "example", &ire, &error
  );
  if (ire) {
    printf("Invalid request exception: %s\n", ire->why);
    return 1;
  }
  if (error) {
    printf("An error has occured\n");
    return 1;
  }
  printf("Selected keyspace example\n");

  cp = g_object_new(TYPE_COLUMN_PATH, 0);
  cp->column_family = "examplecf";
  cp->column = &column;
  cp->__isset_column = TRUE;

  cassandra_client_get(
    service, &result, &key, cp, CONSISTENCY_LEVEL_QUORUM,
    &ire, &nfe, &ue, &te, &error
  );

  if (ire) {
    printf("Invalid request exception: %s\n", ire->why);
    return 1;
  }
  if (nfe) {
    printf("Row not found\n");
    return 1;
  }
  if (ue) {
    printf("Unavailable exception\n");
    return 1;
  }
  if (te) {
    printf("Timed out exception\n");
    return 1;
  }
  if (error) {
    printf("An error has occured\n");
    return 1;
  }
  
  printf(
    "The result is %s\n",
    strndup(
      (char *)result->column->value->data,
      result->column->value->len
    )
  );

  /* Don't forget to free resources if
   * your program runs longer than this */

  return 0;
}

I compiled thecode with the following commands:

gcc -c `pkg-config --cflags thrift_c_glib` test.c -o test.o

gcc -c `pkg-config --cflags thrift_c_glib`\

gen-c_glib/cassandra.c -o cassandra.o

gcc -c `pkg-config --cflags thrift_c_glib`\

 gen-c_glib/cassandra_types.c -o cassandra_types.o 

libtool --tag=CC --mode=link gcc `pkg-config --libs thrift_c_glib` -o test test.o cassandra.o cassandra_types.o

The last command is even more cryptic then the others, so here's the explanation:

The pkg-config command is used to query for compilation flags of program that use installed libraries. It's the library's make install script's responsibility to install this info. If a package is installed from the repository of your distribution, this information is installed by the package manager. The rest of the command line should be clear.

UPDATE:

Note that the key, column name and value does NOT contain the trailing zero byte.