HOWTO use NS LOG to find a problem

Main Page - Roadmap - Summer Projects - Project Ideas - Developer FAQ - Tools - Related Projects

HOWTOs - Installation - Troubleshooting - User FAQ - Samples - Models - Education - Contributed Code - Papers

One of the easiest ways a person can narrow down the location of a problem in ns-3 is by using NS_LOG and our predefined logging components. Typically one uses NS_LOG in a case where the exact nature of the problem is not yet understood. Logging provides a way to narrow down the problem location to an extent where either the log message will indicate the source of the problem orit will be easy to fire up the script in a debugger and take a closer look.

A tried and true method for quickly locating a problem is via binary search. You can use binary search to locate a bug just as easily as you can use binary search to locate an item in the usual sense. In the case of debugging a network simulation script, you will usually first consider the end-to-end path of a packet and then take a look at what is happening half-way. If the problem is visible at the half-way point, you think of what is half-way to that point and take a look there. If the problem is not manifested at the halfway point, you take a look at what is happening half-way again towards the end point. Using this technique, you can quickly narrow down the location of a problem.

Of course, you can do this in a debugger just as easily as with NS_LOG if you know exactly where to set your breakpoints a priori. An interesting feature of NS_LOG is that you can really do the same thing without knowing what is going on other than in a general network knowledge sense.

In this HOWTO, I demonstrate how one could use the binary search technique to locate a hypothetical problem in the our first.cc example. As a bonus, look at the end for how to use NS_LOG as a learning tool for understanding the system.

HOWTO use NS LOG to find a problem

The first.cc example uses an echo client on a single client node and an echo server on a single server node. It orchesrates an echo across a point-to-point link. A moment's thought will tell you that the halfway point in this simulation is at the point-to-point link. The ns-3 model is that there is a net device and a channel so you will want to enable logging for the point-to-point channel.

If you know something about the structure of the source, you'll just know to look in

 src/point-to-point/model/point-to-point-channel.cc

for the NS_LOG_COMPONENT_DEFINE of the logging component. If you don't have that kind of information rattling around in your brain you can just go to the top level directory and do

 find . -name '*.cc' | xargs grep NS_LOG_COMPONENT_DEFINE

This might seem like brute force and awkwardness, but you'll only find a couple of pages of them. If you really don't want to wade through the two pages, you can also narrow the search

 find src/point-to-point -name '*.cc' | xargs grep NS_LOG_COMPONENT_DEFINE | grep -i point

and you'll currently see

  src/point-to-point/helper/point-to-point-helper.cc:NS_LOG_COMPONENT_DEFINE("PointToPointHelper");
  src/point-to-point/examples/main-attribute-value.cc:NS_LOG_COMPONENT_DEFINE("AttributeValueSample");
  src/point-to-point/model/point-to-point-channel.cc:NS_LOG_COMPONENT_DEFINE("PointToPointChannel");
  src/point-to-point/model/ppp-header.cc:NS_LOG_COMPONENT_DEFINE("PppHeader");
  src/point-to-point/model/point-to-point-remote-channel.cc:NS_LOG_COMPONENT_DEFINE("PointToPointRemoteChannel");
  src/point-to-point/model/point-to-point-net-device.cc:NS_LOG_COMPONENT_DEFINE("PointToPointNetDevice");

You can easily see that the log component to enable is, not too surprisingly, PointToPointChannel.

Go ahead and enable the log component and run the program to see what's happening.

 export NS_LOG=PointToPointChannel
 ./ns3 run first

You'll immediately see that the packet made it down to the channel from the client.

 +2.003686400s 1 PointToPointChannel:TransmitStart(0x5607a7e72250, 0x5607a7ea1520, 0x5607a7e9c410)
 +2.003686400s 1 PointToPointChannel:TransmitStart(): [LOGIC] UID is 0)

Sidebar:

Once you know the name of the method called, it is nice to be able to find a description of the function you've seen. This is easy to do in our Doxygen. Take a look at https://www.nsnam.org/docs/release/3.40/doxygen/index.html (if you are working with a released version) or https://www.nsnam.org/doxygen/index.html (if you are working with the development version). Expand Class List in the navigation pane on the left side of the page. You will see a bunch of classes. Search for ns3::PointToPointChannel and click on the link. You will have linked to the class reference page for the PointToPointChannel. If you look at the Public Member Functions documentation on this page you will see TransmitStart, which tells you that this method is called to transmit a packet over this channel.

Just interpreting the name found in the log as a class name and method name often works, but notice that the function call log printed in the snippet above looks like a class name followed by a single colon and then a method name and parameters. This is not a typo. The single colon is easily overlooked but has a significant meaning. The single colon separator means that what appears to be a class name is actually a log component name. In ns-3 these are often the same thing, but not necessarily. To find the code for TransmitStart you could use the recursive find trick looking for PointToPointChannel::TransmitStart (note the double colon namespace separator in the search term) and you would find it in this case. This is just because the class name and the log component name are identical. Generally that's the easiest thing to do first as was done above. However, if you don't find it this way you will have to search for the log component name and then search for the method name in the .cc file you find. This will give you the real class name and you can use that class name to find the method documentation in doxygen.

If you haven't yet found the documentation for the logging component itself, go ahead and expand Modules and then Core and then Debugging in the navigation pane of the ns-3 doxygen, and then select Logging to go to the low-level documentation. If you are unfamilar with this and are interested in a tutorial introduction to logging, there is a section in the ns-3 tutorial (http://www.nsnam.org/docs/release/tutorial.html) called Using the Logging Module which you should read.

Now back to the main thread ...

So, now you can infer that a packet has made it from the echo client application down through the protocol stack on the client node, into the net device and has begun being transmitted to the server over the channel. Now, think to yourself, what's halfway up the protocol stack on the server side? Well, how about UDP. Again, the question is, what log component to turn on? The answer if you don't know anything about what's happening in the system is grep, but this time use your general networking knowledge and look for udp.

 find src -name '*.cc' | xargs grep NS_LOG_COMPONENT_DEFINE | grep -i udp

You'll see that we have the following log components containing the case-insensitive string udp:

 src/applications/model/udp-client.cc:NS_LOG_COMPONENT_DEFINE("UdpClient");
 src/applications/model/udp-echo-server.cc:NS_LOG_COMPONENT_DEFINE("UdpEchoServerApplication");
 src/applications/model/udp-server.cc:NS_LOG_COMPONENT_DEFINE("UdpServer");
 src/applications/model/udp-echo-client.cc:NS_LOG_COMPONENT_DEFINE("UdpEchoClientApplication");
 src/applications/model/udp-trace-client.cc:NS_LOG_COMPONENT_DEFINE("UdpTraceClient");
 src/fd-net-device/examples/fd-emu-udp-echo.cc:NS_LOG_COMPONENT_DEFINE("EmulatedUdpEchoExample");
 src/click/examples/nsclick-udp-client-server-wifi.cc:NS_LOG_COMPONENT_DEFINE("NsclickUdpClientServerWifi");
 src/click/examples/nsclick-udp-client-server-csma.cc:NS_LOG_COMPONENT_DEFINE("NsclickUdpClientServerCsma");
 src/internet/model/udp-l4-protocol.cc:NS_LOG_COMPONENT_DEFINE("UdpL4Protocol");
 src/internet/model/udp-socket.cc:NS_LOG_COMPONENT_DEFINE("UdpSocket");
 src/internet/model/udp-socket-impl.cc:NS_LOG_COMPONENT_DEFINE("UdpSocketImpl");

You can really play this by ear. How about turning on UdpL4Protocol. You know what an L4 protocol is, right? UdpSocketImpl or UdpSocket sound kindof high in the stack, so as a wild guess just turn on UdpL4Protocol.

 export NS_LOG=UdpL4Protocol
 ./ns3 run first

Among other things, you'll see

 At time +2.00369s server sent 1024 bytes to 10.1.1.1 port 49153
 +2.007372800s 0 UdpL4Protocol:Receive(0x5626c1862630, 0x5626c185b7a0, tos 0x0 DSCP Default ECN Not-ECT ttl 64 id 0 protocol 17 offset (bytes) 0 flags [none] length: 1052 10.1.1.2 > 10.1.1.1)

This tells you that a packet from 10.1.1.1 to 10.1.1.2 has made it as far as the UdpL4Protocol::Receive method (which you might not have known existed) and also gives you a method name that you can search for and begin seeing what is happening. Again, grep is your friend

 find src -name '*.cc' | xargs grep UdpL4Protocol::Receive

and you will see

 src/internet/model/udp-l4-protocol.cc:UdpL4Protocol::Receive(Ptr<Packet> packet, const Ipv4Header& header, Ptr<Ipv4Interface> interface)

Now you have a file and a method so your knowledge of what is happening is increasing. You can spend some time perusing the file if you like just to get a handle on what is happening at this poin in the stack.

Focusing at the problem at hand, you know know that the packet has made it about three quarters of the way from the client to the server. The find and grep for udp above tells you that you really only have the udp socket left between you and the server application. The UdpSocketImpl component sounds like a good place to look, so turn it on.

 export NS_LOG=UdpSocketImpl
 ./ns3 run first

You will see

 +2.007372800s 0 UdpSocketImpl:ForwardUp(0x5573c5d8dfc0, 0x5573c5d87380, tos 0x0 DSCP Default ECN Not-ECT ttl 64 id 0 protocol 17 offset (bytes) 0 flags [none] length: 1052 10.1.1.2 > 10.1.1.1, 9)
 +2.007372800s 0 UdpSocketImpl:RecvFrom(0x5573c5d8dfc0, 4294967295, 0)
 At time +2.00737s client received 1024 bytes from 10.1.1.2 port 9

The socket gets the packet from 10.1.1.1 and then forwards it up the stack and eventually wants to call the recv function which eventually gives the server application the data.

Anyway, I think you get the picture. Binary search for problems using NS_LOG. While you are doing the binary search, look at what is being displayed to help you understand what is going on in the stack. Use grep and find to locate the methods you see and then go read the code to get a better understanding of the system. If you want to understand what *should* be happening, do this exercise using one of the examples and watch the packet flow across a similar case and then go back to your code and see where it diverges from what you saw in the example.

Once you have run the binary search down to the finest granularity using NS_LOG, you can fire up your debugger and zero right in on the problem to within a few method calls. Of course, in many cases, something you did or did not do becomes obvious and you just go fix up your script and move on never having had to fire up gdb or ddd at all.

When everything else fails

When everything else fails, namely, when you have absolutely no clue what is the component that makes the whole program stop, you can try and turn on all the logging at once. To do this simply set the NS_LOG environment variable as follows:

export 'NS_LOG=*=level_all|prefix_func|prefix_time'

The * wildcard expands to all the LogComponents defined in ns-3, while level_all is the equivalent of LOG_LEVEL_ALL in the main simulation script. As a consequence, it is quite clear that this represents an extreme measure that produces tons of output, and as such, should be used as a last resort.

HOWTO use NS LOG as a learning tool

You can also use NS_LOG as a learning tool to dig down into the system. If you want to figure out more about ns-3 routing, one way is to find some routing log components and find out where $#!^ happens.

 find src -name '*.cc' | xargs grep NS_LOG_COMPONENT_DEFINE | grep -i routing

 src/aodv/model/aodv-rtable.cc:NS_LOG_COMPONENT_DEFINE("AodvRoutingTable");
 src/aodv/model/aodv-routing-protocol.cc:NS_LOG_COMPONENT_DEFINE("AodvRoutingProtocol");
 src/dsr/model/dsr-routing.cc:NS_LOG_COMPONENT_DEFINE("DsrRouting");
 src/olsr/model/olsr-routing-protocol.cc:NS_LOG_COMPONENT_DEFINE("OlsrRoutingProtocol");
 src/mesh/model/mesh-l2-routing-protocol.cc:NS_LOG_COMPONENT_DEFINE("MeshL2RoutingProtocol");
 src/click/examples/nsclick-routing.cc:NS_LOG_COMPONENT_DEFINE("NsclickRouting");
 src/click/examples/nsclick-defines.cc:NS_LOG_COMPONENT_DEFINE("NsclickRouting");
 src/click/model/ipv4-click-routing.cc:NS_LOG_COMPONENT_DEFINE("Ipv4ClickRouting");
 src/internet/helper/ipv6-static-routing-helper.cc:NS_LOG_COMPONENT_DEFINE("Ipv6StaticRoutingHelper");
 src/internet/helper/ipv4-static-routing-helper.cc:NS_LOG_COMPONENT_DEFINE("Ipv4StaticRoutingHelper");
 src/internet/helper/ipv4-global-routing-helper.cc:NS_LOG_COMPONENT_DEFINE("GlobalRoutingHelper");
 src/internet/test/ipv4-global-routing-test-suite.cc:NS_LOG_COMPONENT_DEFINE("Ipv4GlobalRoutingTestSuite");
 src/internet/model/ipv4-routing-table-entry.cc:NS_LOG_COMPONENT_DEFINE("Ipv4RoutingTableEntry");
 src/internet/model/ipv6-list-routing.cc:NS_LOG_COMPONENT_DEFINE("Ipv6ListRouting");
 src/internet/model/ipv4-static-routing.cc:NS_LOG_COMPONENT_DEFINE("Ipv4StaticRouting");
 src/internet/model/ipv4-global-routing.cc:NS_LOG_COMPONENT_DEFINE("Ipv4GlobalRouting");
 src/internet/model/ipv6-static-routing.cc:NS_LOG_COMPONENT_DEFINE("Ipv6StaticRouting");
 src/internet/model/ipv4-list-routing.cc:NS_LOG_COMPONENT_DEFINE("Ipv4ListRouting");
 src/internet/model/ipv4-routing-protocol.cc:NS_LOG_COMPONENT_DEFINE("Ipv4RoutingProtocol");
 src/nix-vector-routing/examples/nix-simple.cc:NS_LOG_COMPONENT_DEFINE("NixSimpleExample");
 src/nix-vector-routing/examples/nms-p2p-nix.cc:NS_LOG_COMPONENT_DEFINE("CampusNetworkModel");
 src/nix-vector-routing/examples/nix-double-wifi.cc:NS_LOG_COMPONENT_DEFINE("NixDoubleWifiExample");
 src/nix-vector-routing/examples/nix-simple-multi-address.cc:NS_LOG_COMPONENT_DEFINE("NixSimpleMultiAddressExample");
 src/nix-vector-routing/model/nix-vector-routing.cc:NS_LOG_COMPONENT_DEFINE("NixVectorRouting");
 src/dsdv/model/dsdv-routing-protocol.cc:NS_LOG_COMPONENT_DEFINE("DsdvRoutingProtocol");
 src/dsdv/model/dsdv-rtable.cc:NS_LOG_COMPONENT_DEFINE("DsdvRoutingTable");

Look in examples for a file with the root of the word "routing" in its name.

 find examples -name '*.cc' | xargs grep routing

As of this writing, you'll see 205 matches.

Why not take a look at something with simple in its name too. It seems reasonable that the log component Ipv4GlobalRouting will have something to do with the example file simple-global-routing.cc doesn't it? See what happens:

 export NS_LOG=Ipv4GlobalRouting
 ./ns3 run simple-global-routing

When you run this, you will get lots and lots of log messages. You can redirect the run output to a file and you can just start poking around.

 ./ns3 run simple-global-routing > log-output.txt 2>&1

Take a look at some output. This is interesting setup information

 +0.000000000s -1 Ipv4GlobalRouting:AddNetworkRouteTo(0x56123b8f0060, 0.0.0.0, 0.0.0.0, 10.1.1.2, 1)
 +0.000000000s -1 Ipv4GlobalRouting:AddNetworkRouteTo(0x56123b90c880, 0.0.0.0, 0.0.0.0, 10.1.2.2, 1)
 +0.000000000s -1 Ipv4GlobalRouting:AddHostRouteTo(0x56123b90f380, 10.1.1.1, 10.1.1.1, 1)
 +0.000000000s -1 Ipv4GlobalRouting:AddHostRouteTo(0x56123b90f380, 10.1.2.1, 10.1.2.1, 2)
 +0.000000000s -1 Ipv4GlobalRouting:AddHostRouteTo(0x56123b90f380, 10.1.3.1, 10.1.3.1, 3)
 +0.000000000s -1 Ipv4GlobalRouting:AddNetworkRouteTo(0x56123b90f380, 10.1.1.0, 255.255.255.0, 10.1.1.1, 1)
 +0.000000000s -1 Ipv4GlobalRouting:AddNetworkRouteTo(0x56123b90f380, 10.1.2.0, 255.255.255.0, 10.1.2.1, 2)
 +0.000000000s -1 Ipv4GlobalRouting:AddNetworkRouteTo(0x56123b90f380, 10.1.3.0, 255.255.255.0, 10.1.3.1, 3)
 +0.000000000s -1 Ipv4GlobalRouting:AddNetworkRouteTo(0x56123b90ce60, 0.0.0.0, 0.0.0.0, 10.1.3.2, 1)

Further down notice that the timestamps seem to be grouped.

 +8.002857035s 3 Ipv4GlobalRouting:LookupGlobal(0x56123b90ce60, 10.1.2.1, 0)
 +8.002857035s 3 Ipv4GlobalRouting:LookupGlobal(): [LOGIC] Looking for route for destination 10.1.2.1
 +8.002857035s 3 Ipv4GlobalRouting:LookupGlobal(): [LOGIC] Number of m_hostRoutes = 0
 +8.002857035s 3 Ipv4GlobalRouting:LookupGlobal(): [LOGIC] Number of m_networkRoutes1
 +8.002857035s 3 Ipv4GlobalRouting:LookupGlobal(): [LOGIC] 1Found global network route0x56123b9046b0

You can probably already get a sense for what must be going on. You can now go and find the code using grep.

 find src -name '*.cc' | xargs grep Ipv4GlobalRouting::LookupGlobal
 src/internet/model/ipv4-global-routing.cc:Ipv4GlobalRouting::LookupGlobal(Ipv4Address dest, Ptr<NetDevice> oif)

tells you where to look for some of the global routing-related code. From the log messages you can probably infer the basic operation before you even go look at the code.

You can also see (in the original find) that there are other components GlobalRouteManager and GlobalRouter -- you can turn on those log components to see what is happening there. Armed with some of this basic contextural information you can also look in the manual where you will find a routing chapter with detailed descriptions of some of the methods you've seen already. Using NS_LOG you can see the API described in the manual at work and follow what is happening in real code.

Anyway, NS_LOG is really a very powerful tool. Most people tend to under-appreciate it since they learned in programming 101 that debugging with printfs is for kids. Don't kid yourself, though. Use every tool available to you. Debugging with printfs in an intelligent way can make your life much easier.

Craigdo 02:39, 14 May 2009 (UTC)

HOWTO use NS LOG to find a problem

HOWTO use NS LOG to find a problem

When everything else fails

HOWTO use NS LOG as a learning tool

Navigation menu

Search