Firewalls are an important tool that can be configured to protect your servers and infrastructure. In the Linux ecosystem, iptables
is a widely used firewall tool that works with the kernel’s netfilter
packet filtering framework. Creating reliable firewall policies can be daunting, due to complex syntax and the number of interrelated parts involved.
In this guide, we will dive into the iptables
architecture with the aim of making it more comprehensible for users who need to build their own firewall policies. We will discuss how iptables
interacts with netfilter
and how the various components fit together to provide a comprehensive filtering system.
For many years, the firewall software most commonly used in Linux was called iptables. In some distributions, it has been replaced by a new tool called nftables, but iptables syntax is still commonly used as a baseline. The iptables firewall works by interacting with the packet filtering hooks in the Linux kernel’s networking stack. These kernel hooks are known as the netfilter framework.
Every packet that passes through the networking layer (incoming or outgoing) will trigger these hooks, allowing programs to interact with the traffic at key points. The kernel modules associated with iptables
register with these hooks in order to ensure that the traffic conforms to the conditions laid out by the firewall rules.
There are five netfilter hooks that programs can register with. As packets progress through the stack, they will trigger the kernel modules that have registered with these hooks. The hooks that a packet will trigger depend on whether the packet is incoming or outgoing, the packet’s destination, and whether the packet was dropped or rejected at a previous point.
The following hooks represent these well-defined points in the networking stack:
- NF_IP_PRE_ROUTING: This hook will be triggered by any incoming traffic very soon after entering the network stack. This hook is processed before any routing decisions have been made regarding where to send the packet.
- NF_IP_LOCAL_IN: This hook is triggered after an incoming packet has been routed if the packet is destined for the local system.
- NF_IP_FORWARD: This hook is triggered after an incoming packet has been routed if the packet is to be forwarded to another host.
- NF_IP_LOCAL_OUT: This hook is triggered by any locally created outbound traffic as soon as it hits the network stack.
- NF_IP_POST_ROUTING: This hook is triggered by any outgoing or forwarded traffic after routing has taken place and just before being sent out on the wire.

Kernel modules that need to register at these hooks must also provide a priority number to help determine the order in which they will be called when the hook is triggered. This provides the means for multiple modules (or multiple instances of the same module) to be connected to each of the hooks with deterministic ordering. Each module will be called in turn and, after processing, will return a decision to the netfilter framework that indicates what should be done with the packet.
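The priority mechanism described above can be sketched as a small Python model. This is a simplified illustration, not kernel code: the `Hook` class and its methods are hypothetical names, and the verdicts are reduced to two strings. The priority numbers mirror the kernel's `NF_IP_PRI_*` constants, where lower values are called first.

```python
# Simplified model of netfilter hook registration; not kernel code.
# The numeric values mirror the kernel's NF_IP_PRI_* constants.
NF_IP_PRI_RAW = -300
NF_IP_PRI_MANGLE = -150
NF_IP_PRI_NAT_DST = -100

# Real netfilter verdicts include more cases; two are enough for this sketch.
ACCEPT, DROP = "ACCEPT", "DROP"


class Hook:
    """One hook point holding callbacks ordered by priority (hypothetical class)."""

    def __init__(self, name):
        self.name = name
        self._callbacks = []  # entries are (priority, label, function)

    def register(self, priority, label, func):
        self._callbacks.append((priority, label, func))
        # Lower numbers run first; sorting only on the priority keeps it stable.
        self._callbacks.sort(key=lambda entry: entry[0])

    def run(self, packet):
        """Call each callback in turn and stop at the first DROP verdict."""
        visited = []
        for _priority, label, func in self._callbacks:
            visited.append(label)
            if func(packet) == DROP:
                return DROP, visited
        return ACCEPT, visited


pre_routing = Hook("NF_IP_PRE_ROUTING")
# Registration order does not matter; the priority decides the call order.
pre_routing.register(NF_IP_PRI_NAT_DST, "nat", lambda pkt: ACCEPT)
pre_routing.register(NF_IP_PRI_RAW, "raw", lambda pkt: ACCEPT)
pre_routing.register(
    NF_IP_PRI_MANGLE, "mangle", lambda pkt: DROP if pkt.get("bad") else ACCEPT
)

print(pre_routing.run({"bad": False}))
print(pre_routing.run({"bad": True}))
```

In this model a dropped packet never reaches the callbacks registered at later priorities, which matches the idea that each module returns a decision before the next one is consulted.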
The iptables
firewall uses tables to organize its rules. These tables classify rules according to the type of decisions they are used to make. For instance, if a rule deals with network address translation, it will be put into the nat
table. If the rule is used to decide whether to allow the packet to continue to its destination, it would probably be added to the filter
table.
Within each iptables
table, rules are further organized within separate “chains”. While tables are defined by the general aim of the rules they hold, the built-in chains represent the netfilter
hooks which trigger them. Chains determine when rules will be evaluated.
The names of the built-in chains mirror the names of the netfilter
hooks they are associated with:
- PREROUTING: Triggered by the NF_IP_PRE_ROUTING hook.
- INPUT: Triggered by the NF_IP_LOCAL_IN hook.
- FORWARD: Triggered by the NF_IP_FORWARD hook.
- OUTPUT: Triggered by the NF_IP_LOCAL_OUT hook.
- POSTROUTING: Triggered by the NF_IP_POST_ROUTING hook.

Chains allow the administrator to control where in a packet’s delivery path a rule will be evaluated. Since each table has multiple chains, a table’s influence can be exerted at multiple points in processing. Because certain types of decisions only make sense at certain points in the network stack, not every table has a chain registered with each kernel hook.
There are only five netfilter kernel hooks, so chains from multiple tables are registered at each of the hooks. For instance, three tables have PREROUTING chains. When these chains register at the associated NF_IP_PRE_ROUTING hook, they specify a priority that dictates the order in which each table’s PREROUTING chain is called. Each of the rules inside the highest priority PREROUTING chain is evaluated sequentially before moving on to the next PREROUTING chain. We will take a look at the specific order of each chain in a moment.
Let’s step back for a moment and take a look at the different tables that iptables
provides. These represent distinct sets of rules, organized by area of concern, for evaluating packets.
The filter table is one of the most widely used tables in iptables. The filter table is used to make decisions about whether to let a packet continue to its intended destination or to deny its request. In firewall parlance, this is known as “filtering” packets. This table provides the bulk of functionality that people think of when discussing firewalls.
The nat
table is used to implement network address translation rules. As packets enter the network stack, rules in this table will determine whether and how to modify the packet’s source or destination addresses in order to impact the way that the packet and any response traffic are routed. This is often used to route packets to networks when direct access is not possible.
The mangle
table is used to alter the IP headers of the packet in various ways. For instance, you can adjust the TTL (Time to Live) value of a packet, either lengthening or shortening the number of valid network hops the packet can sustain. Other IP headers can be altered in similar ways.
This table can also place an internal kernel “mark” on the packet for further processing in other tables and by other networking tools. This mark does not touch the actual packet, but adds the mark to the kernel’s representation of the packet.
The iptables firewall is stateful, meaning that packets are evaluated with regard to their relation to previous packets. The connection tracking features built on top of the netfilter framework allow iptables to view packets as part of an ongoing connection or session instead of as a stream of discrete, unrelated packets. The connection tracking logic is usually applied very soon after the packet hits the network interface.
The raw table has a very narrowly defined function. Its only purpose is to provide a mechanism for marking packets in order to opt out of connection tracking.
The security
table is used to set internal SELinux security context marks on packets, which will affect how SELinux or other systems that can interpret SELinux security contexts handle the packets. These marks can be applied on a per-packet or per-connection basis.
If three tables have PREROUTING
chains, in which order are they evaluated?
The following table indicates the chains that are available within each iptables
table when read from left-to-right. For instance, we can tell that the raw
table has both PREROUTING
and OUTPUT
chains. When read from top-to-bottom, it also displays the order in which each chain is called when the associated netfilter
hook is triggered.
A few things should be noted. In the representation below, the nat
table has been split between DNAT
operations (those that alter the destination address of a packet) and SNAT
operations (those that alter the source address) in order to display their ordering more clearly. We have also included rows that represent points where routing decisions are made and where connection tracking is enabled in order to give a more holistic view of the processes taking place:
| Tables↓/Chains→ | PREROUTING | INPUT | FORWARD | OUTPUT | POSTROUTING |
|---|---|---|---|---|---|
| (routing decision) | | | | ✓ | |
| raw | ✓ | | | ✓ | |
| (connection tracking enabled) | ✓ | | | ✓ | |
| mangle | ✓ | ✓ | ✓ | ✓ | ✓ |
| nat (DNAT) | ✓ | | | ✓ | |
| (routing decision) | ✓ | | | ✓ | |
| filter | | ✓ | ✓ | ✓ | |
| security | | ✓ | ✓ | ✓ | |
| nat (SNAT) | | ✓ | | | ✓ |
As a packet triggers a netfilter
hook, the associated chains will be processed as they are listed in the table above from top-to-bottom. The hooks (columns) that a packet will trigger depend on whether it is an incoming or outgoing packet, the routing decisions that are made, and whether the packet passes filtering criteria.
Certain events will cause a table’s chain to be skipped during processing. For instance, only the first packet in a connection will be evaluated against the NAT rules. Any nat
decisions made for the first packet will be applied to all subsequent packets in the connection without additional evaluation. Responses to NAT’ed connections will automatically have the reverse NAT rules applied to route correctly.
Assuming that the server knows how to route a packet and that the firewall rules permit its transmission, the following flows represent the paths that will be traversed in different situations:
- Incoming packets destined for the local system: PREROUTING -> INPUT
- Incoming packets destined for another host: PREROUTING -> FORWARD -> POSTROUTING
- Locally generated packets: OUTPUT -> POSTROUTING
If we combine the above information with the ordering laid out in the previous table, we can see that an incoming packet destined for the local system will first be evaluated against the PREROUTING chains of the raw, mangle, and nat tables. It will then traverse the INPUT chains of the mangle, filter, security, and nat tables before finally being delivered to the local socket.
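To make the traversal order concrete, the following Python sketch encodes the per-chain table order and the three packet flows as plain data. The dictionary contents reflect one reading of the ordering described above, and the names `CHAIN_ORDER`, `FLOWS`, and `traversal` are hypothetical helpers for illustration only. Routing decisions and connection tracking are left out.

```python
# Table order within each built-in chain, read top to bottom.
# Routing decisions and connection tracking are omitted from this sketch.
CHAIN_ORDER = {
    "PREROUTING": ["raw", "mangle", "nat"],
    "INPUT": ["mangle", "filter", "security", "nat"],
    "FORWARD": ["mangle", "filter", "security"],
    "OUTPUT": ["raw", "mangle", "nat", "filter", "security"],
    "POSTROUTING": ["mangle", "nat"],
}

# Chains traversed in each situation, assuming the packet is routable
# and no rule drops it along the way.
FLOWS = {
    "incoming_local": ["PREROUTING", "INPUT"],
    "forwarded": ["PREROUTING", "FORWARD", "POSTROUTING"],
    "locally_generated": ["OUTPUT", "POSTROUTING"],
}


def traversal(flow):
    """Return the (chain, table) pairs in the order they would be evaluated."""
    return [(chain, table) for chain in FLOWS[flow] for table in CHAIN_ORDER[chain]]


for chain, table in traversal("incoming_local"):
    print(f"{table:<8} {chain}")
```

Calling `traversal("forwarded")` or `traversal("locally_generated")` gives the corresponding sequence for the other two flows.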
Rules are placed within a specific chain of a specific table. As each chain is called, the packet in question will be checked against each rule within the chain in order. Each rule has a matching component and an action component.
The matching portion of a rule specifies the criteria that a packet must meet in order for the associated action (or “target”) to be executed.
The matching system is very flexible and can be expanded significantly with additional iptables
extensions. Rules can be constructed to match by protocol type, destination or source address, destination or source port, destination or source network, input or output interface, headers, or connection state among other criteria. These can be combined to create complex rule sets to distinguish between different traffic.
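As a rough illustration of how several criteria combine, the sketch below models the matching half of a rule in Python. It is not how iptables is implemented: the `Match` class, its field names, and the dictionary-based packet are assumptions made for this example. A criterion left unset matches any packet, and every criterion that is set must hold.

```python
import ipaddress
from dataclasses import dataclass
from typing import Optional


@dataclass
class Match:
    """Hypothetical matching component; unset fields match any packet."""

    protocol: Optional[str] = None
    source: Optional[str] = None        # network in CIDR notation
    dport: Optional[int] = None
    in_interface: Optional[str] = None

    def matches(self, packet: dict) -> bool:
        # Every criterion that is set must hold (logical AND).
        if self.protocol is not None and packet.get("protocol") != self.protocol:
            return False
        if self.source is not None:
            src = packet.get("src")
            if src is None:
                return False
            if ipaddress.ip_address(src) not in ipaddress.ip_network(self.source):
                return False
        if self.dport is not None and packet.get("dport") != self.dport:
            return False
        if self.in_interface is not None and packet.get("in") != self.in_interface:
            return False
        return True


ssh_from_lan = Match(protocol="tcp", source="203.0.113.0/24", dport=22)
packet = {"protocol": "tcp", "src": "203.0.113.7", "dport": 22, "in": "eth0"}
print(ssh_from_lan.matches(packet))
```

The addresses used here come from the documentation ranges and are placeholders only.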
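A “target” refers to the actions that are triggered when a packet meets the matching criteria of a rule. Targets are generally divided into two categories:
- Terminating targets: These perform an action that ends evaluation within the chain and returns control to the netfilter hook. Depending on the return value provided, the hook might drop the packet or allow the packet to continue to the next stage of processing.
- Non-terminating targets: These perform an action and then let evaluation continue within the chain. Any number of non-terminating targets can run before the chain eventually reaches a terminating decision.

The availability of each target within rules will depend on context. For instance, the table and chain type might dictate the targets available. The extensions activated in the rule and the matching clauses can also affect the availability of targets.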
There is also a special class of non-terminating target: the jump target. Jump targets are actions that result in evaluation moving to a different chain for additional processing. We’ve covered the built-in chains which are tied to the netfilter
hooks that call them. However, iptables
also allows administrators to create their own chains for organizational purposes.
Rules can be placed in user-defined chains in the same way that they can be placed into built-in chains. The difference is that user-defined chains can only be reached by “jumping” to them from a rule (they are not registered with a netfilter
hook themselves).
User-defined chains act as extensions of the chain which called them. For instance, in a user-defined chain, evaluation will pass back to the calling chain if the end of the rule list is reached or if a RETURN
target is activated by a matching rule. Evaluation can also jump to additional user-defined chains.
This construct allows for greater organization and provides the framework necessary for more robust branching.
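The jump and return behavior can be sketched as a short recursive evaluator in Python. This is a simplified model under stated assumptions: rules are `(match_function, target)` pairs, `LOG` stands in for any non-terminating target, any unrecognized target name is treated as a jump to a user-defined chain, and the function names are hypothetical. The default policy applied when a built-in chain finishes without a verdict corresponds to the policy set with `iptables -P`.

```python
ACCEPT, DROP, RETURN, LOG = "ACCEPT", "DROP", "RETURN", "LOG"
TERMINATING = {ACCEPT, DROP}


def evaluate(chains, chain_name, packet, log):
    """Return a terminating verdict, or None if the chain ends or hits RETURN."""
    for match, target in chains[chain_name]:
        if not match(packet):
            continue
        if target in TERMINATING:
            return target
        if target == RETURN:
            return None  # hand evaluation back to the calling chain
        if target == LOG:
            log.append((chain_name, dict(packet)))  # non-terminating: keep going
            continue
        # Any other target is treated as a jump to a user-defined chain.
        verdict = evaluate(chains, target, packet, log)
        if verdict is not None:
            return verdict
    return None


def run_builtin(chains, chain_name, policy, packet, log=None):
    """Evaluate a built-in chain and fall back to its default policy."""
    log = [] if log is None else log
    verdict = evaluate(chains, chain_name, packet, log)
    return policy if verdict is None else verdict


chains = {
    "INPUT": [
        (lambda p: p["protocol"] == "tcp", "TCP_RULES"),  # jump
        (lambda p: True, LOG),
    ],
    "TCP_RULES": [  # user-defined chain, reachable only through the jump
        (lambda p: p["dport"] == 22, ACCEPT),
        (lambda p: True, RETURN),
    ],
}

print(run_builtin(chains, "INPUT", DROP, {"protocol": "tcp", "dport": 22}))
print(run_builtin(chains, "INPUT", DROP, {"protocol": "tcp", "dport": 80}))
```

In the second call, the packet falls through `TCP_RULES`, returns to `INPUT`, is logged, and finally receives the default policy.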
We introduced the connection tracking system implemented on top of the netfilter
framework when we discussed the raw
table and connection state matching criteria. Connection tracking allows iptables
to make decisions about packets viewed in the context of an ongoing connection. The connection tracking system provides iptables
with the functionality it needs to perform “stateful” operations.
Connection tracking is applied very soon after packets enter the networking stack. The raw
table chains and some sanity checks are the only logic that is performed on packets prior to associating the packets with a connection.
The system checks each packet against a set of existing connections. It will update the state of the connection in its store if needed and will add new connections to the system when necessary. Packets that have been marked with the NOTRACK
target in one of the raw
chains will bypass the connection tracking routines.
Connections tracked by the connection tracking system will be in one of the following states:
NEW
: When a packet arrives that is not associated with an existing connection, but is not invalid as a first packet, a new connection will be added to the system with this label. This happens for both connection-aware protocols like TCP and for connectionless protocols like UDP.
- ESTABLISHED: A connection is changed from NEW to ESTABLISHED when it receives a valid response in the opposite direction. For TCP connections, this means a SYN/ACK, and for UDP and ICMP traffic, this means a response where the source and destination of the original packet are switched.
- RELATED: Packets that are not part of an existing connection, but are associated with a connection already in the system, are labeled RELATED. This could mean a helper connection, as is the case with FTP data transmission connections, or it could be ICMP responses to connection attempts by other protocols.
- INVALID: Packets can be marked INVALID if they are not associated with an existing connection and aren’t appropriate for opening a new connection, if they cannot be identified, or if they aren’t routable, among other reasons.
- UNTRACKED: Packets can be marked as UNTRACKED if they’ve been targeted in a raw table chain to bypass tracking.
- SNAT: This is a virtual state set when the source address has been altered by NAT operations. This is used by the connection tracking system so that it knows to change the source addresses back in reply packets.
- DNAT: This is a virtual state set when the destination address has been altered by NAT operations. This is used by the connection tracking system so that it knows to change the destination address back when routing reply packets.

The states tracked in the connection tracking system allow administrators to craft rules that target specific points in a connection’s lifetime. This provides the functionality needed for more thorough and secure rules.
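The NEW to ESTABLISHED transition can be illustrated with a deliberately small Python model. It is a sketch under strong assumptions: the `ConnTracker` class is hypothetical, connections are keyed on a five-tuple only, protocol details such as TCP flags and timeouts are ignored, and the RELATED, INVALID, SNAT, and DNAT states are not modeled.

```python
class ConnTracker:
    """Toy connection tracker keyed on the original direction's five-tuple."""

    def __init__(self):
        self._connections = {}  # (proto, src, sport, dst, dport) -> state

    def classify(self, proto, src, sport, dst, dport, notrack=False):
        if notrack:
            # Stands in for a packet hit by NOTRACK in a raw table chain.
            return "UNTRACKED"
        forward = (proto, src, sport, dst, dport)
        reverse = (proto, dst, dport, src, sport)
        if reverse in self._connections:
            # A reply in the opposite direction establishes the connection.
            self._connections[reverse] = "ESTABLISHED"
            return "ESTABLISHED"
        if forward in self._connections:
            # Same direction as the first packet: report the stored state.
            return self._connections[forward]
        self._connections[forward] = "NEW"
        return "NEW"


tracker = ConnTracker()
print(tracker.classify("udp", "10.0.0.5", 40000, "10.0.0.1", 53))  # first packet
print(tracker.classify("udp", "10.0.0.1", 53, "10.0.0.5", 40000))  # reply
print(tracker.classify("udp", "10.0.0.5", 40000, "10.0.0.1", 53))  # follow-up
```

A rule that matches on connection state would consult a label like the one returned here, rather than re-deriving the relationship between packets itself.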
The netfilter
packet filtering framework and the iptables
firewall are the basis for most firewall solutions on Linux servers. The netfilter
kernel hooks are close enough to the networking stack to provide powerful control over packets as they are processed by the system. The iptables
firewall leverages these capabilities to provide a flexible, extensible method of communicating policy requirements to the kernel. By learning about how these pieces fit together, you can better utilize them to control and secure your server environments.