Linux Networking and Network Devices APIs

This documentation is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 2 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

You should have received a copy of the GNU General Public License along with this program; if not, write to the Free Software Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA

For more details see the file COPYING in the source distribution of Linux.


Table of Contents

1. Linux Networking
Networking Base Types
Socket Buffer Functions
Socket Filter
Generic Network Statistics
SUN RPC subsystem
WiMAX
2. Network device support
Driver Support
PHY Support

Chapter 1. Linux Networking

Networking Base Types

Name

enum sock_type — Socket types

Synopsis

enum sock_type {
  SOCK_STREAM,
  SOCK_DGRAM,
  SOCK_RAW,
  SOCK_RDM,
  SOCK_SEQPACKET,
  SOCK_DCCP,
  SOCK_PACKET
};  

Constants

SOCK_STREAM

stream (connection) socket

SOCK_DGRAM

datagram (conn.less) socket

SOCK_RAW

raw socket

SOCK_RDM

reliably-delivered message

SOCK_SEQPACKET

sequential packet socket

SOCK_DCCP

Datagram Congestion Control Protocol socket

SOCK_PACKET

linux specific way of getting packets at the dev level. For writing rarp and other similar things on the user level.

Description

When adding some new socket type please grep ARCH_HAS_SOCKET_TYPE include/asm-* /socket.h, at least MIPS overrides this enum for binary compat reasons.


Name

struct socket — general BSD socket

Synopsis

struct socket {
  socket_state state;
  short type;
  unsigned long flags;
  struct fasync_struct * fasync_list;
  wait_queue_head_t wait;
  struct file * file;
  struct sock * sk;
  const struct proto_ops * ops;
};  

Members

state

socket state (SS_CONNECTED, etc)

type

socket type (SOCK_STREAM, etc)

flags

socket flags (SOCK_ASYNC_NOSPACE, etc)

fasync_list

Asynchronous wake up list

wait

wait queue for several uses

file

File back pointer for gc

sk

internal networking protocol agnostic socket representation

ops

protocol specific socket operations

Socket Buffer Functions

Name

struct skb_shared_hwtstamps — hardware time stamps

Synopsis

struct skb_shared_hwtstamps {
  ktime_t hwtstamp;
  ktime_t syststamp;
};  

Members

hwtstamp

hardware time stamp transformed into duration since arbitrary point in time

syststamp

hwtstamp transformed to system time base

Description

Software time stamps generated by ktime_get_real are stored in skb->tstamp. The relation between the different kinds of time

stamps is as follows

syststamp and tstamp can be compared against each other in arbitrary combinations. The accuracy of a syststamp/tstamp/“syststamp from other device” comparison is limited by the accuracy of the transformation into system time base. This depends on the device driver and its underlying hardware.

hwtstamps can only be compared against other hwtstamps from the same device.

This structure is attached to packets as part of the skb_shared_info. Use skb_hwtstamps to get a pointer.


Name

struct skb_shared_tx — instructions for time stamping of outgoing packets

Synopsis

struct skb_shared_tx {
  struct {unnamed_struct};
  __u8 flags;
};  

Members

{unnamed_struct}

anonymous

flags

all shared_tx flags

Description

These flags are attached to packets as part of the skb_shared_info. Use skb_tx to get a pointer.


Name

struct sk_buff — socket buffer

Synopsis

struct sk_buff {
  struct sk_buff * next;
  struct sk_buff * prev;
  struct sock * sk;
  ktime_t tstamp;
  struct net_device * dev;
  unsigned long _skb_dst;
#ifdef CONFIG_XFRM
  struct sec_path * sp;
#endif
  char cb[48];
  unsigned int len;
  unsigned int data_len;
  __u16 mac_len;
  __u16 hdr_len;
  union {unnamed_union};
  __u32 priority;
  __u8 local_df:1;
  __u8 cloned:1;
  __u8 ip_summed:2;
  __u8 nohdr:1;
  __u8 nfctinfo:3;
  __u8 pkt_type:3;
  __u8 fclone:2;
  __u8 ipvs_property:1;
  __u8 peeked:1;
  __u8 nf_trace:1;
  __be16 protocol;
  void (* destructor) (struct sk_buff *skb);
#if defined(CONFIG_NF_CONNTRACK) || defined(CONFIG_NF_CONNTRACK_MODULE)
  struct nf_conntrack * nfct;
  struct sk_buff * nfct_reasm;
#endif
#ifdef CONFIG_BRIDGE_NETFILTER
  struct nf_bridge_info * nf_bridge;
#endif
  int iif;
  __u16 queue_mapping;
#ifdef CONFIG_NET_SCHED
  __u16 tc_index;
#ifdef CONFIG_NET_CLS_ACT
  __u16 tc_verd;
#endif
#endif
#ifdef CONFIG_IPV6_NDISC_NODETYPE
  __u8 ndisc_nodetype:2;
#endif
#ifdef CONFIG_NET_DMA
  dma_cookie_t dma_cookie;
#endif
#ifdef CONFIG_NETWORK_SECMARK
  __u32 secmark;
#endif
  __u32 mark;
  __u16 vlan_tci;
  sk_buff_data_t transport_header;
  sk_buff_data_t network_header;
  sk_buff_data_t mac_header;
  sk_buff_data_t tail;
  sk_buff_data_t end;
  unsigned char * head;
  unsigned char * data;
  unsigned int truesize;
  atomic_t users;
};  

Members

next

Next buffer in list

prev

Previous buffer in list

sk

Socket we are owned by

tstamp

Time we arrived

dev

Device we arrived on/are leaving by

_skb_dst

destination entry

sp

the security path, used for xfrm

cb[48]

Control buffer. Free for use by every layer. Put private vars here

len

Length of actual data

data_len

Data length

mac_len

Length of link layer header

hdr_len

writable header length of cloned skb

{unnamed_union}

anonymous

priority

Packet queueing priority

local_df

allow local fragmentation

cloned

Head may be cloned (check refcnt to be sure)

ip_summed

Driver fed us an IP checksum

nohdr

Payload reference only, must not modify header

nfctinfo

Relationship of this skb to the connection

pkt_type

Packet class

fclone

skbuff clone status

ipvs_property

skbuff is owned by ipvs

peeked

this packet has been seen already, so stats have been done for it, don't do them again

nf_trace

netfilter packet trace flag

protocol

Packet protocol from driver

destructor

Destruct function

nfct

Associated connection, if any

nfct_reasm

netfilter conntrack re-assembly pointer

nf_bridge

Saved data about a bridged frame - see br_netfilter.c

iif

ifindex of device we arrived on

queue_mapping

Queue mapping for multiqueue devices

tc_index

Traffic control index

tc_verd

traffic control verdict

ndisc_nodetype

router type (from link layer)

dma_cookie

a cookie to one of several possible DMA operations done by skb DMA functions

secmark

security marking

mark

Generic packet mark

vlan_tci

vlan tag control information

transport_header

Transport layer header

network_header

Network layer header

mac_header

Link layer header

tail

Tail pointer

end

End pointer

head

Head of buffer

data

Data head pointer

truesize

Buffer size

users

User count - see {datagram,tcp}.c


Name

skb_queue_empty — check if a queue is empty

Synopsis

int skb_queue_empty (list); 
const struct sk_buff_head *  list;

Arguments

list

queue head

Description

Returns true if the queue is empty, false otherwise.


Name

skb_queue_is_last — check if skb is the last entry in the queue

Synopsis

bool skb_queue_is_last (list,  
 skb); 
const struct sk_buff_head *  list;
const struct sk_buff *  skb;

Arguments

list

queue head

skb

buffer

Description

Returns true if skb is the last buffer on the list.


Name

skb_queue_is_first — check if skb is the first entry in the queue

Synopsis

bool skb_queue_is_first (list,  
 skb); 
const struct sk_buff_head *  list;
const struct sk_buff *  skb;

Arguments

list

queue head

skb

buffer

Description

Returns true if skb is the first buffer on the list.


Name

skb_queue_next — return the next packet in the queue

Synopsis

struct sk_buff * skb_queue_next (list,  
 skb); 
const struct sk_buff_head *  list;
const struct sk_buff *  skb;

Arguments

list

queue head

skb

current buffer

Description

Return the next packet in list after skb. It is only valid to call this if skb_queue_is_last evaluates to false.


Name

skb_queue_prev — return the prev packet in the queue

Synopsis

struct sk_buff * skb_queue_prev (list,  
 skb); 
const struct sk_buff_head *  list;
const struct sk_buff *  skb;

Arguments

list

queue head

skb

current buffer

Description

Return the prev packet in list before skb. It is only valid to call this if skb_queue_is_first evaluates to false.


Name

skb_get — reference buffer

Synopsis

struct sk_buff * skb_get (skb); 
struct sk_buff *  skb;

Arguments

skb

buffer to reference

Description

Makes another reference to a socket buffer and returns a pointer to the buffer.


Name

skb_cloned — is the buffer a clone

Synopsis

int skb_cloned (skb); 
const struct sk_buff *  skb;

Arguments

skb

buffer to check

Description

Returns true if the buffer was generated with skb_clone and is one of multiple shared copies of the buffer. Cloned buffers are shared data so must not be written to under normal circumstances.


Name

skb_header_cloned — is the header a clone

Synopsis

int skb_header_cloned (skb); 
const struct sk_buff *  skb;

Arguments

skb

buffer to check

Description

Returns true if modifying the header part of the buffer requires the data to be copied.


Name

skb_header_release — release reference to header

Synopsis

void skb_header_release (skb); 
struct sk_buff *  skb;

Arguments

skb

buffer to operate on

Description

Drop a reference to the header part of the buffer. This is done by acquiring a payload reference. You must not read from the header part of skb->data after this.


Name

skb_shared — is the buffer shared

Synopsis

int skb_shared (skb); 
const struct sk_buff *  skb;

Arguments

skb

buffer to check

Description

Returns true if more than one person has a reference to this buffer.


Name

skb_share_check — check if buffer is shared and if so clone it

Synopsis

struct sk_buff * skb_share_check (skb,  
 pri); 
struct sk_buff *  skb;
gfp_t  pri;

Arguments

skb

buffer to check

pri

priority for memory allocation

Description

If the buffer is shared the buffer is cloned and the old copy drops a reference. A new clone with a single reference is returned. If the buffer is not shared the original buffer is returned. When being called from interrupt status or with spinlocks held pri must be GFP_ATOMIC.

NULL is returned on a memory allocation failure.


Name

skb_unshare — make a copy of a shared buffer

Synopsis

struct sk_buff * skb_unshare (skb,  
 pri); 
struct sk_buff *  skb;
gfp_t  pri;

Arguments

skb

buffer to check

pri

priority for memory allocation

Description

If the socket buffer is a clone then this function creates a new copy of the data, drops a reference count on the old copy and returns the new copy with the reference count at 1. If the buffer is not a clone the original buffer is returned. When called with a spinlock held or from interrupt state pri must be GFP_ATOMIC

NULL is returned on a memory allocation failure.


Name

skb_peek —

Synopsis

struct sk_buff * skb_peek (list_); 
struct sk_buff_head *  list_;

Arguments

list_

list to peek at

Description

Peek an sk_buff. Unlike most other operations you _MUST_ be careful with this one. A peek leaves the buffer on the list and someone else may run off with it. You must hold the appropriate locks or have a private queue to do this.

Returns NULL for an empty list or a pointer to the head element. The reference count is not incremented and the reference is therefore volatile. Use with caution.


Name

skb_peek_tail —

Synopsis

struct sk_buff * skb_peek_tail (list_); 
struct sk_buff_head *  list_;

Arguments

list_

list to peek at

Description

Peek an sk_buff. Unlike most other operations you _MUST_ be careful with this one. A peek leaves the buffer on the list and someone else may run off with it. You must hold the appropriate locks or have a private queue to do this.

Returns NULL for an empty list or a pointer to the tail element. The reference count is not incremented and the reference is therefore volatile. Use with caution.


Name

skb_queue_len — get queue length

Synopsis

__u32 skb_queue_len (list_); 
const struct sk_buff_head *  list_;

Arguments

list_

list to measure

Description

Return the length of an sk_buff queue.


Name

__skb_queue_head_init — initialize non-spinlock portions of sk_buff_head

Synopsis

void __skb_queue_head_init (list); 
struct sk_buff_head *  list;

Arguments

list

queue to initialize

Description

This initializes only the list and queue length aspects of an sk_buff_head object. This allows to initialize the list aspects of an sk_buff_head without reinitializing things like the spinlock. It can also be used for on-stack sk_buff_head objects where the spinlock is known to not be used.


Name

skb_queue_splice — join two skb lists, this is designed for stacks

Synopsis

void skb_queue_splice (list,  
 head); 
const struct sk_buff_head *  list;
struct sk_buff_head *  head;

Arguments

list

the new list to add

head

the place to add it in the first list


Name

skb_queue_splice_init — join two skb lists and reinitialise the emptied list

Synopsis

void skb_queue_splice_init (list,  
 head); 
struct sk_buff_head *  list;
struct sk_buff_head *  head;

Arguments

list

the new list to add

head

the place to add it in the first list

Description

The list at list is reinitialised


Name

skb_queue_splice_tail — join two skb lists, each list being a queue

Synopsis

void skb_queue_splice_tail (list,  
 head); 
const struct sk_buff_head *  list;
struct sk_buff_head *  head;

Arguments

list

the new list to add

head

the place to add it in the first list


Name

skb_queue_splice_tail_init — join two skb lists and reinitialise the emptied list

Synopsis

void skb_queue_splice_tail_init (list,  
 head); 
struct sk_buff_head *  list;
struct sk_buff_head *  head;

Arguments

list

the new list to add

head

the place to add it in the first list

Description

Each of the lists is a queue. The list at list is reinitialised


Name

__skb_queue_after — queue a buffer at the list head

Synopsis

void __skb_queue_after (list,  
 prev,  
 newsk); 
struct sk_buff_head *  list;
struct sk_buff *  prev;
struct sk_buff *  newsk;

Arguments

list

list to use

prev

place after this buffer

newsk

buffer to queue

Description

Queue a buffer int the middle of a list. This function takes no locks and you must therefore hold required locks before calling it.

A buffer cannot be placed on two lists at the same time.


Name

skb_headroom — bytes at buffer head

Synopsis

unsigned int skb_headroom (skb); 
const struct sk_buff *  skb;

Arguments

skb

buffer to check

Description

Return the number of bytes of free space at the head of an sk_buff.


Name

skb_tailroom — bytes at buffer end

Synopsis

int skb_tailroom (skb); 
const struct sk_buff *  skb;

Arguments

skb

buffer to check

Description

Return the number of bytes of free space at the tail of an sk_buff


Name

skb_reserve — adjust headroom

Synopsis

void skb_reserve (skb,  
 len); 
struct sk_buff *  skb;
int  len;

Arguments

skb

buffer to alter

len

bytes to move

Description

Increase the headroom of an empty sk_buff by reducing the tail room. This is only allowed for an empty buffer.


Name

pskb_trim_unique — remove end from a paged unique (not cloned) buffer

Synopsis

void pskb_trim_unique (skb,  
 len); 
struct sk_buff *  skb;
unsigned int  len;

Arguments

skb

buffer to alter

len

new length

Description

This is identical to pskb_trim except that the caller knows that the skb is not cloned so we should never get an error due to out- of-memory.


Name

skb_orphan — orphan a buffer

Synopsis

void skb_orphan (skb); 
struct sk_buff *  skb;

Arguments

skb

buffer to orphan

Description

If a buffer currently has an owner then we call the owner's destructor function and make the skb unowned. The buffer continues to exist but is no longer charged to its former owner.


Name

__dev_alloc_skb — allocate an skbuff for receiving

Synopsis

struct sk_buff * __dev_alloc_skb (length,  
 gfp_mask); 
unsigned int  length;
gfp_t  gfp_mask;

Arguments

length

length to allocate

gfp_mask

get_free_pages mask, passed to alloc_skb

Description

Allocate a new sk_buff and assign it a usage count of one. The buffer has unspecified headroom built in. Users should allocate the headroom they think they need without accounting for the built in space. The built in space is used for optimisations.

NULL is returned if there is no free memory.


Name

netdev_alloc_skb — allocate an skbuff for rx on a specific device

Synopsis

struct sk_buff * netdev_alloc_skb (dev,  
 length); 
struct net_device *  dev;
unsigned int  length;

Arguments

dev

network device to receive on

length

length to allocate

Description

Allocate a new sk_buff and assign it a usage count of one. The buffer has unspecified headroom built in. Users should allocate the headroom they think they need without accounting for the built in space. The built in space is used for optimisations.

NULL is returned if there is no free memory. Although this function allocates memory it can be called from an interrupt.


Name

netdev_alloc_page — allocate a page for ps-rx on a specific device

Synopsis

struct page * netdev_alloc_page (dev); 
struct net_device *  dev;

Arguments

dev

network device to receive on

Description

Allocate a new page node local to the specified device.

NULL is returned if there is no free memory.


Name

skb_clone_writable — is the header of a clone writable

Synopsis

int skb_clone_writable (skb,  
 len); 
struct sk_buff *  skb;
unsigned int  len;

Arguments

skb

buffer to check

len

length up to which to write

Description

Returns true if modifying the header part of the cloned buffer does not requires the data to be copied.


Name

skb_cow — copy header of skb when it is required

Synopsis

int skb_cow (skb,  
 headroom); 
struct sk_buff *  skb;
unsigned int  headroom;

Arguments

skb

buffer to cow

headroom

needed headroom

Description

If the skb passed lacks sufficient headroom or its data part is shared, data is reallocated. If reallocation fails, an error is returned and original skb is not changed.

The result is skb with writable area skb->head...skb->tail and at least headroom of space at head.


Name

skb_cow_head — skb_cow but only making the head writable

Synopsis

int skb_cow_head (skb,  
 headroom); 
struct sk_buff *  skb;
unsigned int  headroom;

Arguments

skb

buffer to cow

headroom

needed headroom

Description

This function is identical to skb_cow except that we replace the skb_cloned check by skb_header_cloned. It should be used when you only need to push on some header and do not need to modify the data.


Name

skb_padto — pad an skbuff up to a minimal size

Synopsis

int skb_padto (skb,  
 len); 
struct sk_buff *  skb;
unsigned int  len;

Arguments

skb

buffer to pad

len

minimal length

Description

Pads up a buffer to ensure the trailing bytes exist and are blanked. If the buffer already contains sufficient data it is untouched. Otherwise it is extended. Returns zero on success. The skb is freed on error.


Name

skb_linearize — convert paged skb to linear one

Synopsis

int skb_linearize (skb); 
struct sk_buff *  skb;

Arguments

skb

buffer to linarize

Description

If there is no free memory -ENOMEM is returned, otherwise zero is returned and the old skb data released.


Name

skb_linearize_cow — make sure skb is linear and writable

Synopsis

int skb_linearize_cow (skb); 
struct sk_buff *  skb;

Arguments

skb

buffer to process

Description

If there is no free memory -ENOMEM is returned, otherwise zero is returned and the old skb data released.


Name

skb_postpull_rcsum — update checksum for received skb after pull

Synopsis

void skb_postpull_rcsum (skb,  
 start,  
 len); 
struct sk_buff *  skb;
const void *  start;
unsigned int  len;

Arguments

skb

buffer to update

start

start of data before pull

len

length of data pulled

Description

After doing a pull on a received packet, you need to call this to update the CHECKSUM_COMPLETE checksum, or set ip_summed to CHECKSUM_NONE so that it can be recomputed from scratch.


Name

pskb_trim_rcsum — trim received skb and update checksum

Synopsis

int pskb_trim_rcsum (skb,  
 len); 
struct sk_buff *  skb;
unsigned int  len;

Arguments

skb

buffer to trim

len

new length

Description

This is exactly the same as pskb_trim except that it ensures the checksum of received packets are still valid after the operation.


Name

skb_get_timestamp — get timestamp from a skb

Synopsis

void skb_get_timestamp (skb,  
 stamp); 
const struct sk_buff *  skb;
struct timeval *  stamp;

Arguments

skb

skb to get stamp from

stamp

pointer to struct timeval to store stamp in

Description

Timestamps are stored in the skb as offsets to a base timestamp. This function converts the offset back to a struct timeval and stores it in stamp.


Name

skb_checksum_complete — Calculate checksum of an entire packet

Synopsis

__sum16 skb_checksum_complete (skb); 
struct sk_buff *  skb;

Arguments

skb

packet to process

Description

This function calculates the checksum over the entire packet plus the value of skb->csum. The latter can be used to supply the checksum of a pseudo header as used by TCP/UDP. It returns the checksum.

For protocols that contain complete checksums such as ICMP/TCP/UDP, this function can be used to verify that checksum on received packets. In that case the function should return zero if the checksum is correct. In particular, this function will return zero if skb->ip_summed is CHECKSUM_UNNECESSARY which indicates that the hardware has already verified the correctness of the checksum.


Name

struct sock_common — minimal network layer representation of sockets

Synopsis

struct sock_common {
  union {unnamed_union};
  atomic_t skc_refcnt;
  unsigned int skc_hash;
  unsigned short skc_family;
  volatile unsigned char skc_state;
  unsigned char skc_reuse;
  int skc_bound_dev_if;
  struct hlist_node skc_bind_node;
  struct proto * skc_prot;
#ifdef CONFIG_NET_NS
  struct net * skc_net;
#endif
};  

Members

{unnamed_union}

anonymous

skc_refcnt

reference count

skc_hash

hash value used with various protocol lookup tables

skc_family

network address family

skc_state

Connection state

skc_reuse

SO_REUSEADDR setting

skc_bound_dev_if

bound device index if != 0

skc_bind_node

bind hash linkage for various protocol lookup tables

skc_prot

protocol handlers inside a network family

skc_net

reference to the network namespace of this socket

Description

This is the minimal network layer representation of sockets, the header for struct sock and struct inet_timewait_sock.


Name

struct sock — network layer representation of sockets

Synopsis

struct sock {
  struct sock_common __sk_common;
#define sk_node			__sk_common.skc_node
#define sk_nulls_node		__sk_common.skc_nulls_node
#define sk_refcnt		__sk_common.skc_refcnt
#define sk_copy_start		__sk_common.skc_hash
#define sk_hash			__sk_common.skc_hash
#define sk_family		__sk_common.skc_family
#define sk_state		__sk_common.skc_state
#define sk_reuse		__sk_common.skc_reuse
#define sk_bound_dev_if		__sk_common.skc_bound_dev_if
#define sk_bind_node		__sk_common.skc_bind_node
#define sk_prot			__sk_common.skc_prot
#define sk_net			__sk_common.skc_net
  unsigned int sk_shutdown:2;
  unsigned int sk_no_check:2;
  unsigned int sk_userlocks:4;
  unsigned int sk_protocol:8;
  unsigned int sk_type:16;
  int sk_rcvbuf;
  socket_lock_t sk_lock;
  struct sk_backlog;
  wait_queue_head_t * sk_sleep;
  struct dst_entry * sk_dst_cache;
#ifdef CONFIG_XFRM
  struct xfrm_policy * sk_policy[2];
#endif
  rwlock_t sk_dst_lock;
  atomic_t sk_rmem_alloc;
  atomic_t sk_wmem_alloc;
  atomic_t sk_omem_alloc;
  int sk_sndbuf;
  struct sk_buff_head sk_receive_queue;
  struct sk_buff_head sk_write_queue;
#ifdef CONFIG_NET_DMA
  struct sk_buff_head sk_async_wait_queue;
#endif
  int sk_wmem_queued;
  int sk_forward_alloc;
  gfp_t sk_allocation;
  int sk_route_caps;
  int sk_gso_type;
  unsigned int sk_gso_max_size;
  int sk_rcvlowat;
  unsigned long sk_flags;
  unsigned long sk_lingertime;
  struct sk_buff_head sk_error_queue;
  struct proto * sk_prot_creator;
  rwlock_t sk_callback_lock;
  int sk_err;
  int sk_err_soft;
  atomic_t sk_drops;
  unsigned short sk_ack_backlog;
  unsigned short sk_max_ack_backlog;
  __u32 sk_priority;
  struct ucred sk_peercred;
  long sk_rcvtimeo;
  long sk_sndtimeo;
  struct sk_filter * sk_filter;
  void * sk_protinfo;
  struct timer_list sk_timer;
  ktime_t sk_stamp;
  struct socket * sk_socket;
  void * sk_user_data;
  struct page * sk_sndmsg_page;
  struct sk_buff * sk_send_head;
  __u32 sk_sndmsg_off;
  int sk_write_pending;
#ifdef CONFIG_SECURITY
  void * sk_security;
#endif
  __u32 sk_mark;
  void (* sk_state_change) (struct sock *sk);
  void (* sk_data_ready) (struct sock *sk, int bytes);
  void (* sk_write_space) (struct sock *sk);
  void (* sk_error_report) (struct sock *sk);
  int (* sk_backlog_rcv) (struct sock *sk,struct sk_buff *skb);
  void (* sk_destruct) (struct sock *sk);
};  

Members

__sk_common

shared layout with inet_timewait_sock

sk_shutdown

mask of SEND_SHUTDOWN and/or RCV_SHUTDOWN

sk_no_check

SO_NO_CHECK setting, wether or not checkup packets

sk_userlocks

SO_SNDBUF and SO_RCVBUF settings

sk_protocol

which protocol this socket belongs in this network family

sk_type

socket type (SOCK_STREAM, etc)

sk_rcvbuf

size of receive buffer in bytes

sk_lock

synchronizer

sk_backlog

always used with the per-socket spinlock held

sk_sleep

sock wait queue

sk_dst_cache

destination cache

sk_policy[2]

flow policy

sk_dst_lock

destination cache lock

sk_rmem_alloc

receive queue bytes committed

sk_wmem_alloc

transmit queue bytes committed

sk_omem_alloc

"o“ is ”option“ or ”other"

sk_sndbuf

size of send buffer in bytes

sk_receive_queue

incoming packets

sk_write_queue

Packet sending queue

sk_async_wait_queue

DMA copied packets

sk_wmem_queued

persistent queue size

sk_forward_alloc

space allocated forward

sk_allocation

allocation mode

sk_route_caps

route capabilities (e.g. NETIF_F_TSO)

sk_gso_type

GSO type (e.g. SKB_GSO_TCPV4)

sk_gso_max_size

Maximum GSO segment size to build

sk_rcvlowat

SO_RCVLOWAT setting

sk_flags

SO_LINGER (l_onoff), SO_BROADCAST, SO_KEEPALIVE, SO_OOBINLINE settings, SO_TIMESTAMPING settings

sk_lingertime

SO_LINGER l_linger setting

sk_error_queue

rarely used

sk_prot_creator

sk_prot of original sock creator (see ipv6_setsockopt, IPV6_ADDRFORM for instance)

sk_callback_lock

used with the callbacks in the end of this struct

sk_err

last error

sk_err_soft

errors that don't cause failure but are the cause of a persistent failure not just 'timed out'

sk_drops

raw/udp drops counter

sk_ack_backlog

current listen backlog

sk_max_ack_backlog

listen backlog set in listen

sk_priority

SO_PRIORITY setting

sk_peercred

SO_PEERCRED setting

sk_rcvtimeo

SO_RCVTIMEO setting

sk_sndtimeo

SO_SNDTIMEO setting

sk_filter

socket filtering instructions

sk_protinfo

private area, net family specific, when not using slab

sk_timer

sock cleanup timer

sk_stamp

time stamp of last packet received

sk_socket

Identd and reporting IO signals

sk_user_data

RPC layer private data

sk_sndmsg_page

cached page for sendmsg

sk_send_head

front of stuff to transmit

sk_sndmsg_off

cached offset for sendmsg

sk_write_pending

a write to stream socket waits to start

sk_security

used by security modules

sk_mark

generic packet mark

sk_state_change

callback to indicate change in the state of the sock

sk_data_ready

callback to indicate there is data to be processed

sk_write_space

callback to indicate there is bf sending space available

sk_error_report

callback to indicate errors (e.g. MSG_ERRQUEUE)

sk_backlog_rcv

callback to process the backlog

sk_destruct

called at sock freeing time, i.e. when all refcnt == 0


Name

sk_filter_release —

Synopsis

void sk_filter_release (fp); 
struct sk_filter *  fp;

Arguments

fp

filter to remove

Description

Remove a filter from a socket and release its resources.


Name

sk_wmem_alloc_get — returns write allocations

Synopsis

int sk_wmem_alloc_get (sk); 
const struct sock *  sk;

Arguments

sk

socket

Description

Returns sk_wmem_alloc minus initial offset of one


Name

sk_rmem_alloc_get — returns read allocations

Synopsis

int sk_rmem_alloc_get (sk); 
const struct sock *  sk;

Arguments

sk

socket

Description

Returns sk_rmem_alloc


Name

sk_has_allocations — check if allocations are outstanding

Synopsis

int sk_has_allocations (sk); 
const struct sock *  sk;

Arguments

sk

socket

Description

Returns true if socket has write or read allocations


Name

sk_has_sleeper — check if there are any waiting processes

Synopsis

int sk_has_sleeper (sk); 
struct sock *  sk;

Arguments

sk

socket

Description

Returns true if socket has waiting processes

The purpose of the sk_has_sleeper and sock_poll_wait is to wrap the memory barrier call. They were added due to the race found within the tcp code.

Consider following tcp code paths

CPU1 CPU2

sys_select receive packet ... ... __add_wait_queue update tp->rcv_nxt ... ... tp->rcv_nxt check sock_def_readable ... { schedule ... if (sk->sk_sleep && waitqueue_active(sk->sk_sleep)) wake_up_interruptible(sk->sk_sleep) ... }

The race for tcp fires when the __add_wait_queue changes done by CPU1 stay in its cache, and so does the tp->rcv_nxt update on CPU2 side. The CPU1 could then endup calling schedule and sleep forever if there are no more data on the socket.

The sk_has_sleeper is always called right after a call to read_lock, so we can use smp_mb__after_lock barrier.


Name

sock_poll_wait — place memory barrier behind the poll_wait call.

Synopsis

void sock_poll_wait (filp,  
 wait_address,  
 p); 
struct file *  filp;
wait_queue_head_t *  wait_address;
poll_table *  p;

Arguments

filp

file

wait_address

socket wait queue

p

poll_table

Description

See the comments in the sk_has_sleeper function.


Name

sk_eat_skb — Release a skb if it is no longer needed

Synopsis

void sk_eat_skb (sk,  
 skb,  
 copied_early); 
struct sock *  sk;
struct sk_buff *  skb;
int  copied_early;

Arguments

sk

socket to eat this skb from

skb

socket buffer to eat

copied_early

flag indicating whether DMA operations copied this data early

Description

This routine must be called with interrupts disabled or with the socket locked so that the sk_buff queue operation is ok.


Name

sockfd_lookup — Go from a file number to its socket slot

Synopsis

struct socket * sockfd_lookup (fd,  
 err); 
int  fd;
int *  err;

Arguments

fd

file handle

err

pointer to an error code return

Description

The file handle passed in is locked and the socket it is bound too is returned. If an error occurs the err pointer is overwritten with a negative errno code and NULL is returned. The function checks for both invalid handles and passing a handle which is not a socket.

On a success the socket object pointer is returned.


Name

sock_release — close a socket

Synopsis

void sock_release (sock); 
struct socket *  sock;

Arguments

sock

socket to close

Description

The socket is released from the protocol stack if it has a release callback, and the inode is then released if the socket is bound to an inode not a file.


Name

sock_register — add a socket protocol handler

Synopsis

int sock_register (ops); 
const struct net_proto_family *  ops;

Arguments

ops

description of protocol

Description

This function is called by a protocol handler that wants to advertise its address family, and have it linked into the socket interface. The value ops->family coresponds to the socket system call protocol family.


Name

sock_unregister — remove a protocol handler

Synopsis

void sock_unregister (family); 
int  family;

Arguments

family

protocol family to remove

Description

This function is called by a protocol handler that wants to remove its address family, and have it unlinked from the new socket creation.

If protocol handler is a module, then it can use module reference counts to protect against new references. If protocol handler is not a module then it needs to provide its own protection in the ops->create routine.


Name

skb_over_panic — private function

Synopsis

void skb_over_panic (skb,  
 sz,  
 here); 
struct sk_buff *  skb;
int  sz;
void *  here;

Arguments

skb

buffer

sz

size

here

address

Description

Out of line support code for skb_put. Not user callable.


Name

skb_under_panic — private function

Synopsis

void skb_under_panic (skb,  
 sz,  
 here); 
struct sk_buff *  skb;
int  sz;
void *  here;

Arguments

skb

buffer

sz

size

here

address

Description

Out of line support code for skb_push. Not user callable.


Name

__alloc_skb — allocate a network buffer

Synopsis

struct sk_buff * __alloc_skb (size,  
 gfp_mask,  
 fclone,  
 node); 
unsigned int  size;
gfp_t  gfp_mask;
int  fclone;
int  node;

Arguments

size

size to allocate

gfp_mask

allocation mask

fclone

allocate from fclone cache instead of head cache and allocate a cloned (child) skb

node

numa node to allocate memory on

Description

Allocate a new sk_buff. The returned buffer has no headroom and a tail room of size bytes. The object has a reference count of one. The return is the buffer. On a failure the return is NULL.

Buffers may only be allocated from interrupts using a gfp_mask of GFP_ATOMIC.


Name

__netdev_alloc_skb — allocate an skbuff for rx on a specific device

Synopsis

struct sk_buff * __netdev_alloc_skb (dev,  
 length,  
 gfp_mask); 
struct net_device *  dev;
unsigned int  length;
gfp_t  gfp_mask;

Arguments

dev

network device to receive on

length

length to allocate

gfp_mask

get_free_pages mask, passed to alloc_skb

Description

Allocate a new sk_buff and assign it a usage count of one. The buffer has unspecified headroom built in. Users should allocate the headroom they think they need without accounting for the built in space. The built in space is used for optimisations.

NULL is returned if there is no free memory.


Name

dev_alloc_skb — allocate an skbuff for receiving

Synopsis

struct sk_buff * dev_alloc_skb (length); 
unsigned int  length;

Arguments

length

length to allocate

Description

Allocate a new sk_buff and assign it a usage count of one. The buffer has unspecified headroom built in. Users should allocate the headroom they think they need without accounting for the built in space. The built in space is used for optimisations.

NULL is returned if there is no free memory. Although this function allocates memory it can be called from an interrupt.


Name

__kfree_skb — private function

Synopsis

void __kfree_skb (skb); 
struct sk_buff *  skb;

Arguments

skb

buffer

Description

Free an sk_buff. Release anything attached to the buffer. Clean the state. This is an internal helper function. Users should always call kfree_skb


Name

kfree_skb — free an sk_buff

Synopsis

void kfree_skb (skb); 
struct sk_buff *  skb;

Arguments

skb

buffer to free

Description

Drop a reference to the buffer and free it if the usage count has hit zero.


Name

consume_skb — free an skbuff

Synopsis

void consume_skb (skb); 
struct sk_buff *  skb;

Arguments

skb

buffer to free

Description

Drop a ref to the buffer and free it if the usage count has hit zero Functions identically to kfree_skb, but kfree_skb assumes that the frame is being dropped after a failure and notes that


Name

skb_recycle_check — check if skb can be reused for receive

Synopsis

int skb_recycle_check (skb,  
 skb_size); 
struct sk_buff *  skb;
int  skb_size;

Arguments

skb

buffer

skb_size

minimum receive buffer size

Description

Checks that the skb passed in is not shared or cloned, and that it is linear and its head portion at least as large as skb_size so that it can be recycled as a receive buffer. If these conditions are met, this function does any necessary reference count dropping and cleans up the skbuff as if it just came from __alloc_skb.


Name

skb_morph — morph one skb into another

Synopsis

struct sk_buff * skb_morph (dst,  
 src); 
struct sk_buff *  dst;
struct sk_buff *  src;

Arguments

dst

the skb to receive the contents

src

the skb to supply the contents

Description

This is identical to skb_clone except that the target skb is supplied by the user.

The target skb is returned upon exit.


Name

skb_clone — duplicate an sk_buff

Synopsis

struct sk_buff * skb_clone (skb,  
 gfp_mask); 
struct sk_buff *  skb;
gfp_t  gfp_mask;

Arguments

skb

buffer to clone

gfp_mask

allocation priority

Description

Duplicate an sk_buff. The new one is not owned by a socket. Both copies share the same packet data but not structure. The new buffer has a reference count of 1. If the allocation fails the function returns NULL otherwise the new buffer is returned.

If this function is called from an interrupt gfp_mask must be GFP_ATOMIC.


Name

skb_copy — create private copy of an sk_buff

Synopsis

struct sk_buff * skb_copy (skb,  
 gfp_mask); 
const struct sk_buff *  skb;
gfp_t  gfp_mask;

Arguments

skb

buffer to copy

gfp_mask

allocation priority

Description

Make a copy of both an sk_buff and its data. This is used when the caller wishes to modify the data and needs a private copy of the data to alter. Returns NULL on failure or the pointer to the buffer on success. The returned buffer has a reference count of 1.

As by-product this function converts non-linear sk_buff to linear one, so that sk_buff becomes completely private and caller is allowed to modify all the data of returned buffer. This means that this function is not recommended for use in circumstances when only header is going to be modified. Use pskb_copy instead.


Name

pskb_copy — create copy of an sk_buff with private head.

Synopsis

struct sk_buff * pskb_copy (skb,  
 gfp_mask); 
struct sk_buff *  skb;
gfp_t  gfp_mask;

Arguments

skb

buffer to copy

gfp_mask

allocation priority

Description

Make a copy of both an sk_buff and part of its data, located in header. Fragmented data remain shared. This is used when the caller wishes to modify only header of sk_buff and needs private copy of the header to alter. Returns NULL on failure or the pointer to the buffer on success. The returned buffer has a reference count of 1.


Name

pskb_expand_head — reallocate header of sk_buff

Synopsis

int pskb_expand_head (skb,  
 nhead,  
 ntail,  
 gfp_mask); 
struct sk_buff *  skb;
int  nhead;
int  ntail;
gfp_t  gfp_mask;

Arguments

skb

buffer to reallocate

nhead

room to add at head

ntail

room to add at tail

gfp_mask

allocation priority

Description

Expands (or creates identical copy, if nhead and ntail are zero) header of skb. sk_buff itself is not changed. sk_buff MUST have reference count of 1. Returns zero in the case of success or error, if expansion failed. In the last case, sk_buff is not changed.

All the pointers pointing into skb header may change and must be reloaded after call to this function.


Name

skb_copy_expand — copy and expand sk_buff

Synopsis

struct sk_buff * skb_copy_expand (skb,  
 newheadroom,  
 newtailroom,  
 gfp_mask); 
const struct sk_buff *  skb;
int  newheadroom;
int  newtailroom;
gfp_t  gfp_mask;

Arguments

skb

buffer to copy

newheadroom

new free bytes at head

newtailroom

new free bytes at tail

gfp_mask

allocation priority

Description

Make a copy of both an sk_buff and its data and while doing so allocate additional space.

This is used when the caller wishes to modify the data and needs a private copy of the data to alter as well as more space for new fields. Returns NULL on failure or the pointer to the buffer on success. The returned buffer has a reference count of 1.

You must pass GFP_ATOMIC as the allocation priority if this function is called from an interrupt.


Name

skb_pad — zero pad the tail of an skb

Synopsis

int skb_pad (skb,  
 pad); 
struct sk_buff *  skb;
int  pad;

Arguments

skb

buffer to pad

pad

space to pad

Description

Ensure that a buffer is followed by a padding area that is zero filled. Used by network drivers which may DMA or transfer data beyond the buffer end onto the wire.

May return error in out of memory cases. The skb is freed on error.


Name

skb_put — add data to a buffer

Synopsis

unsigned char * skb_put (skb,  
 len); 
struct sk_buff *  skb;
unsigned int  len;

Arguments

skb

buffer to use

len

amount of data to add

Description

This function extends the used data area of the buffer. If this would exceed the total buffer size the kernel will panic. A pointer to the first byte of the extra data is returned.


Name

skb_push — add data to the start of a buffer

Synopsis

unsigned char * skb_push (skb,  
 len); 
struct sk_buff *  skb;
unsigned int  len;

Arguments

skb

buffer to use

len

amount of data to add

Description

This function extends the used data area of the buffer at the buffer start. If this would exceed the total buffer headroom the kernel will panic. A pointer to the first byte of the extra data is returned.


Name

skb_pull — remove data from the start of a buffer

Synopsis

unsigned char * skb_pull (skb,  
 len); 
struct sk_buff *  skb;
unsigned int  len;

Arguments

skb

buffer to use

len

amount of data to remove

Description

This function removes data from the start of a buffer, returning the memory to the headroom. A pointer to the next data in the buffer is returned. Once the data has been pulled future pushes will overwrite the old data.


Name

skb_trim — remove end from a buffer

Synopsis

void skb_trim (skb,  
 len); 
struct sk_buff *  skb;
unsigned int  len;

Arguments

skb

buffer to alter

len

new length

Description

Cut the length of a buffer down by removing data from the tail. If the buffer is already under the length specified it is not modified. The skb must be linear.


Name

__pskb_pull_tail — advance tail of skb header

Synopsis

unsigned char * __pskb_pull_tail (skb,  
 delta); 
struct sk_buff *  skb;
int  delta;

Arguments

skb

buffer to reallocate

delta

number of bytes to advance tail

Description

The function makes a sense only on a fragmented sk_buff, it expands header moving its tail forward and copying necessary data from fragmented part.

sk_buff MUST have reference count of 1.

Returns NULL (and sk_buff does not change) if pull failed or value of new tail of skb in the case of success.

All the pointers pointing into skb header may change and must be reloaded after call to this function.


Name

skb_store_bits — store bits from kernel buffer to skb

Synopsis

int skb_store_bits (skb,  
 offset,  
 from,  
 len); 
struct sk_buff *  skb;
int  offset;
const void *  from;
int  len;

Arguments

skb

destination buffer

offset

offset in destination

from

source buffer

len

number of bytes to copy

Description

Copy the specified number of bytes from the source buffer to the destination skb. This function handles all the messy bits of traversing fragment lists and such.


Name

skb_dequeue — remove from the head of the queue

Synopsis

struct sk_buff * skb_dequeue (list); 
struct sk_buff_head *  list;

Arguments

list

list to dequeue from

Description

Remove the head of the list. The list lock is taken so the function may be used safely with other locking list functions. The head item is returned or NULL if the list is empty.


Name

skb_dequeue_tail — remove from the tail of the queue

Synopsis

struct sk_buff * skb_dequeue_tail (list); 
struct sk_buff_head *  list;

Arguments

list

list to dequeue from

Description

Remove the tail of the list. The list lock is taken so the function may be used safely with other locking list functions. The tail item is returned or NULL if the list is empty.


Name

skb_queue_purge — empty a list

Synopsis

void skb_queue_purge (list); 
struct sk_buff_head *  list;

Arguments

list

list to empty

Description

Delete all buffers on an sk_buff list. Each buffer is removed from the list and one reference dropped. This function takes the list lock and is atomic with respect to other list locking functions.


Name

skb_queue_head — queue a buffer at the list head

Synopsis

void skb_queue_head (list,  
 newsk); 
struct sk_buff_head *  list;
struct sk_buff *  newsk;

Arguments

list

list to use

newsk

buffer to queue

Description

Queue a buffer at the start of the list. This function takes the list lock and can be used safely with other locking sk_buff functions safely.

A buffer cannot be placed on two lists at the same time.


Name

skb_queue_tail — queue a buffer at the list tail

Synopsis

void skb_queue_tail (list,  
 newsk); 
struct sk_buff_head *  list;
struct sk_buff *  newsk;

Arguments

list

list to use

newsk

buffer to queue

Description

Queue a buffer at the tail of the list. This function takes the list lock and can be used safely with other locking sk_buff functions safely.

A buffer cannot be placed on two lists at the same time.


Name

skb_unlink — remove a buffer from a list

Synopsis

void skb_unlink (skb,  
 list); 
struct sk_buff *  skb;
struct sk_buff_head *  list;

Arguments

skb

buffer to remove

list

list to use

Description

Remove a packet from a list. The list locks are taken and this function is atomic with respect to other list locked calls

You must know what list the SKB is on.


Name

skb_append — append a buffer

Synopsis

void skb_append (old,  
 newsk,  
 list); 
struct sk_buff *  old;
struct sk_buff *  newsk;
struct sk_buff_head *  list;

Arguments

old

buffer to insert after

newsk

buffer to insert

list

list to use

Description

Place a packet after a given packet in a list. The list locks are taken and this function is atomic with respect to other list locked calls. A buffer cannot be placed on two lists at the same time.


Name

skb_insert — insert a buffer

Synopsis

void skb_insert (old,  
 newsk,  
 list); 
struct sk_buff *  old;
struct sk_buff *  newsk;
struct sk_buff_head *  list;

Arguments

old

buffer to insert before

newsk

buffer to insert

list

list to use

Description

Place a packet before a given packet in a list. The list locks are taken and this function is atomic with respect to other list locked calls.

A buffer cannot be placed on two lists at the same time.


Name

skb_split — Split fragmented skb to two parts at length len.

Synopsis

void skb_split (skb,  
 skb1,  
 len); 
struct sk_buff *  skb;
struct sk_buff *  skb1;
const u32  len;

Arguments

skb

the buffer to split

skb1

the buffer to receive the second part

len

new length for skb


Name

skb_prepare_seq_read — Prepare a sequential read of skb data

Synopsis

void skb_prepare_seq_read (skb,  
 from,  
 to,  
 st); 
struct sk_buff *  skb;
unsigned int  from;
unsigned int  to;
struct skb_seq_state *  st;

Arguments

skb

the buffer to read

from

lower offset of data to be read

to

upper offset of data to be read

st

state variable

Description

Initializes the specified state variable. Must be called before invoking skb_seq_read for the first time.


Name

skb_seq_read — Sequentially read skb data

Synopsis

unsigned int skb_seq_read (consumed,  
 data,  
 st); 
unsigned int  consumed;
const u8 **  data;
struct skb_seq_state *  st;

Arguments

consumed

number of bytes consumed by the caller so far

data

destination pointer for data to be returned

st

state variable

Description

Reads a block of skb data at consumed relative to the lower offset specified to skb_prepare_seq_read. Assigns the head of the data block to data and returns the length of the block or 0 if the end of the skb data or the upper offset has been reached.

The caller is not required to consume all of the data returned, i.e. consumed is typically set to the number of bytes already consumed and the next call to skb_seq_read will return the remaining part of the block.

Note 1

The size of each block of data returned can be arbitary, this limitation is the cost for zerocopy seqeuental reads of potentially non linear data.

Note 2

Fragment lists within fragments are not implemented at the moment, state->root_skb could be replaced with a stack for this purpose.


Name

skb_abort_seq_read — Abort a sequential read of skb data

Synopsis

void skb_abort_seq_read (st); 
struct skb_seq_state *  st;

Arguments

st

state variable

Description

Must be called if skb_seq_read was not called until it returned 0.


Name

skb_find_text — Find a text pattern in skb data

Synopsis

unsigned int skb_find_text (skb,  
 from,  
 to,  
 config,  
 state); 
struct sk_buff *  skb;
unsigned int  from;
unsigned int  to;
struct ts_config *  config;
struct ts_state *  state;

Arguments

skb

the buffer to look in

from

search offset

to

search limit

config

textsearch configuration

state

uninitialized textsearch state variable

Description

Finds a pattern in the skb data according to the specified textsearch configuration. Use textsearch_next to retrieve subsequent occurrences of the pattern. Returns the offset to the first occurrence or UINT_MAX if no match was found.


Name

skb_append_datato_frags — append the user data to a skb

Synopsis

int skb_append_datato_frags (sk,  
 skb,  
 getfrag,  
 from,  
 length); 
struct sock *  sk;
struct sk_buff *  skb;
int (* getfrag(void *from, char *to, int offset, int len, int odd, struct sk_buff *skb);
void *  from;
int  length;

Arguments

sk

sock structure

skb

skb structure to be appened with user data.

getfrag

call back function to be used for getting the user data

from

pointer to user message iov

length

length of the iov message

Description

This procedure append the user data in the fragment part of the skb if any page alloc fails user this procedure returns -ENOMEM


Name

skb_pull_rcsum — pull skb and update receive checksum

Synopsis

unsigned char * skb_pull_rcsum (skb,  
 len); 
struct sk_buff *  skb;
unsigned int  len;

Arguments

skb

buffer to update

len

length of data pulled

Description

This function performs an skb_pull on the packet and updates the CHECKSUM_COMPLETE checksum. It should be used on receive path processing instead of skb_pull unless you know that the checksum difference is zero (e.g., a valid IP header) or you are setting ip_summed to CHECKSUM_NONE.


Name

skb_segment — Perform protocol segmentation on skb.

Synopsis

struct sk_buff * skb_segment (skb,  
 features); 
struct sk_buff *  skb;
int  features;

Arguments

skb

buffer to segment

features

features for the output path (see dev->features)

Description

This function performs segmentation on the given skb. It returns a pointer to the first in a list of new skbs for the segments. In case of error it returns ERR_PTR(err).


Name

skb_cow_data — Check that a socket buffer's data buffers are writable

Synopsis

int skb_cow_data (skb,  
 tailbits,  
 trailer); 
struct sk_buff *  skb;
int  tailbits;
struct sk_buff **  trailer;

Arguments

skb

The socket buffer to check.

tailbits

Amount of trailing space to be added

trailer

Returned pointer to the skb where the tailbits space begins

Description

Make sure that the data buffers attached to a socket buffer are writable. If they are not, private copies are made of the data buffers and the socket buffer is set to use these instead.

If tailbits is given, make sure that there is space to write tailbits bytes of data beyond current end of socket buffer. trailer will be set to point to the skb in which this space begins.

The number of scatterlist elements required to completely map the COW'd and extended socket buffer will be returned.


Name

skb_partial_csum_set — set up and verify partial csum values for packet

Synopsis

bool skb_partial_csum_set (skb,  
 start,  
 off); 
struct sk_buff *  skb;
u16  start;
u16  off;

Arguments

skb

the skb to set

start

the number of bytes after skb->data to start checksumming.

off

the offset from start to place the checksum.

Description

For untrusted partially-checksummed packets, we need to make sure the values for skb->csum_start and skb->csum_offset are valid so we don't oops.

This function checks and sets those values and skb->ip_summed: if this returns false you should drop the packet.


Name

sk_alloc — All socket objects are allocated here

Synopsis

struct sock * sk_alloc (net,  
 family,  
 priority,  
 prot); 
struct net *  net;
int  family;
gfp_t  priority;
struct proto *  prot;

Arguments

net

the applicable net namespace

family

protocol family

priority

for allocation (GFP_KERNEL, GFP_ATOMIC, etc)

prot

struct proto associated with this new sock instance


Name

sk_wait_data — wait for data to arrive at sk_receive_queue

Synopsis

int sk_wait_data (sk,  
 timeo); 
struct sock *  sk;
long *  timeo;

Arguments

sk

sock to wait on

timeo

for how long

Description

Now socket state including sk->sk_err is changed only under lock, hence we may omit checks after joining wait queue. We check receive queue before schedule only as optimization; it is very likely that release_sock added new data.


Name

__sk_mem_schedule — increase sk_forward_alloc and memory_allocated

Synopsis

int __sk_mem_schedule (sk,  
 size,  
 kind); 
struct sock *  sk;
int  size;
int  kind;

Arguments

sk

socket

size

memory size to allocate

kind

allocation type

Description

If kind is SK_MEM_SEND, it means wmem allocation. Otherwise it means rmem allocation. This function assumes that protocols which have memory_pressure use sk_wmem_queued as write buffer accounting.


Name

__sk_mem_reclaim — reclaim memory_allocated

Synopsis

void __sk_mem_reclaim (sk); 
struct sock *  sk;

Arguments

sk

socket


Name

__skb_recv_datagram — Receive a datagram skbuff

Synopsis

struct sk_buff * __skb_recv_datagram (sk,  
 flags,  
 peeked,  
 err); 
struct sock *  sk;
unsigned  flags;
int *  peeked;
int *  err;

Arguments

sk

socket

flags

MSG_ flags

peeked

returns non-zero if this packet has been seen before

err

error code returned

Description

Get a datagram skbuff, understands the peeking, nonblocking wakeups and possible races. This replaces identical code in packet, raw and udp, as well as the IPX AX.25 and Appletalk. It also finally fixes the long standing peek and read race for datagram sockets. If you alter this routine remember it must be re-entrant.

This function will lock the socket if a skb is returned, so the caller needs to unlock the socket in that case (usually by calling skb_free_datagram)

* It does not lock socket since today. This function is * free of race conditions. This measure should/can improve * significantly datagram socket latencies at high loads, * when data copying to user space takes lots of time. * (BTW I've just killed the last cli in IP/IPv6/core/netlink/packet * 8) Great win.) * --ANK (980729)

The order of the tests when we find no data waiting are specified quite explicitly by POSIX 1003.1g, don't change them without having the standard around please.


Name

skb_kill_datagram — Free a datagram skbuff forcibly

Synopsis

int skb_kill_datagram (sk,  
 skb,  
 flags); 
struct sock *  sk;
struct sk_buff *  skb;
unsigned int  flags;

Arguments

sk

socket

skb

datagram skbuff

flags

MSG_ flags

Description

This function frees a datagram skbuff that was received by skb_recv_datagram. The flags argument must match the one used for skb_recv_datagram.

If the MSG_PEEK flag is set, and the packet is still on the receive queue of the socket, it will be taken off the queue before it is freed.

This function currently only disables BH when acquiring the sk_receive_queue lock. Therefore it must not be used in a context where that lock is acquired in an IRQ context.

It returns 0 if the packet was removed by us.


Name

skb_copy_datagram_iovec — Copy a datagram to an iovec.

Synopsis

int skb_copy_datagram_iovec (skb,  
 offset,  
 to,  
 len); 
const struct sk_buff *  skb;
int  offset;
struct iovec *  to;
int  len;

Arguments

skb

buffer to copy

offset

offset in the buffer to start copying from

to

io vector to copy to

len

amount of data to copy from buffer to iovec

Note

the iovec is modified during the copy.


Name

skb_copy_datagram_const_iovec — Copy a datagram to an iovec.

Synopsis

int skb_copy_datagram_const_iovec (skb,  
 offset,  
 to,  
 to_offset,  
 len); 
const struct sk_buff *  skb;
int  offset;
const struct iovec *  to;
int  to_offset;
int  len;

Arguments

skb

buffer to copy

offset

offset in the buffer to start copying from

to

io vector to copy to

to_offset

offset in the io vector to start copying to

len

amount of data to copy from buffer to iovec

Description

Returns 0 or -EFAULT.

Note

the iovec is not modified during the copy.


Name

skb_copy_datagram_from_iovec — Copy a datagram from an iovec.

Synopsis

int skb_copy_datagram_from_iovec (skb,  
 offset,  
 from,  
 from_offset,  
 len); 
struct sk_buff *  skb;
int  offset;
const struct iovec *  from;
int  from_offset;
int  len;

Arguments

skb

buffer to copy

offset

offset in the buffer to start copying to

from

io vector to copy to

from_offset

offset in the io vector to start copying from

len

amount of data to copy to buffer from iovec

Description

Returns 0 or -EFAULT.

Note

the iovec is not modified during the copy.


Name

skb_copy_and_csum_datagram_iovec — Copy and checkum skb to user iovec.

Synopsis

int skb_copy_and_csum_datagram_iovec (skb,  
 hlen,  
 iov); 
struct sk_buff *  skb;
int  hlen;
struct iovec *  iov;

Arguments

skb

skbuff

hlen

hardware length

iov

io vector

Description

Caller _must_ check that skb will fit to this iovec.

Returns

0 - success. -EINVAL - checksum failure. -EFAULT - fault during copy. Beware, in this case iovec can be modified!


Name

datagram_poll — generic datagram poll

Synopsis

unsigned int datagram_poll (file,  
 sock,  
 wait); 
struct file *  file;
struct socket *  sock;
poll_table *  wait;

Arguments

file

file struct

sock

socket

wait

poll table

Datagram poll

Again totally generic. This also handles sequenced packet sockets providing the socket receive queue is only ever holding data ready to receive.

Note

when you _don't_ use this routine for this protocol, and you use a different write policy from sock_writeable then please supply your own write_space callback.


Name

sk_stream_write_space — stream socket write_space callback.

Synopsis

void sk_stream_write_space (sk); 
struct sock *  sk;

Arguments

sk

socket

FIXME

write proper description


Name

sk_stream_wait_connect — Wait for a socket to get into the connected state

Synopsis

int sk_stream_wait_connect (sk,  
 timeo_p); 
struct sock *  sk;
long *  timeo_p;

Arguments

sk

sock to wait on

timeo_p

for how long to wait

Description

Must be called with the socket locked.


Name

sk_stream_wait_memory — Wait for more memory for a socket

Synopsis

int sk_stream_wait_memory (sk,  
 timeo_p); 
struct sock *  sk;
long *  timeo_p;

Arguments

sk

socket to wait for memory

timeo_p

for how long

Socket Filter

Name

sk_filter — run a packet through a socket filter

Synopsis

int sk_filter (sk,  
 skb); 
struct sock *  sk;
struct sk_buff *  skb;

Arguments

sk

sock associated with sk_buff

skb

buffer to filter

Description

Run the filter code and then cut skb->data to correct size returned by sk_run_filter. If pkt_len is 0 we toss packet. If skb->len is smaller than pkt_len we keep whole skb->data. This is the socket level wrapper to sk_run_filter. It returns 0 if the packet should be accepted or -EPERM if the packet should be tossed.


Name

sk_run_filter — run a filter on a socket

Synopsis

unsigned int sk_run_filter (skb,  
 filter,  
 flen); 
struct sk_buff *  skb;
struct sock_filter *  filter;
int  flen;

Arguments

skb

buffer to run the filter on

filter

filter to apply

flen

length of filter

Description

Decode and apply filter instructions to the skb->data. Return length to keep, 0 for none. skb is the data we are filtering, filter is the array of filter instructions, and len is the number of filter blocks in the array.


Name

sk_chk_filter — verify socket filter code

Synopsis

int sk_chk_filter (filter,  
 flen); 
struct sock_filter *  filter;
int  flen;

Arguments

filter

filter to verify

flen

length of filter

Description

Check the user's filter code. If we let some ugly filter code slip through kaboom! The filter must contain no references or jumps that are out of range, no illegal instructions, and must end with a RET instruction.

All jumps are forward as they are not signed.

Returns 0 if the rule set is legal or -EINVAL if not.

Generic Network Statistics

Name

struct gnet_stats_basic — byte/packet throughput statistics

Synopsis

struct gnet_stats_basic {
  __u64 bytes;
  __u32 packets;
};  

Members

bytes

number of seen bytes

packets

number of seen packets


Name

struct gnet_stats_rate_est — rate estimator

Synopsis

struct gnet_stats_rate_est {
  __u32 bps;
  __u32 pps;
};  

Members

bps

current byte rate

pps

current packet rate


Name

struct gnet_stats_queue — queuing statistics

Synopsis

struct gnet_stats_queue {
  __u32 qlen;
  __u32 backlog;
  __u32 drops;
  __u32 requeues;
  __u32 overlimits;
};  

Members

qlen

queue length

backlog

backlog size of queue

drops

number of dropped packets

requeues

number of requeues

overlimits

number of enqueues over the limit


Name

struct gnet_estimator — rate estimator configuration

Synopsis

struct gnet_estimator {
  signed char interval;
  unsigned char ewma_log;
};  

Members

interval

sampling period

ewma_log

the log of measurement window weight


Name

gnet_stats_start_copy_compat — start dumping procedure in compatibility mode

Synopsis

int gnet_stats_start_copy_compat (skb,  
 type,  
 tc_stats_type,  
 xstats_type,  
 lock,  
 d); 
struct sk_buff *  skb;
int  type;
int  tc_stats_type;
int  xstats_type;
spinlock_t *  lock;
struct gnet_dump *  d;

Arguments

skb

socket buffer to put statistics TLVs into

type

TLV type for top level statistic TLV

tc_stats_type

TLV type for backward compatibility struct tc_stats TLV

xstats_type

TLV type for backward compatibility xstats TLV

lock

statistics lock

d

dumping handle

Description

Initializes the dumping handle, grabs the statistic lock and appends an empty TLV header to the socket buffer for use a container for all other statistic TLVS.

The dumping handle is marked to be in backward compatibility mode telling all gnet_stats_copy_XXX functions to fill a local copy of struct tc_stats.

Returns 0 on success or -1 if the room in the socket buffer was not sufficient.


Name

gnet_stats_start_copy — start dumping procedure in compatibility mode

Synopsis

int gnet_stats_start_copy (skb,  
 type,  
 lock,  
 d); 
struct sk_buff *  skb;
int  type;
spinlock_t *  lock;
struct gnet_dump *  d;

Arguments

skb

socket buffer to put statistics TLVs into

type

TLV type for top level statistic TLV

lock

statistics lock

d

dumping handle

Description

Initializes the dumping handle, grabs the statistic lock and appends an empty TLV header to the socket buffer for use a container for all other statistic TLVS.

Returns 0 on success or -1 if the room in the socket buffer was not sufficient.


Name

gnet_stats_copy_basic — copy basic statistics into statistic TLV

Synopsis

int gnet_stats_copy_basic (d,  
 b); 
struct gnet_dump *  d;
struct gnet_stats_basic_packed *  b;

Arguments

d

dumping handle

b

basic statistics

Description

Appends the basic statistics to the top level TLV created by gnet_stats_start_copy.

Returns 0 on success or -1 with the statistic lock released if the room in the socket buffer was not sufficient.


Name

gnet_stats_copy_rate_est — copy rate estimator statistics into statistics TLV

Synopsis

int gnet_stats_copy_rate_est (d,  
 r); 
struct gnet_dump *  d;
struct gnet_stats_rate_est *  r;

Arguments

d

dumping handle

r

rate estimator statistics

Description

Appends the rate estimator statistics to the top level TLV created by gnet_stats_start_copy.

Returns 0 on success or -1 with the statistic lock released if the room in the socket buffer was not sufficient.


Name

gnet_stats_copy_queue — copy queue statistics into statistics TLV

Synopsis

int gnet_stats_copy_queue (d,  
 q); 
struct gnet_dump *  d;
struct gnet_stats_queue *  q;

Arguments

d

dumping handle

q

queue statistics

Description

Appends the queue statistics to the top level TLV created by gnet_stats_start_copy.

Returns 0 on success or -1 with the statistic lock released if the room in the socket buffer was not sufficient.


Name

gnet_stats_copy_app — copy application specific statistics into statistics TLV

Synopsis

int gnet_stats_copy_app (d,  
 st,  
 len); 
struct gnet_dump *  d;
void *  st;
int  len;

Arguments

d

dumping handle

st

application specific statistics data

len

length of data

Description

Appends the application sepecific statistics to the top level TLV created by gnet_stats_start_copy and remembers the data for XSTATS if the dumping handle is in backward compatibility mode.

Returns 0 on success or -1 with the statistic lock released if the room in the socket buffer was not sufficient.


Name

gnet_stats_finish_copy — finish dumping procedure

Synopsis

int gnet_stats_finish_copy (d); 
struct gnet_dump *  d;

Arguments

d

dumping handle

Description

Corrects the length of the top level TLV to include all TLVs added by gnet_stats_copy_XXX calls. Adds the backward compatibility TLVs if gnet_stats_start_copy_compat was used and releases the statistics lock.

Returns 0 on success or -1 with the statistic lock released if the room in the socket buffer was not sufficient.


Name

gen_new_estimator — create a new rate estimator

Synopsis

int gen_new_estimator (bstats,  
 rate_est,  
 stats_lock,  
 opt); 
struct gnet_stats_basic_packed *  bstats;
struct gnet_stats_rate_est *  rate_est;
spinlock_t *  stats_lock;
struct nlattr *  opt;

Arguments

bstats

basic statistics

rate_est

rate estimator statistics

stats_lock

statistics lock

opt

rate estimator configuration TLV

Description

Creates a new rate estimator with bstats as source and rate_est as destination. A new timer with the interval specified in the configuration TLV is created. Upon each interval, the latest statistics will be read from bstats and the estimated rate will be stored in rate_est with the statistics lock grabed during this period.

Returns 0 on success or a negative error code.

NOTE

Called under rtnl_mutex


Name

gen_kill_estimator — remove a rate estimator

Synopsis

void gen_kill_estimator (bstats,  
 rate_est); 
struct gnet_stats_basic_packed *  bstats;
struct gnet_stats_rate_est *  rate_est;

Arguments

bstats

basic statistics

rate_est

rate estimator statistics

Description

Removes the rate estimator specified by bstats and rate_est.

NOTE

Called under rtnl_mutex


Name

gen_replace_estimator — replace rate estimator configuration

Synopsis

int gen_replace_estimator (bstats,  
 rate_est,  
 stats_lock,  
 opt); 
struct gnet_stats_basic_packed *  bstats;
struct gnet_stats_rate_est *  rate_est;
spinlock_t *  stats_lock;
struct nlattr *  opt;

Arguments

bstats

basic statistics

rate_est

rate estimator statistics

stats_lock

statistics lock

opt

rate estimator configuration TLV

Description

Replaces the configuration of a rate estimator by calling gen_kill_estimator and gen_new_estimator.

Returns 0 on success or a negative error code.


Name

gen_estimator_active — test if estimator is currently in use

Synopsis

bool gen_estimator_active (bstats,  
 rate_est); 
const struct gnet_stats_basic_packed *  bstats;
const struct gnet_stats_rate_est *  rate_est;

Arguments

bstats

basic statistics

rate_est

rate estimator statistics

Description

Returns true if estimator is active, and false if not.

SUN RPC subsystem

Name

xdr_encode_opaque_fixed — Encode fixed length opaque data

Synopsis

__be32 * xdr_encode_opaque_fixed (p,  
 ptr,  
 nbytes); 
__be32 *  p;
const void *  ptr;
unsigned int  nbytes;

Arguments

p

pointer to current position in XDR buffer.

ptr

pointer to data to encode (or NULL)

nbytes

size of data.

Description

Copy the array of data of length nbytes at ptr to the XDR buffer at position p, then align to the next 32-bit boundary by padding with zero bytes (see RFC1832).

Note

if ptr is NULL, only the padding is performed.

Returns the updated current XDR buffer position


Name

xdr_encode_opaque — Encode variable length opaque data

Synopsis

__be32 * xdr_encode_opaque (p,  
 ptr,  
 nbytes); 
__be32 *  p;
const void *  ptr;
unsigned int  nbytes;

Arguments

p

pointer to current position in XDR buffer.

ptr

pointer to data to encode (or NULL)

nbytes

size of data.

Description

Returns the updated current XDR buffer position


Name

xdr_init_encode — Initialize a struct xdr_stream for sending data.

Synopsis

void xdr_init_encode (xdr,  
 buf,  
 p); 
struct xdr_stream *  xdr;
struct xdr_buf *  buf;
__be32 *  p;

Arguments

xdr

pointer to xdr_stream struct

buf

pointer to XDR buffer in which to encode data

p

current pointer inside XDR buffer

Note

at the moment the RPC client only passes the length of our scratch buffer in the xdr_buf's header kvec. Previously this meant we needed to call xdr_adjust_iovec after encoding the data. With the new scheme, the xdr_stream manages the details of the buffer length, and takes care of adjusting the kvec length for us.


Name

xdr_reserve_space — Reserve buffer space for sending

Synopsis

__be32 * xdr_reserve_space (xdr,  
 nbytes); 
struct xdr_stream *  xdr;
size_t  nbytes;

Arguments

xdr

pointer to xdr_stream

nbytes

number of bytes to reserve

Description

Checks that we have enough buffer space to encode 'nbytes' more bytes of data. If so, update the total xdr_buf length, and adjust the length of the current kvec.


Name

xdr_write_pages — Insert a list of pages into an XDR buffer for sending

Synopsis

void xdr_write_pages (xdr,  
 pages,  
 base,  
 len); 
struct xdr_stream *  xdr;
struct page **  pages;
unsigned int  base;
unsigned int  len;

Arguments

xdr

pointer to xdr_stream

pages

list of pages

base

offset of first byte

len

length of data in bytes


Name

xdr_init_decode — Initialize an xdr_stream for decoding data.

Synopsis

void xdr_init_decode (xdr,  
 buf,  
 p); 
struct xdr_stream *  xdr;
struct xdr_buf *  buf;
__be32 *  p;

Arguments

xdr

pointer to xdr_stream struct

buf

pointer to XDR buffer from which to decode data

p

current pointer inside XDR buffer


Name

xdr_inline_decode — Retrieve non-page XDR data to decode

Synopsis

__be32 * xdr_inline_decode (xdr,  
 nbytes); 
struct xdr_stream *  xdr;
size_t  nbytes;

Arguments

xdr

pointer to xdr_stream struct

nbytes

number of bytes of data to decode

Description

Check if the input buffer is long enough to enable us to decode 'nbytes' more bytes of data starting at the current position. If so return the current pointer, then update the current pointer position.


Name

xdr_read_pages — Ensure page-based XDR data to decode is aligned at current pointer position

Synopsis

void xdr_read_pages (xdr,  
 len); 
struct xdr_stream *  xdr;
unsigned int  len;

Arguments

xdr

pointer to xdr_stream struct

len

number of bytes of page data

Description

Moves data beyond the current pointer position from the XDR head[] buffer into the page list. Any data that lies beyond current position + “len” bytes is moved into the XDR tail[].


Name

xdr_enter_page — decode data from the XDR page

Synopsis

void xdr_enter_page (xdr,  
 len); 
struct xdr_stream *  xdr;
unsigned int  len;

Arguments

xdr

pointer to xdr_stream struct

len

number of bytes of page data

Description

Moves data beyond the current pointer position from the XDR head[] buffer into the page list. Any data that lies beyond current position + “len” bytes is moved into the XDR tail[]. The current pointer is then repositioned at the beginning of the first XDR page.


Name

svc_print_addr — Format rq_addr field for printing

Synopsis

char * svc_print_addr (rqstp,  
 buf,  
 len); 
struct svc_rqst *  rqstp;
char *  buf;
size_t  len;

Arguments

rqstp

svc_rqst struct containing address to print

buf

target buffer for formatted address

len

length of target buffer


Name

svc_reserve — change the space reserved for the reply to a request.

Synopsis

void svc_reserve (rqstp,  
 space); 
struct svc_rqst *  rqstp;
int  space;

Arguments

rqstp

The request in question

space

new max space to reserve

Description

Each request reserves some space on the output queue of the transport to make sure the reply fits. This function reduces that reserved space to be the amount of space used already, plus space.


Name

svc_find_xprt — find an RPC transport instance

Synopsis

struct svc_xprt * svc_find_xprt (serv,  
 xcl_name,  
 af,  
 port); 
struct svc_serv *  serv;
const char *  xcl_name;
const sa_family_t  af;
const unsigned short  port;

Arguments

serv

pointer to svc_serv to search

xcl_name

C string containing transport's class name

af

Address family of transport's local address

port

transport's IP port number

Description

Return the transport instance pointer for the endpoint accepting connections/peer traffic from the specified transport class, address family and port.

Specifying 0 for the address family or port is effectively a wild-card, and will result in matching the first transport in the service's list that has a matching class name.


Name

svc_xprt_names — format a buffer with a list of transport names

Synopsis

int svc_xprt_names (serv,  
 buf,  
 buflen); 
struct svc_serv *  serv;
char *  buf;
const int  buflen;

Arguments

serv

pointer to an RPC service

buf

pointer to a buffer to be filled in

buflen

length of buffer to be filled in

Description

Fills in buf with a string containing a list of transport names, each name terminated with '\n'.

Returns positive length of the filled-in string on success; otherwise a negative errno value is returned if an error occurs.


Name

xprt_register_transport — register a transport implementation

Synopsis

int xprt_register_transport (transport); 
struct xprt_class *  transport;

Arguments

transport

transport to register

Description

If a transport implementation is loaded as a kernel module, it can call this interface to make itself known to the RPC client.

0

transport successfully registered -EEXIST: transport already registered -EINVAL: transport module being unloaded


Name

xprt_unregister_transport — unregister a transport implementation

Synopsis

int xprt_unregister_transport (transport); 
struct xprt_class *  transport;

Arguments

transport

transport to unregister

0

transport successfully unregistered -ENOENT: transport never registered


Name

xprt_load_transport — load a transport implementation

Synopsis

int xprt_load_transport (transport_name); 
const char *  transport_name;

Arguments

transport_name

transport to load

0

transport successfully loaded -ENOENT: transport module not available


Name

xprt_reserve_xprt — serialize write access to transports

Synopsis

int xprt_reserve_xprt (task); 
struct rpc_task *  task;

Arguments

task

task that is requesting access to the transport

Description

This prevents mixing the payload of separate requests, and prevents transport connects from colliding with writes. No congestion control is provided.


Name

xprt_release_xprt — allow other requests to use a transport

Synopsis

void xprt_release_xprt (xprt,  
 task); 
struct rpc_xprt *  xprt;
struct rpc_task *  task;

Arguments

xprt

transport with other tasks potentially waiting

task

task that is releasing access to the transport

Description

Note that “task” can be NULL. No congestion control is provided.


Name

xprt_release_xprt_cong — allow other requests to use a transport

Synopsis

void xprt_release_xprt_cong (xprt,  
 task); 
struct rpc_xprt *  xprt;
struct rpc_task *  task;

Arguments

xprt

transport with other tasks potentially waiting

task

task that is releasing access to the transport

Description

Note that “task” can be NULL. Another task is awoken to use the transport if the transport's congestion window allows it.


Name

xprt_release_rqst_cong — housekeeping when request is complete

Synopsis

void xprt_release_rqst_cong (task); 
struct rpc_task *  task;

Arguments

task

RPC request that recently completed

Description

Useful for transports that require congestion control.


Name

xprt_adjust_cwnd — adjust transport congestion window

Synopsis

void xprt_adjust_cwnd (task,  
 result); 
struct rpc_task *  task;
int  result;

Arguments

task

recently completed RPC request used to adjust window

result

result code of completed RPC request

Description

We use a time-smoothed congestion estimator to avoid heavy oscillation.


Name

xprt_wake_pending_tasks — wake all tasks on a transport's pending queue

Synopsis

void xprt_wake_pending_tasks (xprt,  
 status); 
struct rpc_xprt *  xprt;
int  status;

Arguments

xprt

transport with waiting tasks

status

result code to plant in each task before waking it


Name

xprt_wait_for_buffer_space — wait for transport output buffer to clear

Synopsis

void xprt_wait_for_buffer_space (task,  
 action); 
struct rpc_task *  task;
rpc_action  action;

Arguments

task

task to be put to sleep

action

function pointer to be executed after wait


Name

xprt_write_space — wake the task waiting for transport output buffer space

Synopsis

void xprt_write_space (xprt); 
struct rpc_xprt *  xprt;

Arguments

xprt

transport with waiting tasks

Description

Can be called in a soft IRQ context, so xprt_write_space never sleeps.


Name

xprt_set_retrans_timeout_def — set a request's retransmit timeout

Synopsis

void xprt_set_retrans_timeout_def (task); 
struct rpc_task *  task;

Arguments

task

task whose timeout is to be set

Description

Set a request's retransmit timeout based on the transport's default timeout parameters. Used by transports that don't adjust the retransmit timeout based on round-trip time estimation.


Name

xprt_disconnect_done — mark a transport as disconnected

Synopsis

void xprt_disconnect_done (xprt); 
struct rpc_xprt *  xprt;

Arguments

xprt

transport to flag for disconnect


Name

xprt_lookup_rqst — find an RPC request corresponding to an XID

Synopsis

struct rpc_rqst * xprt_lookup_rqst (xprt,  
 xid); 
struct rpc_xprt *  xprt;
__be32  xid;

Arguments

xprt

transport on which the original request was transmitted

xid

RPC XID of incoming reply


Name

xprt_update_rtt — update an RPC client's RTT state after receiving a reply

Synopsis

void xprt_update_rtt (task); 
struct rpc_task *  task;

Arguments

task

RPC request that recently completed


Name

xprt_complete_rqst — called when reply processing is complete

Synopsis

void xprt_complete_rqst (task,  
 copied); 
struct rpc_task *  task;
int  copied;

Arguments

task

RPC request that recently completed

copied

actual number of bytes received from the transport

Description

Caller holds transport lock.


Name

rpc_wake_up — wake up all rpc_tasks

Synopsis

void rpc_wake_up (queue); 
struct rpc_wait_queue *  queue;

Arguments

queue

rpc_wait_queue on which the tasks are sleeping

Description

Grabs queue->lock


Name

rpc_wake_up_status — wake up all rpc_tasks and set their status value.

Synopsis

void rpc_wake_up_status (queue,  
 status); 
struct rpc_wait_queue *  queue;
int  status;

Arguments

queue

rpc_wait_queue on which the tasks are sleeping

status

status value to set

Description

Grabs queue->lock


Name

rpc_malloc — allocate an RPC buffer

Synopsis

void * rpc_malloc (task,  
 size); 
struct rpc_task *  task;
size_t  size;

Arguments

task

RPC task that will use this buffer

size

requested byte size

Description

To prevent rpciod from hanging, this allocator never sleeps, returning NULL if the request cannot be serviced immediately. The caller can arrange to sleep in a way that is safe for rpciod.

Most requests are 'small' (under 2KiB) and can be serviced from a mempool, ensuring that NFS reads and writes can always proceed, and that there is good locality of reference for these buffers.

In order to avoid memory starvation triggering more writebacks of NFS requests, we avoid using GFP_KERNEL.


Name

rpc_free — free buffer allocated via rpc_malloc

Synopsis

void rpc_free (buffer); 
void *  buffer;

Arguments

buffer

buffer to free


Name

xdr_skb_read_bits — copy some data bits from skb to internal buffer

Synopsis

size_t xdr_skb_read_bits (desc,  
 to,  
 len); 
struct xdr_skb_reader *  desc;
void *  to;
size_t  len;

Arguments

desc

sk_buff copy helper

to

copy destination

len

number of bytes to copy

Description

Possibly called several times to iterate over an sk_buff and copy data out of it.


Name

xdr_partial_copy_from_skb — copy data out of an skb

Synopsis

ssize_t xdr_partial_copy_from_skb (xdr,  
 base,  
 desc,  
 copy_actor); 
struct xdr_buf *  xdr;
unsigned int  base;
struct xdr_skb_reader *  desc;
xdr_skb_read_actor  copy_actor;

Arguments

xdr

target XDR buffer

base

starting offset

desc

sk_buff copy helper

copy_actor

virtual method for copying data


Name

csum_partial_copy_to_xdr — checksum and copy data

Synopsis

int csum_partial_copy_to_xdr (xdr,  
 skb); 
struct xdr_buf *  xdr;
struct sk_buff *  skb;

Arguments

xdr

target XDR buffer

skb

source skb

Description

We have set things up such that we perform the checksum of the UDP packet in parallel with the copies into the RPC client iovec. -DaveM


Name

rpc_alloc_iostats — allocate an rpc_iostats structure

Synopsis

struct rpc_iostats * rpc_alloc_iostats (clnt); 
struct rpc_clnt *  clnt;

Arguments

clnt

RPC program, version, and xprt


Name

rpc_free_iostats — release an rpc_iostats structure

Synopsis

void rpc_free_iostats (stats); 
struct rpc_iostats *  stats;

Arguments

stats

doomed rpc_iostats structure


Name

rpc_queue_upcall —

Synopsis

int rpc_queue_upcall (inode,  
 msg); 
struct inode *  inode;
struct rpc_pipe_msg *  msg;

Arguments

inode

inode of upcall pipe on which to queue given message

msg

message to queue

Description

Call with an inode created by rpc_mkpipe to queue an upcall. A userspace process may then later read the upcall by performing a read on an open file for this inode. It is up to the caller to initialize the fields of msg (other than msg->list) appropriately.


Name

rpc_mkpipe — make an rpc_pipefs file for kernel<->userspace communication

Synopsis

struct dentry * rpc_mkpipe (parent,  
 name,  
 private,  
 ops,  
 flags); 
struct dentry *  parent;
const char *  name;
void *  private;
const struct rpc_pipe_ops *  ops;
int  flags;

Arguments

parent

dentry of directory to create new “pipe” in

name

name of pipe

private

private data to associate with the pipe, for the caller's use

ops

operations defining the behavior of the pipe: upcall, downcall, release_pipe, open_pipe, and destroy_msg.

flags

rpc_inode flags

Description

Data is made available for userspace to read by calls to rpc_queue_upcall. The actual reads will result in calls to ops->upcall, which will be called with the file pointer, message, and userspace buffer to copy to.

Writes can come at any time, and do not necessarily have to be responses to upcalls. They will result in calls to msg->downcall.

The private argument passed here will be available to all these methods from the file pointer, via RPC_I(file->f_dentry->d_inode)->private.


Name

rpc_unlink — remove a pipe

Synopsis

int rpc_unlink (dentry); 
struct dentry *  dentry;

Arguments

dentry

dentry for the pipe, as returned from rpc_mkpipe

Description

After this call, lookups will no longer find the pipe, and any attempts to read or write using preexisting opens of the pipe will return -EPIPE.


Name

rpcb_getport_sync — obtain the port for an RPC service on a given host

Synopsis

int rpcb_getport_sync (sin,  
 prog,  
 vers,  
 prot); 
struct sockaddr_in *  sin;
u32  prog;
u32  vers;
int  prot;

Arguments

sin

address of remote peer

prog

RPC program number to bind

vers

RPC version number to bind

prot

transport protocol to use to make this request

Description

Return value is the requested advertised port number, or a negative errno value.

Called from outside the RPC client in a synchronous task context. Uses default timeout parameters specified by underlying transport.

XXX

Needs to support IPv6


Name

rpcb_getport_async — obtain the port for a given RPC service on a given host

Synopsis

void rpcb_getport_async (task); 
struct rpc_task *  task;

Arguments

task

task that is waiting for portmapper request

Description

This one can be called for an ongoing RPC request, and can be used in an async (rpciod) context.


Name

rpc_bind_new_program — bind a new RPC program to an existing client

Synopsis

struct rpc_clnt * rpc_bind_new_program (old,  
 program,  
 vers); 
struct rpc_clnt *  old;
struct rpc_program *  program;
u32  vers;

Arguments

old

old rpc_client

program

rpc program to set

vers

rpc program version

Description

Clones the rpc client and sets up a new RPC program. This is mainly of use for enabling different RPC programs to share the same transport. The Sun NFSv2/v3 ACL protocol can do this.


Name

rpc_run_task — Allocate a new RPC task, then run rpc_execute against it

Synopsis

struct rpc_task * rpc_run_task (task_setup_data); 
const struct rpc_task_setup *  task_setup_data;

Arguments

task_setup_data

pointer to task initialisation data


Name

rpc_call_sync — Perform a synchronous RPC call

Synopsis

int rpc_call_sync (clnt,  
 msg,  
 flags); 
struct rpc_clnt *  clnt;
const struct rpc_message *  msg;
int  flags;

Arguments

clnt

pointer to RPC client

msg

RPC call parameters

flags

RPC call flags


Name

rpc_call_async — Perform an asynchronous RPC call

Synopsis

int rpc_call_async (clnt,  
 msg,  
 flags,  
 tk_ops,  
 data); 
struct rpc_clnt *  clnt;
const struct rpc_message *  msg;
int  flags;
const struct rpc_call_ops *  tk_ops;
void *  data;

Arguments

clnt

pointer to RPC client

msg

RPC call parameters

flags

RPC call flags

tk_ops

RPC call ops

data

user call data


Name

rpc_peeraddr — extract remote peer address from clnt's xprt

Synopsis

size_t rpc_peeraddr (clnt,  
 buf,  
 bufsize); 
struct rpc_clnt *  clnt;
struct sockaddr *  buf;
size_t  bufsize;

Arguments

clnt

RPC client structure

buf

target buffer

bufsize

length of target buffer

Description

Returns the number of bytes that are actually in the stored address.


Name

rpc_peeraddr2str — return remote peer address in printable format

Synopsis

const char * rpc_peeraddr2str (clnt,  
 format); 
struct rpc_clnt *  clnt;
enum rpc_display_format_t  format;

Arguments

clnt

RPC client structure

format

address format


Name

rpc_force_rebind — force transport to check that remote port is unchanged

Synopsis

void rpc_force_rebind (clnt); 
struct rpc_clnt *  clnt;

Arguments

clnt

client to rebind

WiMAX

Name

wimax_msg_alloc — Create a new skb for sending a message to userspace

Synopsis

struct sk_buff * wimax_msg_alloc (wimax_dev,  
 pipe_name,  
 msg,  
 size,  
 gfp_flags); 
struct wimax_dev *  wimax_dev;
const char *  pipe_name;
const void *  msg;
size_t  size;
gfp_t  gfp_flags;

Arguments

wimax_dev

WiMAX device descriptor

pipe_name

"named pipe" the message will be sent to

msg

pointer to the message data to send

size

size of the message to send (in bytes), including the header.

gfp_flags

flags for memory allocation.

Returns

0 if ok, negative errno code on error

Description

Allocates an skb that will contain the message to send to user space over the messaging pipe and initializes it, copying the payload.

Once this call is done, you can deliver it with wimax_msg_send.

IMPORTANT

Don't use skb_push/skb_pull/skb_reserve on the skb, as wimax_msg_send depends on skb->data being placed at the beginning of the user message.

Unlike other WiMAX stack calls, this call can be used way early, even before wimax_dev_add is called, as long as the wimax_dev->net_dev pointer is set to point to a proper net_dev. This is so that drivers can use it early in case they need to send stuff around or communicate with user space.


Name

wimax_msg_data_len — Return a pointer and size of a message's payload

Synopsis

const void * wimax_msg_data_len (msg,  
 size); 
struct sk_buff *  msg;
size_t *  size;

Arguments

msg

Pointer to a message created with wimax_msg_alloc

size

Pointer to where to store the message's size

Description

Returns the pointer to the message data.


Name

wimax_msg_data — Return a pointer to a message's payload

Synopsis

const void * wimax_msg_data (msg); 
struct sk_buff *  msg;

Arguments

msg

Pointer to a message created with wimax_msg_alloc


Name

wimax_msg_len — Return a message's payload length

Synopsis

ssize_t wimax_msg_len (msg); 
struct sk_buff *  msg;

Arguments

msg

Pointer to a message created with wimax_msg_alloc


Name

wimax_msg_send — Send a pre-allocated message to user space

Synopsis

int wimax_msg_send (wimax_dev,  
 skb); 
struct wimax_dev *  wimax_dev;
struct sk_buff *  skb;

Arguments

wimax_dev

WiMAX device descriptor

skb

struct sk_buff returned by wimax_msg_alloc. Note the ownership of skb is transferred to this function.

Returns

0 if ok, < 0 errno code on error

Description

Sends a free-form message that was preallocated with wimax_msg_alloc and filled up.

Assumes that once you pass an skb to this function for sending, it owns it and will release it when done (on success).

IMPORTANT

Don't use skb_push/skb_pull/skb_reserve on the skb, as wimax_msg_send depends on skb->data being placed at the beginning of the user message.

Unlike other WiMAX stack calls, this call can be used way early, even before wimax_dev_add is called, as long as the wimax_dev->net_dev pointer is set to point to a proper net_dev. This is so that drivers can use it early in case they need to send stuff around or communicate with user space.


Name

wimax_msg — Send a message to user space

Synopsis

int wimax_msg (wimax_dev,  
 pipe_name,  
 buf,  
 size,  
 gfp_flags); 
struct wimax_dev *  wimax_dev;
const char *  pipe_name;
const void *  buf;
size_t  size;
gfp_t  gfp_flags;

Arguments

wimax_dev

WiMAX device descriptor (properly referenced)

pipe_name

"named pipe" the message will be sent to

buf

pointer to the message to send.

size

size of the buffer pointed to by buf (in bytes).

gfp_flags

flags for memory allocation.

Returns

0 if ok, negative errno code on error.

Description

Sends a free-form message to user space on the device wimax_dev.

NOTES

Once the skb is given to this function, who will own it and will release it when done (unless it returns error).


Name

wimax_reset — Reset a WiMAX device

Synopsis

int wimax_reset (wimax_dev); 
struct wimax_dev *  wimax_dev;

Arguments

wimax_dev

WiMAX device descriptor

Returns

0 if ok and a warm reset was done (the device still exists in the system).

-ENODEV if a cold/bus reset had to be done (device has disconnected and reconnected, so current handle is not valid any more).

-EINVAL if the device is not even registered.

Any other negative error code shall be considered as non-recoverable.

Description

Called when wanting to reset the device for any reason. Device is taken back to power on status.

This call blocks; on succesful return, the device has completed the reset process and is ready to operate.


Name

wimax_report_rfkill_hw — Reports changes in the hardware RF switch

Synopsis

void wimax_report_rfkill_hw (wimax_dev,  
 state); 
struct wimax_dev *  wimax_dev;
enum wimax_rf_state  state;

Arguments

wimax_dev

WiMAX device descriptor

state

New state of the RF Kill switch. WIMAX_RF_ON radio on, WIMAX_RF_OFF radio off.

Description

When the device detects a change in the state of thehardware RF switch, it must call this function to let the WiMAX kernel stack know that the state has changed so it can be properly propagated.

The WiMAX stack caches the state (the driver doesn't need to). As well, as the change is propagated it will come back as a request to change the software state to mirror the hardware state.

If the device doesn't have a hardware kill switch, just report it on initialization as always on (WIMAX_RF_ON, radio on).


Name

wimax_report_rfkill_sw — Reports changes in the software RF switch

Synopsis

void wimax_report_rfkill_sw (wimax_dev,  
 state); 
struct wimax_dev *  wimax_dev;
enum wimax_rf_state  state;

Arguments

wimax_dev

WiMAX device descriptor

state

New state of the RF kill switch. WIMAX_RF_ON radio on, WIMAX_RF_OFF radio off.

Description

Reports changes in the software RF switch state to the the WiMAX stack.

The main use is during initialization, so the driver can query the device for its current software radio kill switch state and feed it to the system.

On the side, the device does not change the software state by itself. In practice, this can happen, as the device might decide to switch (in software) the radio off for different reasons.


Name

wimax_rfkill — Set the software RF switch state for a WiMAX device

Synopsis

int wimax_rfkill (wimax_dev,  
 state); 
struct wimax_dev *  wimax_dev;
enum wimax_rf_state  state;

Arguments

wimax_dev

WiMAX device descriptor

state

New RF state.

Returns

>= 0 toggle state if ok, < 0 errno code on error. The toggle state is returned as a bitmap, bit 0 being the hardware RF state, bit 1 the software RF state.

0 means disabled (WIMAX_RF_ON, radio on), 1 means enabled radio off (WIMAX_RF_OFF).

Description

Called by the user when he wants to request the WiMAX radio to be switched on (WIMAX_RF_ON) or off (WIMAX_RF_OFF). With WIMAX_RF_QUERY, just the current state is returned.

NOTE

This call will block until the operation is complete.


Name

wimax_state_change — Set the current state of a WiMAX device

Synopsis

void wimax_state_change (wimax_dev,  
 new_state); 
struct wimax_dev *  wimax_dev;
enum wimax_st  new_state;

Arguments

wimax_dev

WiMAX device descriptor (properly referenced)

new_state

New state to switch to

Description

This implements the state changes for the wimax devices. It will

- verify that the state transition is legal (for now it'll just print a warning if not) according to the table in linux/wimax.h's documentation for 'enum wimax_st'.

- perform the actions needed for leaving the current state and whichever are needed for entering the new state.

- issue a report to user space indicating the new state (and an optional payload with information about the new state).

NOTE

wimax_dev must be locked


Name

wimax_state_get — Return the current state of a WiMAX device

Synopsis

enum wimax_st wimax_state_get (wimax_dev); 
struct wimax_dev *  wimax_dev;

Arguments

wimax_dev

WiMAX device descriptor

Returns

Current state of the device according to its driver.


Name

wimax_dev_init — initialize a newly allocated instance

Synopsis

void wimax_dev_init (wimax_dev); 
struct wimax_dev *  wimax_dev;

Arguments

wimax_dev

WiMAX device descriptor to initialize.

Description

Initializes fields of a freshly allocated wimax_dev instance. This function assumes that after allocation, the memory occupied by wimax_dev was zeroed.


Name

wimax_dev_add — Register a new WiMAX device

Synopsis

int wimax_dev_add (wimax_dev,  
 net_dev); 
struct wimax_dev *  wimax_dev;
struct net_device *  net_dev;

Arguments

wimax_dev

WiMAX device descriptor (as embedded in your net_dev's priv data). You must have called wimax_dev_init on it before.

net_dev

net device the wimax_dev is associated with. The function expects SET_NETDEV_DEV and register_netdev were already called on it.

Description

Registers the new WiMAX device, sets up the user-kernel control interface (generic netlink) and common WiMAX infrastructure.

Note that the parts that will allow interaction with user space are setup at the very end, when the rest is in place, as once that happens, the driver might get user space control requests via netlink or from debugfs that might translate into calls into wimax_dev->op_*().


Name

wimax_dev_rm — Unregister an existing WiMAX device

Synopsis

void wimax_dev_rm (wimax_dev); 
struct wimax_dev *  wimax_dev;

Arguments

wimax_dev

WiMAX device descriptor

Description

Unregisters a WiMAX device previously registered for use with wimax_add_rm.

IMPORTANT! Must call before calling unregister_netdev.

After this function returns, you will not get any more user space control requests (via netlink or debugfs) and thus to wimax_dev->ops.

Reentrancy control is ensured by setting the state to __WIMAX_ST_QUIESCING. rfkill operations coming through wimax_*rfkill*() will be stopped by the quiescing state; ops coming from the rfkill subsystem will be stopped by the support being removed by wimax_rfkill_rm.


Name

struct wimax_dev — Generic WiMAX device

Synopsis

struct wimax_dev {
  struct net_device * net_dev;
  struct list_head id_table_node;
  struct mutex mutex;
  struct mutex mutex_reset;
  enum wimax_st state;
  int (* op_msg_from_user) (struct wimax_dev *wimax_dev,const char *,const void *, size_t,const struct genl_info *info);
  int (* op_rfkill_sw_toggle) (struct wimax_dev *wimax_dev,enum wimax_rf_state);
  int (* op_reset) (struct wimax_dev *wimax_dev);
  struct rfkill * rfkill;
  unsigned rf_hw;
  unsigned rf_sw;
  char name[32];
  struct dentry * debugfs_dentry;
};  

Members

net_dev

[fill] Pointer to the struct net_device this WiMAX device implements.

id_table_node

[private] link to the list of wimax devices kept by id-table.c. Protected by it's own spinlock.

mutex

[private] Serializes all concurrent access and execution of operations.

mutex_reset

[private] Serializes reset operations. Needs to be a different mutex because as part of the reset operation, the driver has to call back into the stack to do things such as state change, that require wimax_dev->mutex.

state

[private] Current state of the WiMAX device.

op_msg_from_user

[fill] Driver-specific operation to handle a raw message from user space to the driver. The driver can send messages to user space using with wimax_msg_to_user.

op_rfkill_sw_toggle

[fill] Driver-specific operation to act on userspace (or any other agent) requesting the WiMAX device to change the RF Kill software switch (WIMAX_RF_ON or WIMAX_RF_OFF). If such hardware support is not present, it is assumed the radio cannot be switched off and it is always on (and the stack will error out when trying to switch it off). In such case, this function pointer can be left as NULL.

op_reset

[fill] Driver specific operation to reset the device. This operation should always attempt first a warm reset that does not disconnect the device from the bus and return 0. If that fails, it should resort to some sort of cold or bus reset (even if it implies a bus disconnection and device dissapearance). In that case, -ENODEV should be returned to indicate the device is gone. This operation has to be synchronous, and return only when the reset is complete. In case of having had to resort to bus/cold reset implying a device disconnection, the call is allowed to return inmediately.

rfkill

[private] integration into the RF-Kill infrastructure.

rf_hw

[private] State of the hardware radio switch (OFF/ON)

rf_sw

[private] State of the software radio switch (OFF/ON)

name[32]

[fill] A way to identify this device. We need to register a name with many subsystems (rfkill, workqueue creation, etc). We can't use the network device name as that might change and in some instances we don't know it yet (until we don't call register_netdev). So we generate an unique one using the driver name and device bus id, place it here and use it across the board. Recommended naming: DRIVERNAME-BUSNAME:BUSID (dev->bus->name, dev->bus_id).

debugfs_dentry

[private] Used to hook up a debugfs entry. This shows up in the debugfs root as wimax\:DEVICENAME.

NOTE

wimax_dev->mutex is NOT locked when this op is being called; however, wimax_dev->mutex_reset IS locked to ensure serialization of calls to wimax_reset. See wimax_reset's documentation.

Description

This structure defines a common interface to access all WiMAX devices from different vendors and provides a common API as well as a free-form device-specific messaging channel.

Usage

1. Embed a struct wimax_dev at *the beginning* the network device structure so that netdev_priv points to it.

2. memset it to zero

3. Initialize with wimax_dev_init. This will leave the WiMAX device in the __WIMAX_ST_NULL state.

4. Fill all the fields marked with [fill]; once called wimax_dev_add, those fields CANNOT be modified.

5. Call wimax_dev_add *after* registering the network device. This will leave the WiMAX device in the WIMAX_ST_DOWN state. Protect the driver's net_device->open against succeeding if the wimax device state is lower than WIMAX_ST_DOWN.

6. Select when the device is going to be turned on/initialized; for example, it could be initialized on 'ifconfig up' (when the netdev op 'open' is called on the driver).

When the device is initialized (at `ifconfig up` time, or right after calling wimax_dev_add from _probe, make sure the following steps are taken

a. Move the device to WIMAX_ST_UNINITIALIZED. This is needed so some API calls that shouldn't work until the device is ready can be blocked.

b. Initialize the device. Make sure to turn the SW radio switch off and move the device to state WIMAX_ST_RADIO_OFF when done. When just initialized, a device should be left in RADIO OFF state until user space devices to turn it on.

c. Query the device for the state of the hardware rfkill switch and call wimax_rfkill_report_hw and wimax_rfkill_report_sw as needed. See below.

wimax_dev_rm undoes before unregistering the network device. Once wimax_dev_add is called, the driver can get called on the wimax_dev->op_* function pointers

CONCURRENCY

The stack provides a mutex for each device that will disallow API calls happening concurrently; thus, op calls into the driver through the wimax_dev->op*() function pointers will always be serialized and *never* concurrent.

For locking, take wimax_dev->mutex is taken; (most) operations in the API have to check for wimax_dev_is_ready to return 0 before continuing (this is done internally).

REFERENCE COUNTING

The WiMAX device is reference counted by the associated network device. The only operation that can be used to reference the device is wimax_dev_get_by_genl_info, and the reference it acquires has to be released with dev_put(wimax_dev->net_dev).

RFKILL

At startup, both HW and SW radio switchess are assumed to be off.

At initialization time [after calling wimax_dev_add], have the driver query the device for the status of the software and hardware RF kill switches and call wimax_report_rfkill_hw and wimax_rfkill_report_sw to indicate their state. If any is missing, just call it to indicate it is ON (radio always on).

Whenever the driver detects a change in the state of the RF kill switches, it should call wimax_report_rfkill_hw or wimax_report_rfkill_sw to report it to the stack.


Name

enum wimax_st — The different states of a WiMAX device

Synopsis

enum wimax_st {
  __WIMAX_ST_NULL,
  WIMAX_ST_DOWN,
  __WIMAX_ST_QUIESCING,
  WIMAX_ST_UNINITIALIZED,
  WIMAX_ST_RADIO_OFF,
  WIMAX_ST_READY,
  WIMAX_ST_SCANNING,
  WIMAX_ST_CONNECTING,
  WIMAX_ST_CONNECTED,
  __WIMAX_ST_INVALID
};  

Constants

__WIMAX_ST_NULL

The device structure has been allocated and zeroed, but still wimax_dev_add hasn't been called. There is no state.

WIMAX_ST_DOWN

The device has been registered with the WiMAX and networking stacks, but it is not initialized (normally that is done with 'ifconfig DEV up' [or equivalent], which can upload firmware and enable communications with the device). In this state, the device is powered down and using as less power as possible. This state is the default after a call to wimax_dev_add. It is ok to have drivers move directly to WIMAX_ST_UNINITIALIZED or WIMAX_ST_RADIO_OFF in _probe after the call to wimax_dev_add. It is recommended that the driver leaves this state when calling 'ifconfig DEV up' and enters it back on 'ifconfig DEV down'.

__WIMAX_ST_QUIESCING

The device is being torn down, so no API operations are allowed to proceed except the ones needed to complete the device clean up process.

WIMAX_ST_UNINITIALIZED

[optional] Communication with the device is setup, but the device still requires some configuration before being operational. Some WiMAX API calls might work.

WIMAX_ST_RADIO_OFF

The device is fully up; radio is off (wether by hardware or software switches). It is recommended to always leave the device in this state after initialization.

WIMAX_ST_READY

The device is fully up and radio is on.

WIMAX_ST_SCANNING

[optional] The device has been instructed to scan. In this state, the device cannot be actively connected to a network.

WIMAX_ST_CONNECTING

The device is connecting to a network. This state exists because in some devices, the connect process can include a number of negotiations between user space, kernel space and the device. User space needs to know what the device is doing. If the connect sequence in a device is atomic and fast, the device can transition directly to CONNECTED

WIMAX_ST_CONNECTED

The device is connected to a network.

__WIMAX_ST_INVALID

This is an invalid state used to mark the maximum numeric value of states.

Description

Transitions from one state to another one are atomic and can only be caused in kernel space with wimax_state_change. To read the state, use wimax_state_get.

States starting with __ are internal and shall not be used or referred to by drivers or userspace. They look ugly, but that's the point -- if any use is made non-internal to the stack, it is easier to catch on review.

All API operations [with well defined exceptions] will take the device mutex before starting and then check the state. If the state is __WIMAX_ST_NULL, WIMAX_ST_DOWN, WIMAX_ST_UNINITIALIZED or __WIMAX_ST_QUIESCING, it will drop the lock and quit with -EINVAL, -ENOMEDIUM, -ENOTCONN or -ESHUTDOWN.

The order of the definitions is important, so we can do numerical comparisons (eg: < WIMAX_ST_RADIO_OFF means the device is not ready to operate).

Chapter 2. Network device support

Table of Contents

Driver Support
PHY Support

Driver Support

Name

dev_add_pack — add packet handler

Synopsis

void dev_add_pack (pt); 
struct packet_type *  pt;

Arguments

pt

packet type declaration

Description

Add a protocol handler to the networking stack. The passed packet_type is linked into kernel lists and may not be freed until it has been removed from the kernel lists.

This call does not sleep therefore it can not guarantee all CPU's that are in middle of receiving packets will see the new packet type (until the next received packet).


Name

__dev_remove_pack — remove packet handler

Synopsis

void __dev_remove_pack (pt); 
struct packet_type *  pt;

Arguments

pt

packet type declaration

Description

Remove a protocol handler that was previously added to the kernel protocol handlers by dev_add_pack. The passed packet_type is removed from the kernel lists and can be freed or reused once this function returns.

The packet type might still be in use by receivers and must not be freed until after all the CPU's have gone through a quiescent state.


Name

dev_remove_pack — remove packet handler

Synopsis

void dev_remove_pack (pt); 
struct packet_type *  pt;

Arguments

pt

packet type declaration

Description

Remove a protocol handler that was previously added to the kernel protocol handlers by dev_add_pack. The passed packet_type is removed from the kernel lists and can be freed or reused once this function returns.

This call sleeps to guarantee that no CPU is looking at the packet type after return.


Name

netdev_boot_setup_check — check boot time settings

Synopsis

int netdev_boot_setup_check (dev); 
struct net_device *  dev;

Arguments

dev

the netdevice

Description

Check boot time settings for the device. The found settings are set for the device to be used later in the device probing. Returns 0 if no settings found, 1 if they are.


Name

__dev_get_by_name — find a device by its name

Synopsis

struct net_device * __dev_get_by_name (net,  
 name); 
struct net *  net;
const char *  name;

Arguments

net

the applicable net namespace

name

name to find

Description

Find an interface by name. Must be called under RTNL semaphore or dev_base_lock. If the name is found a pointer to the device is returned. If the name is not found then NULL is returned. The reference counters are not incremented so the caller must be careful with locks.


Name

dev_get_by_name — find a device by its name

Synopsis

struct net_device * dev_get_by_name (net,  
 name); 
struct net *  net;
const char *  name;

Arguments

net

the applicable net namespace

name

name to find

Description

Find an interface by name. This can be called from any context and does its own locking. The returned handle has the usage count incremented and the caller must use dev_put to release it when it is no longer needed. NULL is returned if no matching device is found.


Name

__dev_get_by_index — find a device by its ifindex

Synopsis

struct net_device * __dev_get_by_index (net,  
 ifindex); 
struct net *  net;
int  ifindex;

Arguments

net

the applicable net namespace

ifindex

index of device

Description

Search for an interface by index. Returns NULL if the device is not found or a pointer to the device. The device has not had its reference counter increased so the caller must be careful about locking. The caller must hold either the RTNL semaphore or dev_base_lock.


Name

dev_get_by_index — find a device by its ifindex

Synopsis

struct net_device * dev_get_by_index (net,  
 ifindex); 
struct net *  net;
int  ifindex;

Arguments

net

the applicable net namespace

ifindex

index of device

Description

Search for an interface by index. Returns NULL if the device is not found or a pointer to the device. The device returned has had a reference added and the pointer is safe until the user calls dev_put to indicate they have finished with it.


Name

dev_getbyhwaddr — find a device by its hardware address

Synopsis

struct net_device * dev_getbyhwaddr (net,  
 type,  
 ha); 
struct net *  net;
unsigned short  type;
char *  ha;

Arguments

net

the applicable net namespace

type

media type of device

ha

hardware address

Description

Search for an interface by MAC address. Returns NULL if the device is not found or a pointer to the device. The caller must hold the rtnl semaphore. The returned device has not had its ref count increased and the caller must therefore be careful about locking

BUGS

If the API was consistent this would be __dev_get_by_hwaddr


Name

dev_get_by_flags — find any device with given flags

Synopsis

struct net_device * dev_get_by_flags (net,  
 if_flags,  
 mask); 
struct net *  net;
unsigned short  if_flags;
unsigned short  mask;

Arguments

net

the applicable net namespace

if_flags

IFF_* values

mask

bitmask of bits in if_flags to check

Description

Search for any interface with the given flags. Returns NULL if a device is not found or a pointer to the device. The device returned has had a reference added and the pointer is safe until the user calls dev_put to indicate they have finished with it.


Name

dev_valid_name — check if name is okay for network device

Synopsis

int dev_valid_name (name); 
const char *  name;

Arguments

name

name string

Description

Network device names need to be valid file names to to allow sysfs to work. We also disallow any kind of whitespace.


Name

dev_alloc_name — allocate a name for a device

Synopsis

int dev_alloc_name (dev,  
 name); 
struct net_device *  dev;
const char *  name;

Arguments

dev

device

name

name format string

Description

Passed a format string - eg “ltd” it will try and find a suitable id. It scans list of devices to build up a free map, then chooses the first empty slot. The caller must hold the dev_base or rtnl lock while allocating the name and adding the device in order to avoid duplicates. Limited to bits_per_byte * page size devices (ie 32K on most platforms). Returns the number of the unit assigned or a negative errno code.


Name

netdev_features_change — device changes features

Synopsis

void netdev_features_change (dev); 
struct net_device *  dev;

Arguments

dev

device to cause notification

Description

Called to indicate a device has changed features.


Name

netdev_state_change — device changes state

Synopsis

void netdev_state_change (dev); 
struct net_device *  dev;

Arguments

dev

device to cause notification

Description

Called to indicate a device has changed state. This function calls the notifier chains for netdev_chain and sends a NEWLINK message to the routing socket.


Name

dev_load — load a network module

Synopsis

void dev_load (net,  
 name); 
struct net *  net;
const char *  name;

Arguments

net

the applicable net namespace

name

name of interface

Description

If a network interface is not present and the process has suitable privileges this function loads the module. If module loading is not available in this kernel then it becomes a nop.


Name

dev_open — prepare an interface for use.

Synopsis

int dev_open (dev); 
struct net_device *  dev;

Arguments

dev

device to open

Description

Takes a device from down to up state. The device's private open function is invoked and then the multicast lists are loaded. Finally the device is moved into the up state and a NETDEV_UP message is sent to the netdev notifier chain.

Calling this function on an active interface is a nop. On a failure a negative errno code is returned.


Name

dev_close — shutdown an interface.

Synopsis

int dev_close (dev); 
struct net_device *  dev;

Arguments

dev

device to shutdown

Description

This function moves an active device into down state. A NETDEV_GOING_DOWN is sent to the netdev notifier chain. The device is then deactivated and finally a NETDEV_DOWN is sent to the notifier chain.


Name

dev_disable_lro — disable Large Receive Offload on a device

Synopsis

void dev_disable_lro (dev); 
struct net_device *  dev;

Arguments

dev

device

Description

Disable Large Receive Offload (LRO) on a net device. Must be called under RTNL. This is needed if received packets may be forwarded to another interface.


Name

register_netdevice_notifier — register a network notifier block

Synopsis

int register_netdevice_notifier (nb); 
struct notifier_block *  nb;

Arguments

nb

notifier

Description

Register a notifier to be called when network device events occur. The notifier passed is linked into the kernel structures and must not be reused until it has been unregistered. A negative errno code is returned on a failure.

When registered all registration and up events are replayed to the new notifier to allow device to have a race free view of the network device list.


Name

unregister_netdevice_notifier — unregister a network notifier block

Synopsis

int unregister_netdevice_notifier (nb); 
struct notifier_block *  nb;

Arguments

nb

notifier

Description

Unregister a notifier previously registered by register_netdevice_notifier. The notifier is unlinked into the kernel structures and may then be reused. A negative errno code is returned on a failure.


Name

netif_device_detach — mark device as removed

Synopsis

void netif_device_detach (dev); 
struct net_device *  dev;

Arguments

dev

network device

Description

Mark device as removed from system and therefore no longer available.


Name

netif_device_attach — mark device as attached

Synopsis

void netif_device_attach (dev); 
struct net_device *  dev;

Arguments

dev

network device

Description

Mark device as attached from system and restart if needed.


Name

skb_gso_segment — Perform segmentation on skb.

Synopsis

struct sk_buff * skb_gso_segment (skb,  
 features); 
struct sk_buff *  skb;
int  features;

Arguments

skb

buffer to segment

features

features for the output path (see dev->features)

Description

This function segments the given skb and returns a list of segments.

It may return NULL if the skb requires no segmentation. This is only possible when GSO is used for verifying header integrity.


Name

dev_queue_xmit — transmit a buffer

Synopsis

int dev_queue_xmit (skb); 
struct sk_buff *  skb;

Arguments

skb

buffer to transmit

Description

Queue a buffer for transmission to a network device. The caller must have set the device and priority and built the buffer before calling this function. The function can be called from an interrupt.

A negative errno code is returned on a failure. A success does not guarantee the frame will be transmitted as it may be dropped due to congestion or traffic shaping.

----------------------------------------------------------------------------------- I notice this method can also return errors from the queue disciplines, including NET_XMIT_DROP, which is a positive value. So, errors can also be positive.

Regardless of the return value, the skb is consumed, so it is currently difficult to retry a send to this method. (You can bump the ref count before sending to hold a reference for retry if you are careful.)

When calling this method, interrupts MUST be enabled. This is because the BH enable code must have IRQs enabled so that it will not deadlock. --BLG


Name

netif_rx — post buffer to the network code

Synopsis

int netif_rx (skb); 
struct sk_buff *  skb;

Arguments

skb

buffer to post

Description

This function receives a packet from a device driver and queues it for the upper (protocol) levels to process. It always succeeds. The buffer may be dropped during processing for congestion control or by the protocol layers.

return values

NET_RX_SUCCESS (no congestion) NET_RX_DROP (packet was dropped)


Name

netif_receive_skb — process receive buffer from network

Synopsis

int netif_receive_skb (skb); 
struct sk_buff *  skb;

Arguments

skb

buffer to process

Description

netif_receive_skb is the main receive data processing function. It always succeeds. The buffer may be dropped during processing for congestion control or by the protocol layers.

This function may only be called from softirq context and interrupts should be enabled.

Return values (usually ignored):

NET_RX_SUCCESS

no congestion

NET_RX_DROP

packet was dropped


Name

__napi_schedule — schedule for receive

Synopsis

void __napi_schedule (n); 
struct napi_struct *  n;

Arguments

n

entry to schedule

Description

The entry's receive function will be scheduled to run


Name

register_gifconf — register a SIOCGIF handler

Synopsis

int register_gifconf (family,  
 gifconf); 
unsigned int  family;
gifconf_func_t *  gifconf;

Arguments

family

Address family

gifconf

Function handler

Description

Register protocol dependent address dumping routines. The handler that is passed must not be freed or reused until it has been replaced by another handler.


Name

netdev_set_master — set up master/slave pair

Synopsis

int netdev_set_master (slave,  
 master); 
struct net_device *  slave;
struct net_device *  master;

Arguments

slave

slave device

master

new master device

Description

Changes the master device of the slave. Pass NULL to break the bonding. The caller must hold the RTNL semaphore. On a failure a negative errno code is returned. On success the reference counts are adjusted, RTM_NEWLINK is sent to the routing socket and the function returns zero.


Name

dev_set_promiscuity — update promiscuity count on a device

Synopsis

int dev_set_promiscuity (dev,  
 inc); 
struct net_device *  dev;
int  inc;

Arguments

dev

device

inc

modifier

Description

Add or remove promiscuity from a device. While the count in the device remains above zero the interface remains promiscuous. Once it hits zero the device reverts back to normal filtering operation. A negative inc value is used to drop promiscuity on the device. Return 0 if successful or a negative errno code on error.


Name

dev_set_allmulti — update allmulti count on a device

Synopsis

int dev_set_allmulti (dev,  
 inc); 
struct net_device *  dev;
int  inc;

Arguments

dev

device

inc

modifier

Description

Add or remove reception of all multicast frames to a device. While the count in the device remains above zero the interface remains listening to all interfaces. Once it hits zero the device reverts back to normal filtering operation. A negative inc value is used to drop the counter when releasing a resource needing all multicasts. Return 0 if successful or a negative errno code on error.


Name

dev_addr_add — Add a device address

Synopsis

int dev_addr_add (dev,  
 addr,  
 addr_type); 
struct net_device *  dev;
unsigned char *  addr;
unsigned char  addr_type;

Arguments

dev

device

addr

address to add

addr_type

address type

Description

Add a device address to the device or increase the reference count if it already exists.

The caller must hold the rtnl_mutex.


Name

dev_addr_del — Release a device address.

Synopsis

int dev_addr_del (dev,  
 addr,  
 addr_type); 
struct net_device *  dev;
unsigned char *  addr;
unsigned char  addr_type;

Arguments

dev

device

addr

address to delete

addr_type

address type

Description

Release reference to a device address and remove it from the device if the reference count drops to zero.

The caller must hold the rtnl_mutex.


Name

dev_addr_add_multiple — Add device addresses from another device

Synopsis

int dev_addr_add_multiple (to_dev,  
 from_dev,  
 addr_type); 
struct net_device *  to_dev;
struct net_device *  from_dev;
unsigned char  addr_type;

Arguments

to_dev

device to which addresses will be added

from_dev

device from which addresses will be added

addr_type

address type - 0 means type will be used from from_dev

Description

Add device addresses of the one device to another. * The caller must hold the rtnl_mutex.


Name

dev_addr_del_multiple — Delete device addresses by another device

Synopsis

int dev_addr_del_multiple (to_dev,  
 from_dev,  
 addr_type); 
struct net_device *  to_dev;
struct net_device *  from_dev;
unsigned char  addr_type;

Arguments

to_dev

device where the addresses will be deleted

from_dev

device by which addresses the addresses will be deleted

addr_type

address type - 0 means type will used from from_dev

Description

Deletes addresses in to device by the list of addresses in from device.

The caller must hold the rtnl_mutex.


Name

dev_unicast_delete — Release secondary unicast address.

Synopsis

int dev_unicast_delete (dev,  
 addr); 
struct net_device *  dev;
void *  addr;

Arguments

dev

device

addr

address to delete

Description

Release reference to a secondary unicast address and remove it from the device if the reference count drops to zero.

The caller must hold the rtnl_mutex.


Name

dev_unicast_add — add a secondary unicast address

Synopsis

int dev_unicast_add (dev,  
 addr); 
struct net_device *  dev;
void *  addr;

Arguments

dev

device

addr

address to add

Description

Add a secondary unicast address to the device or increase the reference count if it already exists.

The caller must hold the rtnl_mutex.


Name

dev_unicast_sync — Synchronize device's unicast list to another device

Synopsis

int dev_unicast_sync (to,  
 from); 
struct net_device *  to;
struct net_device *  from;

Arguments

to

destination device

from

source device

Description

Add newly added addresses to the destination device and release addresses that have no users left. The source device must be locked by netif_tx_lock_bh.

This function is intended to be called from the dev->set_rx_mode function of layered software devices.


Name

dev_unicast_unsync — Remove synchronized addresses from the destination device

Synopsis

void dev_unicast_unsync (to,  
 from); 
struct net_device *  to;
struct net_device *  from;

Arguments

to

destination device

from

source device

Description

Remove all addresses that were added to the destination device by dev_unicast_sync. This function is intended to be called from the dev->stop function of layered software devices.


Name

dev_get_flags — get flags reported to userspace

Synopsis

unsigned dev_get_flags (dev); 
const struct net_device *  dev;

Arguments

dev

device

Description

Get the combination of flag bits exported through APIs to userspace.


Name

dev_change_flags — change device settings

Synopsis

int dev_change_flags (dev,  
 flags); 
struct net_device *  dev;
unsigned  flags;

Arguments

dev

device

flags

device state flags

Description

Change settings on device based state flags. The flags are in the userspace exported format.


Name

dev_set_mtu — Change maximum transfer unit

Synopsis

int dev_set_mtu (dev,  
 new_mtu); 
struct net_device *  dev;
int  new_mtu;

Arguments

dev

device

new_mtu

new transfer unit

Description

Change the maximum transfer size of the network device.


Name

dev_set_mac_address — Change Media Access Control Address

Synopsis

int dev_set_mac_address (dev,  
 sa); 
struct net_device *  dev;
struct sockaddr *  sa;

Arguments

dev

device

sa

new address

Description

Change the hardware (MAC) address of the device


Name

register_netdevice — register a network device

Synopsis

int register_netdevice (dev); 
struct net_device *  dev;

Arguments

dev

device to register

Description

Take a completed network device structure and add it to the kernel interfaces. A NETDEV_REGISTER message is sent to the netdev notifier chain. 0 is returned on success. A negative errno code is returned on a failure to set up the device, or if the name is a duplicate.

Callers must hold the rtnl semaphore. You may want register_netdev instead of this.

BUGS

The locking appears insufficient to guarantee two parallel registers will not get the same name.


Name

init_dummy_netdev — init a dummy network device for NAPI

Synopsis

int init_dummy_netdev (dev); 
struct net_device *  dev;

Arguments

dev

device to init

Description

This takes a network device structure and initialize the minimum amount of fields so it can be used to schedule NAPI polls without registering a full blown interface. This is to be used by drivers that need to tie several hardware interfaces to a single NAPI poll scheduler due to HW limitations.


Name

register_netdev — register a network device

Synopsis

int register_netdev (dev); 
struct net_device *  dev;

Arguments

dev

device to register

Description

Take a completed network device structure and add it to the kernel interfaces. A NETDEV_REGISTER message is sent to the netdev notifier chain. 0 is returned on success. A negative errno code is returned on a failure to set up the device, or if the name is a duplicate.

This is a wrapper around register_netdevice that takes the rtnl semaphore and expands the device name if you passed a format string to alloc_netdev.


Name

dev_get_stats — get network device statistics

Synopsis

const struct net_device_stats * dev_get_stats (dev); 
struct net_device *  dev;

Arguments

dev

device to get statistics from

Description

Get network statistics from device. The device driver may provide its own method by setting dev->netdev_ops->get_stats; otherwise the internal statistics structure is used.


Name

alloc_netdev_mq — allocate network device

Synopsis

struct net_device * alloc_netdev_mq (sizeof_priv,  
 name,  
 setup,  
 queue_count); 
int  sizeof_priv;
const char *  name;
void (* setup(struct net_device *);
unsigned int  queue_count;

Arguments

sizeof_priv

size of private data to allocate space for

name

device name format string

setup

callback to initialize device

queue_count

the number of subqueues to allocate

Description

Allocates a struct net_device with private data area for driver use and performs basic initialization. Also allocates subquue structs for each queue on the device at the end of the netdevice.


Name

free_netdev — free network device

Synopsis

void free_netdev (dev); 
struct net_device *  dev;

Arguments

dev

device

Description

This function does the last stage of destroying an allocated device interface. The reference to the device object is released. If this is the last reference then it will be freed.


Name

synchronize_net — Synchronize with packet receive processing

Synopsis

void synchronize_net (void); 
 void;

Arguments

void

no arguments

Description

Wait for packets currently being received to be done. Does not block later packets from starting.


Name

unregister_netdevice — remove device from the kernel

Synopsis

void unregister_netdevice (dev); 
struct net_device *  dev;

Arguments

dev

device

Description

This function shuts down a device interface and removes it from the kernel tables.

Callers must hold the rtnl semaphore. You may want unregister_netdev instead of this.


Name

unregister_netdev — remove device from the kernel

Synopsis

void unregister_netdev (dev); 
struct net_device *  dev;

Arguments

dev

device

Description

This function shuts down a device interface and removes it from the kernel tables.

This is just a wrapper for unregister_netdevice that takes the rtnl semaphore. In general you want to use this and not unregister_netdevice.


Name

dev_change_net_namespace — move device to different nethost namespace

Synopsis

int dev_change_net_namespace (dev,  
 net,  
 pat); 
struct net_device *  dev;
struct net *  net;
const char *  pat;

Arguments

dev

device

net

network namespace

pat

If not NULL name pattern to try if the current device name is already taken in the destination network namespace.

Description

This function shuts down a device interface and moves it to a new network namespace. On success 0 is returned, on a failure a netagive errno code is returned.

Callers must hold the rtnl semaphore.


Name

netdev_increment_features — increment feature set by one

Synopsis

unsigned long netdev_increment_features (all,  
 one,  
 mask); 
unsigned long  all;
unsigned long  one;
unsigned long  mask;

Arguments

all

current feature set

one

new feature set

mask

mask feature set

Description

Computes a new feature set after adding a device with feature set one to the master device with current feature set all. Will not enable anything that is off in mask. Returns the new feature set.


Name

eth_header — create the Ethernet header

Synopsis

int eth_header (skb,  
 dev,  
 type,  
 daddr,  
 saddr,  
 len); 
struct sk_buff *  skb;
struct net_device *  dev;
unsigned short  type;
const void *  daddr;
const void *  saddr;
unsigned  len;

Arguments

skb

buffer to alter

dev

source device

type

Ethernet type field

daddr

destination address (NULL leave destination address)

saddr

source address (NULL use device source address)

len

packet length (<= skb->len)

Description

Set the protocol type. For a packet of type ETH_P_802_3 we put the length in here instead. It is up to the 802.2 layer to carry protocol information.


Name

eth_rebuild_header — rebuild the Ethernet MAC header.

Synopsis

int eth_rebuild_header (skb); 
struct sk_buff *  skb;

Arguments

skb

socket buffer to update

Description

This is called after an ARP or IPV6 ndisc it's resolution on this sk_buff. We now let protocol (ARP) fill in the other fields.

This routine CANNOT use cached dst->neigh! Really, it is used only when dst->neigh is wrong.


Name

eth_type_trans — determine the packet's protocol ID.

Synopsis

__be16 eth_type_trans (skb,  
 dev); 
struct sk_buff *  skb;
struct net_device *  dev;

Arguments

skb

received socket data

dev

receiving network device

Description

The rule here is that we assume 802.3 if the type field is short enough to be a length. This is normal practice and works for any 'now in use' protocol.


Name

eth_header_parse — extract hardware address from packet

Synopsis

int eth_header_parse (skb,  
 haddr); 
const struct sk_buff *  skb;
unsigned char *  haddr;

Arguments

skb

packet to extract header from

haddr

destination buffer


Name

eth_header_cache — fill cache entry from neighbour

Synopsis

int eth_header_cache (neigh,  
 hh); 
const struct neighbour *  neigh;
struct hh_cache *  hh;

Arguments

neigh

source neighbour

hh

destination cache entry Create an Ethernet header template from the neighbour.


Name

eth_header_cache_update — update cache entry

Synopsis

void eth_header_cache_update (hh,  
 dev,  
 haddr); 
struct hh_cache *  hh;
const struct net_device *  dev;
const unsigned char *  haddr;

Arguments

hh

destination cache entry

dev

network device

haddr

new hardware address

Description

Called by Address Resolution module to notify changes in address.


Name

eth_mac_addr — set new Ethernet hardware address

Synopsis

int eth_mac_addr (dev,  
 p); 
struct net_device *  dev;
void *  p;

Arguments

dev

network device

p

socket address Change hardware address of device.

Description

This doesn't change hardware matching, so needs to be overridden for most real devices.


Name

eth_change_mtu — set new MTU size

Synopsis

int eth_change_mtu (dev,  
 new_mtu); 
struct net_device *  dev;
int  new_mtu;

Arguments

dev

network device

new_mtu

new Maximum Transfer Unit

Description

Allow changing MTU size. Needs to be overridden for devices supporting jumbo frames.


Name

ether_setup — setup Ethernet network device

Synopsis

void ether_setup (dev); 
struct net_device *  dev;

Arguments

dev

network device Fill in the fields of the device structure with Ethernet-generic values.


Name

alloc_etherdev_mq — Allocates and sets up an Ethernet device

Synopsis

struct net_device * alloc_etherdev_mq (sizeof_priv,  
 queue_count); 
int  sizeof_priv;
unsigned int  queue_count;

Arguments

sizeof_priv

Size of additional driver-private structure to be allocated for this Ethernet device

queue_count

The number of queues this device has.

Description

Fill in the fields of the device structure with Ethernet-generic values. Basically does everything except registering the device.

Constructs a new net device, complete with a private data area of size (sizeof_priv). A 32-byte (not bit) alignment is enforced for this private data area.


Name

netif_carrier_on — set carrier

Synopsis

void netif_carrier_on (dev); 
struct net_device *  dev;

Arguments

dev

network device

Description

Device has detected that carrier.


Name

netif_carrier_off — clear carrier

Synopsis

void netif_carrier_off (dev); 
struct net_device *  dev;

Arguments

dev

network device

Description

Device has detected loss of carrier.


Name

is_zero_ether_addr — Determine if give Ethernet address is all zeros.

Synopsis

int is_zero_ether_addr (addr); 
const u8 *  addr;

Arguments

addr

Pointer to a six-byte array containing the Ethernet address

Description

Return true if the address is all zeroes.


Name

is_multicast_ether_addr — Determine if the Ethernet address is a multicast.

Synopsis

int is_multicast_ether_addr (addr); 
const u8 *  addr;

Arguments

addr

Pointer to a six-byte array containing the Ethernet address

Description

Return true if the address is a multicast address. By definition the broadcast address is also a multicast address.


Name

is_local_ether_addr — Determine if the Ethernet address is locally-assigned one (IEEE 802).

Synopsis

int is_local_ether_addr (addr); 
const u8 *  addr;

Arguments

addr

Pointer to a six-byte array containing the Ethernet address

Description

Return true if the address is a local address.


Name

is_broadcast_ether_addr — Determine if the Ethernet address is broadcast

Synopsis

int is_broadcast_ether_addr (addr); 
const u8 *  addr;

Arguments

addr

Pointer to a six-byte array containing the Ethernet address

Description

Return true if the address is the broadcast address.


Name

is_valid_ether_addr — Determine if the given Ethernet address is valid

Synopsis

int is_valid_ether_addr (addr); 
const u8 *  addr;

Arguments

addr

Pointer to a six-byte array containing the Ethernet address

Description

Check that the Ethernet address (MAC) is not 00:00:00:00:00:00, is not a multicast address, and is not FF:FF:FF:FF:FF:FF.

Return true if the address is valid.


Name

random_ether_addr — Generate software assigned random Ethernet address

Synopsis

void random_ether_addr (addr); 
u8 *  addr;

Arguments

addr

Pointer to a six-byte array containing the Ethernet address

Description

Generate a random Ethernet address (MAC) that is not multicast and has the local assigned bit set.


Name

compare_ether_addr — Compare two Ethernet addresses

Synopsis

unsigned compare_ether_addr (addr1,  
 addr2); 
const u8 *  addr1;
const u8 *  addr2;

Arguments

addr1

Pointer to a six-byte array containing the Ethernet address

addr2

Pointer other six-byte array containing the Ethernet address

Description

Compare two ethernet addresses, returns 0 if equal


Name

compare_ether_addr_64bits — Compare two Ethernet addresses

Synopsis

unsigned compare_ether_addr_64bits (addr1[6+2],  
 addr2[6+2]); 
const u8  addr1[6+2];
const u8  addr2[6+2];

Arguments

addr1[6+2]

Pointer to an array of 8 bytes

addr2[6+2]

Pointer to an other array of 8 bytes

Description

Compare two ethernet addresses, returns 0 if equal. Same result than “memcmp(addr1, addr2, ETH_ALEN)” but without conditional branches, and possibly long word memory accesses on CPU allowing cheap unaligned memory reads. arrays = { byte1, byte2, byte3, byte4, byte6, byte7, pad1, pad2}

Please note that alignment of addr1 & addr2 is only guaranted to be 16 bits.


Name

is_etherdev_addr — Tell if given Ethernet address belongs to the device.

Synopsis

bool is_etherdev_addr (dev,  
 addr[6 + 2]); 
const struct net_device *  dev;
const u8  addr[6 + 2];

Arguments

dev

Pointer to a device structure

addr[6 + 2]

Pointer to a six-byte array containing the Ethernet address

Description

Compare passed address with all addresses of the device. Return true if the address if one of the device addresses.

Note that this function calls compare_ether_addr_64bits so take care of the right padding.


Name

compare_ether_header — Compare two Ethernet headers

Synopsis

int compare_ether_header (a,  
 b); 
const void *  a;
const void *  b;

Arguments

a

Pointer to Ethernet header

b

Pointer to Ethernet header

Description

Compare two ethernet headers, returns 0 if equal. This assumes that the network header (i.e., IP header) is 4-byte aligned OR the platform can handle unaligned access. This is the case for all packets coming into netif_receive_skb or similar entry points.


Name

napi_schedule_prep — check if napi can be scheduled

Synopsis

int napi_schedule_prep (n); 
struct napi_struct *  n;

Arguments

n

napi context

Description

Test if NAPI routine is already running, and if not mark it as running. This is used as a condition variable insure only one NAPI poll instance runs. We also make sure there is no pending NAPI disable.


Name

napi_schedule — schedule NAPI poll

Synopsis

void napi_schedule (n); 
struct napi_struct *  n;

Arguments

n

napi context

Description

Schedule NAPI poll routine to be called if it is not already running.


Name

napi_disable — prevent NAPI from scheduling

Synopsis

void napi_disable (n); 
struct napi_struct *  n;

Arguments

n

napi context

Description

Stop NAPI from being scheduled on this context. Waits till any outstanding processing completes.


Name

napi_enable — enable NAPI scheduling

Synopsis

void napi_enable (n); 
struct napi_struct *  n;

Arguments

n

napi context

Description

Resume NAPI from being scheduled on this context. Must be paired with napi_disable.


Name

napi_synchronize — wait until NAPI is not running

Synopsis

void napi_synchronize (n); 
const struct napi_struct *  n;

Arguments

n

napi context

Description

Wait until NAPI is done being scheduled on this context. Waits till any outstanding processing completes but does not disable future activations.


Name

netdev_priv — access network device private data

Synopsis

void * netdev_priv (dev); 
const struct net_device *  dev;

Arguments

dev

network device

Description

Get network device private data


Name

netif_start_queue — allow transmit

Synopsis

void netif_start_queue (dev); 
struct net_device *  dev;

Arguments

dev

network device

Description

Allow upper layers to call the device hard_start_xmit routine.


Name

netif_wake_queue — restart transmit

Synopsis

void netif_wake_queue (dev); 
struct net_device *  dev;

Arguments

dev

network device

Description

Allow upper layers to call the device hard_start_xmit routine. Used for flow control when transmit resources are available.


Name

netif_stop_queue — stop transmitted packets

Synopsis

void netif_stop_queue (dev); 
struct net_device *  dev;

Arguments

dev

network device

Description

Stop upper layers calling the device hard_start_xmit routine. Used for flow control when transmit resources are unavailable.


Name

netif_queue_stopped — test if transmit queue is flowblocked

Synopsis

int netif_queue_stopped (dev); 
const struct net_device *  dev;

Arguments

dev

network device

Description

Test if transmit queue on device is currently unable to send.


Name

netif_running — test if up

Synopsis

int netif_running (dev); 
const struct net_device *  dev;

Arguments

dev

network device

Description

Test if the device has been brought up.


Name

netif_start_subqueue — allow sending packets on subqueue

Synopsis

void netif_start_subqueue (dev,  
 queue_index); 
struct net_device *  dev;
u16  queue_index;

Arguments

dev

network device

queue_index

sub queue index

Description

Start individual transmit queue of a device with multiple transmit queues.


Name

netif_stop_subqueue — stop sending packets on subqueue

Synopsis

void netif_stop_subqueue (dev,  
 queue_index); 
struct net_device *  dev;
u16  queue_index;

Arguments

dev

network device

queue_index

sub queue index

Description

Stop individual transmit queue of a device with multiple transmit queues.


Name

__netif_subqueue_stopped — test status of subqueue

Synopsis

int __netif_subqueue_stopped (dev,  
 queue_index); 
const struct net_device *  dev;
u16  queue_index;

Arguments

dev

network device

queue_index

sub queue index

Description

Check individual transmit queue of a device with multiple transmit queues.


Name

netif_wake_subqueue — allow sending packets on subqueue

Synopsis

void netif_wake_subqueue (dev,  
 queue_index); 
struct net_device *  dev;
u16  queue_index;

Arguments

dev

network device

queue_index

sub queue index

Description

Resume individual transmit queue of a device with multiple transmit queues.


Name

netif_is_multiqueue — test if device has multiple transmit queues

Synopsis

int netif_is_multiqueue (dev); 
const struct net_device *  dev;

Arguments

dev

network device

Description

Check if device has multiple transmit queues


Name

dev_put — release reference to device

Synopsis

void dev_put (dev); 
struct net_device *  dev;

Arguments

dev

network device

Description

Release reference to device to allow it to be freed.


Name

dev_hold — get reference to device

Synopsis

void dev_hold (dev); 
struct net_device *  dev;

Arguments

dev

network device

Description

Hold reference to device to keep it from being freed.


Name

netif_carrier_ok — test if carrier present

Synopsis

int netif_carrier_ok (dev); 
const struct net_device *  dev;

Arguments

dev

network device

Description

Check if carrier is present on device


Name

netif_dormant_on — mark device as dormant.

Synopsis

void netif_dormant_on (dev); 
struct net_device *  dev;

Arguments

dev

network device

Description

Mark device as dormant (as per RFC2863).

The dormant state indicates that the relevant interface is not actually in a condition to pass packets (i.e., it is not 'up') but is in a “pending” state, waiting for some external event. For “on- demand” interfaces, this new state identifies the situation where the interface is waiting for events to place it in the up state.


Name

netif_dormant_off — set device as not dormant.

Synopsis

void netif_dormant_off (dev); 
struct net_device *  dev;

Arguments

dev

network device

Description

Device is not in dormant state.


Name

netif_dormant — test if carrier present