sysfs.txt 11.8 KB
Newer Older
Linus Torvalds's avatar
Linus Torvalds committed
1 2 3 4

sysfs - _The_ filesystem for exporting kernel objects. 

Patrick Mochel	<mochel@osdl.org>
5
Mike Murphy <mamurph@cs.clemson.edu>
Linus Torvalds's avatar
Linus Torvalds committed
6

7
Revised:    16 August 2011
8
Original:   10 January 2003
Linus Torvalds's avatar
Linus Torvalds committed
9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25


What it is:
~~~~~~~~~~~

sysfs is a ram-based filesystem initially based on ramfs. It provides
a means to export kernel data structures, their attributes, and the 
linkages between them to userspace. 

sysfs is tied inherently to the kobject infrastructure. Please read
Documentation/kobject.txt for more information concerning the kobject
interface. 


Using sysfs
~~~~~~~~~~~

26 27
sysfs is always compiled in if CONFIG_SYSFS is defined. You can access
it by doing:
Linus Torvalds's avatar
Linus Torvalds committed
28 29 30 31 32 33 34 35 36 37 38 39 40 41

    mount -t sysfs sysfs /sys 


Directory Creation
~~~~~~~~~~~~~~~~~~

For every kobject that is registered with the system, a directory is
created for it in sysfs. That directory is created as a subdirectory
of the kobject's parent, expressing internal object hierarchies to
userspace. Top-level directories in sysfs represent the common
ancestors of object hierarchies; i.e. the subsystems the objects
belong to. 

42
Sysfs internally stores a pointer to the kobject that implements a
43
directory in the kernfs_node object associated with the directory. In
44 45 46 47
the past this kobject pointer has been used by sysfs to do reference
counting directly on the kobject whenever the file is opened or closed.
With the current sysfs implementation the kobject reference count is
only modified directly by the function sysfs_schedule_callback().
Linus Torvalds's avatar
Linus Torvalds committed
48 49 50 51 52 53 54 55 56 57 58


Attributes
~~~~~~~~~~

Attributes can be exported for kobjects in the form of regular files in
the filesystem. Sysfs forwards file I/O operations to methods defined
for the attributes, providing a means to read and write kernel
attributes.

Attributes should be ASCII text files, preferably with only one value
59
per file. It is noted that it may not be efficient to contain only one
Linus Torvalds's avatar
Linus Torvalds committed
60 61 62 63 64
value per file, so it is socially acceptable to express an array of
values of the same type. 

Mixing types, expressing multiple lines of data, and doing fancy
formatting of data is heavily frowned upon. Doing these things may get
Lucas De Marchi's avatar
Lucas De Marchi committed
65
you publicly humiliated and your code rewritten without notice. 
Linus Torvalds's avatar
Linus Torvalds committed
66 67 68 69 70 71


An attribute definition is simply:

struct attribute {
        char                    * name;
72
        struct module		*owner;
Al Viro's avatar
Al Viro committed
73
        umode_t                 mode;
Linus Torvalds's avatar
Linus Torvalds committed
74 75 76
};


77 78
int sysfs_create_file(struct kobject * kobj, const struct attribute * attr);
void sysfs_remove_file(struct kobject * kobj, const struct attribute * attr);
Linus Torvalds's avatar
Linus Torvalds committed
79 80 81 82 83 84 85 86 87 88


A bare attribute contains no means to read or write the value of the
attribute. Subsystems are encouraged to define their own attribute
structure and wrapper functions for adding and removing attributes for
a specific object type. 

For example, the driver model defines struct device_attribute like:

struct device_attribute {
89 90 91 92 93
	struct attribute	attr;
	ssize_t (*show)(struct device *dev, struct device_attribute *attr,
			char *buf);
	ssize_t (*store)(struct device *dev, struct device_attribute *attr,
			 const char *buf, size_t count);
Linus Torvalds's avatar
Linus Torvalds committed
94 95
};

96 97
int device_create_file(struct device *, const struct device_attribute *);
void device_remove_file(struct device *, const struct device_attribute *);
Linus Torvalds's avatar
Linus Torvalds committed
98 99 100

It also defines this helper for defining device attributes: 

101 102
#define DEVICE_ATTR(_name, _mode, _show, _store) \
struct device_attribute dev_attr_##_name = __ATTR(_name, _mode, _show, _store)
Linus Torvalds's avatar
Linus Torvalds committed
103 104 105

For example, declaring

106
static DEVICE_ATTR(foo, S_IWUSR | S_IRUGO, show_foo, store_foo);
Linus Torvalds's avatar
Linus Torvalds committed
107 108 109 110

is equivalent to doing:

static struct device_attribute dev_attr_foo = {
111
	.attr = {
Linus Torvalds's avatar
Linus Torvalds committed
112
		.name = "foo",
113
		.mode = S_IWUSR | S_IRUGO,
Linus Torvalds's avatar
Linus Torvalds committed
114
	},
115 116
	.show = show_foo,
	.store = store_foo,
Linus Torvalds's avatar
Linus Torvalds committed
117 118 119 120 121 122 123 124 125 126 127
};


Subsystem-Specific Callbacks
~~~~~~~~~~~~~~~~~~~~~~~~~~~~

When a subsystem defines a new attribute type, it must implement a
set of sysfs operations for forwarding read and write calls to the
show and store methods of the attribute owners. 

struct sysfs_ops {
128
        ssize_t (*show)(struct kobject *, struct attribute *, char *);
129
        ssize_t (*store)(struct kobject *, struct attribute *, const char *, size_t);
Linus Torvalds's avatar
Linus Torvalds committed
130 131 132 133 134 135 136 137 138 139 140 141 142 143
};

[ Subsystems should have already defined a struct kobj_type as a
descriptor for this type, which is where the sysfs_ops pointer is
stored. See the kobject documentation for more information. ]

When a file is read or written, sysfs calls the appropriate method
for the type. The method then translates the generic struct kobject
and struct attribute pointers to the appropriate pointer types, and
calls the associated methods. 


To illustrate:

144
#define to_dev(obj) container_of(obj, struct device, kobj)
145
#define to_dev_attr(_attr) container_of(_attr, struct device_attribute, attr)
Linus Torvalds's avatar
Linus Torvalds committed
146

147 148
static ssize_t dev_attr_show(struct kobject *kobj, struct attribute *attr,
                             char *buf)
Linus Torvalds's avatar
Linus Torvalds committed
149
{
150 151 152
        struct device_attribute *dev_attr = to_dev_attr(attr);
        struct device *dev = to_dev(kobj);
        ssize_t ret = -EIO;
Linus Torvalds's avatar
Linus Torvalds committed
153 154

        if (dev_attr->show)
155 156
                ret = dev_attr->show(dev, dev_attr, buf);
        if (ret >= (ssize_t)PAGE_SIZE) {
157 158
                printk("dev_attr_show: %pS returned bad count\n",
                                dev_attr->show);
159
        }
Linus Torvalds's avatar
Linus Torvalds committed
160 161 162 163 164 165 166 167 168 169 170 171
        return ret;
}



Reading/Writing Attribute Data
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

To read or write attributes, show() or store() methods must be
specified when declaring the attribute. The method types should be as
simple as those defined for device attributes:

172 173 174
ssize_t (*show)(struct device *dev, struct device_attribute *attr, char *buf);
ssize_t (*store)(struct device *dev, struct device_attribute *attr,
                 const char *buf, size_t count);
Linus Torvalds's avatar
Linus Torvalds committed
175

176
IOW, they should take only an object, an attribute, and a buffer as parameters.
Linus Torvalds's avatar
Linus Torvalds committed
177 178 179 180 181 182 183 184 185 186 187


sysfs allocates a buffer of size (PAGE_SIZE) and passes it to the
method. Sysfs will call the method exactly once for each read or
write. This forces the following behavior on the method
implementations: 

- On read(2), the show() method should fill the entire buffer. 
  Recall that an attribute should only be exporting one value, or an
  array of similar values, so this shouldn't be that expensive. 

188 189 190 191
  This allows userspace to do partial reads and forward seeks
  arbitrarily over the entire file at will. If userspace seeks back to
  zero or does a pread(2) with an offset of '0' the show() method will
  be called again, rearmed, to fill the buffer.
Linus Torvalds's avatar
Linus Torvalds committed
192 193

- On write(2), sysfs expects the entire buffer to be passed during the
194 195 196 197
  first write. Sysfs then passes the entire buffer to the store() method.
  A terminating null is added after the data on stores. This makes
  functions like sysfs_streq() safe to use.

Linus Torvalds's avatar
Linus Torvalds committed
198 199 200 201 202 203 204 205 206
  When writing sysfs files, userspace processes should first read the
  entire file, modify the values it wishes to change, then write the
  entire buffer back. 

  Attribute method implementations should operate on an identical
  buffer when reading and writing values. 

Other notes:

207 208 209
- Writing causes the show() method to be rearmed regardless of current
  file position.

Linus Torvalds's avatar
Linus Torvalds committed
210 211 212 213
- The buffer will always be PAGE_SIZE bytes in length. On i386, this
  is 4096. 

- show() methods should return the number of bytes printed into the
214
  buffer. This is the return value of scnprintf().
Linus Torvalds's avatar
Linus Torvalds committed
215

216 217 218 219
- show() must not use snprintf() when formatting the value to be
  returned to user space. If you can guarantee that an overflow
  will never happen you can use sprintf() otherwise you must use
  scnprintf().
Linus Torvalds's avatar
Linus Torvalds committed
220

221 222
- store() should return the number of bytes used from the buffer. If the
  entire buffer has been used, just return the count argument.
Linus Torvalds's avatar
Linus Torvalds committed
223 224 225 226 227 228 229 230 231 232 233 234

- show() or store() can always return errors. If a bad value comes
  through, be sure to return an error.

- The object passed to the methods will be pinned in memory via sysfs
  referencing counting its embedded object. However, the physical 
  entity (e.g. device) the object represents may not be present. Be 
  sure to have a way to check this, if necessary. 


A very simple (and naive) implementation of a device attribute is:

235 236
static ssize_t show_name(struct device *dev, struct device_attribute *attr,
                         char *buf)
Linus Torvalds's avatar
Linus Torvalds committed
237
{
238
	return scnprintf(buf, PAGE_SIZE, "%s\n", dev->name);
Linus Torvalds's avatar
Linus Torvalds committed
239 240
}

241 242
static ssize_t store_name(struct device *dev, struct device_attribute *attr,
                          const char *buf, size_t count)
Linus Torvalds's avatar
Linus Torvalds committed
243
{
244 245 246
        snprintf(dev->name, sizeof(dev->name), "%.*s",
                 (int)min(count, sizeof(dev->name) - 1), buf);
	return count;
Linus Torvalds's avatar
Linus Torvalds committed
247 248
}

249
static DEVICE_ATTR(name, S_IRUGO, show_name, store_name);
Linus Torvalds's avatar
Linus Torvalds committed
250 251 252 253 254 255 256 257 258 259 260 261


(Note that the real implementation doesn't allow userspace to set the 
name for a device.)


Top Level Directory Layout
~~~~~~~~~~~~~~~~~~~~~~~~~~

The sysfs directory arrangement exposes the relationship of kernel
data structures. 

262
The top level sysfs directory looks like:
Linus Torvalds's avatar
Linus Torvalds committed
263 264 265 266

block/
bus/
class/
267
dev/
Linus Torvalds's avatar
Linus Torvalds committed
268 269 270
devices/
firmware/
net/
271
fs/
Linus Torvalds's avatar
Linus Torvalds committed
272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289

devices/ contains a filesystem representation of the device tree. It maps
directly to the internal kernel device tree, which is a hierarchy of
struct device. 

bus/ contains flat directory layout of the various bus types in the
kernel. Each bus's directory contains two subdirectories:

	devices/
	drivers/

devices/ contains symlinks for each device discovered in the system
that point to the device's directory under root/.

drivers/ contains a directory for each device driver that is loaded
for devices on that particular bus (this assumes that drivers do not
span multiple bus types).

290 291 292 293
fs/ contains a directory for some filesystems.  Currently each
filesystem wanting to export attributes must create its own hierarchy
below fs/ (see ./fuse.txt for an example).

294 295 296 297 298
dev/ contains two directories char/ and block/. Inside these two
directories there are symlinks named <major>:<minor>.  These symlinks
point to the sysfs directory for the given device.  /sys/dev provides a
quick way to lookup the sysfs interface for a device from the result of
a stat(2) operation.
Linus Torvalds's avatar
Linus Torvalds committed
299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317

More information can driver-model specific features can be found in
Documentation/driver-model/. 


TODO: Finish this section.


Current Interfaces
~~~~~~~~~~~~~~~~~~

The following interface layers currently exist in sysfs:


- devices (include/linux/device.h)
----------------------------------
Structure:

struct device_attribute {
318 319 320 321 322
	struct attribute	attr;
	ssize_t (*show)(struct device *dev, struct device_attribute *attr,
			char *buf);
	ssize_t (*store)(struct device *dev, struct device_attribute *attr,
			 const char *buf, size_t count);
Linus Torvalds's avatar
Linus Torvalds committed
323 324 325 326
};

Declaring:

327
DEVICE_ATTR(_name, _mode, _show, _store);
Linus Torvalds's avatar
Linus Torvalds committed
328 329 330

Creation/Removal:

331 332
int device_create_file(struct device *dev, const struct device_attribute * attr);
void device_remove_file(struct device *dev, const struct device_attribute * attr);
Linus Torvalds's avatar
Linus Torvalds committed
333 334 335 336 337 338 339 340 341


- bus drivers (include/linux/device.h)
--------------------------------------
Structure:

struct bus_attribute {
        struct attribute        attr;
        ssize_t (*show)(struct bus_type *, char * buf);
342
        ssize_t (*store)(struct bus_type *, const char * buf, size_t count);
Linus Torvalds's avatar
Linus Torvalds committed
343 344 345 346
};

Declaring:

347
BUS_ATTR(_name, _mode, _show, _store)
Linus Torvalds's avatar
Linus Torvalds committed
348 349 350 351 352 353 354 355 356 357 358 359 360 361 362

Creation/Removal:

int bus_create_file(struct bus_type *, struct bus_attribute *);
void bus_remove_file(struct bus_type *, struct bus_attribute *);


- device drivers (include/linux/device.h)
-----------------------------------------

Structure:

struct driver_attribute {
        struct attribute        attr;
        ssize_t (*show)(struct device_driver *, char * buf);
363 364
        ssize_t (*store)(struct device_driver *, const char * buf,
                         size_t count);
Linus Torvalds's avatar
Linus Torvalds committed
365 366 367 368
};

Declaring:

369 370
DRIVER_ATTR_RO(_name)
DRIVER_ATTR_RW(_name)
Linus Torvalds's avatar
Linus Torvalds committed
371 372 373

Creation/Removal:

374 375
int driver_create_file(struct device_driver *, const struct driver_attribute *);
void driver_remove_file(struct device_driver *, const struct driver_attribute *);
Linus Torvalds's avatar
Linus Torvalds committed
376 377


378 379 380 381 382 383 384 385
Documentation
~~~~~~~~~~~~~

The sysfs directory structure and the attributes in each directory define an
ABI between the kernel and user space. As for any ABI, it is important that
this ABI is stable and properly documented. All new sysfs attributes must be
documented in Documentation/ABI. See also Documentation/ABI/README for more
information.