Mount failure on one filesystem in an old pool.

Brian Behlendorf behlendorf1 at llnl.gov
Mon May 23 18:31:36 EDT 2011


It looks like you've found a real bug in the current implementation.
The failure your hitting is triggered by processing the unlinked object
zap at mount time.  Usually this list is empty which I suspect it why we
haven't seen this issue until now.  It's also why you don't hit this
issue when mounting a snapshot.  I'll open a bug on the issue tracker so
this gets fixed.  Thanks for reporting it.

-- 
Thanks,
Brian 

On Fri, 2011-05-20 at 23:10 -0700, Travis Tabbal wrote:
> I am attempting to migrate an old pool from OpenSolaris b134 to debian
> squeeze with 0.6.0-rc4 drivers. Also running the Xen 4.1 hypervisor,
> compiled from debian sid source packages. I see xen in the stacktrace,
> so it might be relevant. This is all being done in the dom0, I don't
> have any VMs running presently. Dom0 has 4G of RAM allocated to it, 2G
> are showing as free using the "free"command after the file copy was
> completed.
> 
> I am able to locate the pool and import it fine. Mounting the
> filesystems fails on a single fs. I will include the dmesg output
> below. I am able to mount the last snapshot, and am copying the data
> into a new fs now, so I lost no data. As a result, I wouldn't mind
> getting that auto-snapshot script I was running on OpenSolaris back up
> and running...
> 
> One minor issue that cropped up, my devices get renamed at boot
> apparently. So my pool thought it was corrupted without enough
> redundancy. I was able to rmmod zfs, then add it back, and do "zpool
> import -d . -a" while in the /dev/disk/by-id directory. Now the disks
> are referred to with those names, which should not change, rather than
> "sda" and such. Nice tip from the zfs-fuse mailing list...
> 
> Hardware is an AMD Phenom X3 with LSI SAS controllers on Samsung 1.5T
> drives. Let me know if more details will be useful for you. Thanks for
> getting this on Linux. It's working very well so far.
> 
> 
> [  512.508151] ZFS: Loaded module v0.6.0, ZFS pool version 28, ZFS
> filesystem version 5
> [  920.370905]  zd0: p1 p2
> [  920.567114]  zd16: p1 p2 < p5 >
> [  920.726831]  zd32: p1 p2 < p5 >
> [  920.830148]  zd48: p1
> [  920.961585]  zd64: p1
> [  921.101137]  zd80: p1 p2
> [  921.290212]  zd96: p1 p2
> [  921.472620]  zd112: p1
> [  921.564151]  zd128: p1
> [  921.713899]  zd144: p1
> [  921.817671]  zd160: p1
> [  921.859540]  zd176: p1
> [  921.903539]  zd192: p1
> [  927.924798] general protection fault: 0000 [#1] SMP
> [  927.925706] last sysfs file: /sys/devices/virtual/block/zd96/zd96p2/
> uevent
> [  927.926566] CPU 0
> [  927.927427] Modules linked in: zfs(P) xt_physdev iptable_filter
> ip_tables x_tables xen_evtchn xenfs ib_iser rdma_cm ib_cm iw_cm ib_sa
> ib_mad ib_core ib_addr iscsi_tcp libiscsi_tcp libiscsi
> scsi_transport_iscsi bridge stp zcommon(P) znvpair(P) zavl(P)
> zunicode(P) spl zlib_deflate loop firewire_sbp2 nouveau ttm
> amd64_edac_mod snd_pcm drm_kms_helper snd_timer edac_core drm
> i2c_nforce2 i2c_algo_bit shpchp i2c_core k10temp edac_mce_amd snd
> soundcore snd_page_alloc video pci_hotplug asus_atk0110 output evdev
> processor wmi pcspkr psmouse serio_raw acpi_processor button ext4
> mbcache jbd2 crc16 usbhid hid sd_mod crc_t10dif ata_generic ohci_hcd
> firewire_ohci mptsas ahci mptscsih mptbase firewire_core
> scsi_transport_sas pata_amd ehci_hcd e1000e forcedeth crc_itu_t libata
> scsi_mod usbcore nls_base thermal thermal_sys [last unloaded: zfs]
> [  927.928065] Pid: 4452, comm: mount.zfs Tainted: P        W
> 2.6.32-5-xen-amd64 #1 System Product Name
> [  927.928065] RIP: e030:[<ffffffffa03b7e71>]  [<ffffffffa03b7e71>]
> zfs_inode_destroy+0x4f/0x99 [zfs]
> [  927.928065] RSP: e02b:ffff8800ba383b48  EFLAGS: 00010286
> [  927.928065] RAX: ffff8800b99f7ca0 RBX: dead000000200200 RCX:
> dead000000100100
> [  927.928065] RDX: dead000000200200 RSI: ffff8800ba383b28 RDI:
> ffff8800bbc431d8
> [  927.928065] RBP: ffff8800b99f7cc8 R08: 0000000000000000 R09:
> 8000000000000000
> [  927.928065] R10: dead000000100100 R11: ffffffffa03c1a08 R12:
> ffff8800bbc431d8
> [  927.928065] R13: ffff8800b99f7b50 R14: ffff8800ba383bf8 R15:
> ffff8800cdb9ab00
> [  927.928065] FS:  00007f1ea91c2b40(0000) GS:ffff880003890000(0000)
> knlGS:0000000000000000
> [  927.928065] CS:  e033 DS: 0000 ES: 0000 CR0: 000000008005003b
> [  927.928065] CR2: 00007f1b37b62f20 CR3: 00000000ba374000 CR4:
> 0000000000000660
> [  927.928065] DR0: 0000000000000000 DR1: 0000000000000000 DR2:
> 0000000000000000
> [  927.928065] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7:
> 0000000000000400
> [  927.928065] Process mount.zfs (pid: 4452, threadinfo
> ffff8800ba382000, task ffff8800ca4fa350)
> [  927.928065] Stack:
> [  927.928065]  ffff8800ba383bf0 ffff8800ba383bb8 ffff8800bbc43000
> ffff8800ba383bf0
> [  927.928065] <0> ffff8800ba383e08 ffffffffa039e934 0000400000020000
> 0000001100000013
> [  927.928065] <0> 0000000000000108 ffffffff81000006 000000000095d5a1
> 000fff711d5a0000
> [  927.928065] Call Trace:
> [  927.928065]  [<ffffffffa039e934>] ? zfs_unlinked_drain+0x95/0xdc
> [zfs]
> [  927.928065]  [<ffffffff8100ecf2>] ? check_events+0x12/0x20
> [  927.928065]  [<ffffffff8100ecdf>] ? xen_restore_fl_direct_end
> +0x0/0x1
> [  927.928065]  [<ffffffff8130d13a>] ? _spin_unlock_irqrestore+0xd/0xe
> [  927.928065]  [<ffffffffa02cba25>] ? __taskq_create+0x361/0x387
> [spl]
> [  927.928065]  [<ffffffff8100e63d>] ? xen_force_evtchn_callback
> +0x9/0xa
> [  927.928065]  [<ffffffff8100ecf2>] ? check_events+0x12/0x20
> [  927.928065]  [<ffffffff81065eb2>] ? autoremove_wake_function
> +0x0/0x2e
> [  927.928065]  [<ffffffffa03b39b6>] ? zfs_get_data+0x0/0x22a [zfs]
> [  927.928065]  [<ffffffffa03c194b>] ? zpl_fill_super+0x0/0xd [zfs]
> [  927.928065]  [<ffffffffa03adf17>] ? zfs_sb_setup+0x7d/0xe0 [zfs]
> [  927.928065]  [<ffffffffa03ae2da>] ? zfs_domount+0x1c7/0x22b [zfs]
> [  927.928065]  [<ffffffff810f1796>] ? sget+0x39d/0x3af
> [  927.928065]  [<ffffffff810f0f70>] ? set_anon_super+0x0/0xd5
> [  927.928065]  [<ffffffffa03c1954>] ? zpl_fill_super+0x9/0xd [zfs]
> [  927.928065]  [<ffffffff810f1f75>] ? get_sb_nodev+0x4f/0x83
> [  927.928065]  [<ffffffffa03c1946>] ? zpl_get_sb+0x21/0x26 [zfs]
> [  927.928065]  [<ffffffff810ba593>] ? __get_free_pages+0x9/0x46
> [  927.928065]  [<ffffffff810f1cdf>] ? vfs_kern_mount+0x99/0x14b
> [  927.928065]  [<ffffffff810f1de4>] ? do_kern_mount+0x43/0xe2
> [  927.928065]  [<ffffffff81105eab>] ? do_mount+0x72a/0x792
> [  927.928065]  [<ffffffff81105f93>] ? sys_mount+0x80/0xbd
> [  927.928065]  [<ffffffff81011b42>] ? system_call_fastpath+0x16/0x1b
> [  927.928065] Code: a3 d8 01 00 00 4c 89 e7 e8 76 48 f5 e0 4c 89 e8
> 48 03 83 c0 01 00 00 48 bb 00 02 20 00 00 00 ad de 4c 89 e7 48 8b 50
> 08 48 8b 08 <48> 89 51 08 48 89 0a 48 ba 00 01 10 00 00 00 ad de 48 89
> 10 48
> [  927.928065] RIP  [<ffffffffa03b7e71>] zfs_inode_destroy+0x4f/0x99
> [zfs]
> [  927.928065]  RSP <ffff8800ba383b48>
> [  927.928065] ---[ end trace a7919e7f17c0a727 ]---



More information about the zfs-discuss mailing list