[Oisf-users] File Extraction Woes

Fri May 30 17:17:52 UTC 2014

On 05/30/2014 06:31 PM, Jason Batchelor wrote:
> Hello,
> 
> I am having some issues with file extraction in Suricata, and after
> attempting to do many optimizations and review of others experiences I
> am still finding myself out of luck. Below is some verbose output of my
> current configuration and some sample data after ~12 hours of running. I
> have also included a smaller time frame with a few subtle changes for
> consideration.
> 
> Under this configuration the rule I have, which is an IP based rule below...
> 
> alert http any any -> $MY_LOCAL_IP any (msg:"FILE PDF"; filemagic:"PDF
> document"; filestore; sid:1; rev:1;)
> 
> Does not trigger at all when the reassembly mem cap is reached. Even
> when it does (when reassembly memcap is below the threshold), I get
> truncated PDFs. I have tried adjusting things like the reassembly
> memcap, however, when I do, I very quickly run into a large amount of
> packet loss because the number of free slots PF_RING can issue is not
> able to keep up (details below). Additionally, reassembly mem cap seems
> to slowly increase over time, eventually reaching its peak before the
> number of free ring slots can finally keep up (presumably due to
> segments being dropped).
> 
> I have struggled playing with time out values as well really to no avail
> (details below).
> 
> When I turn http logging on, I do see the website that I go to being
> properly logged fwiw.
> 
> I feel like I must be doing something wrong, or I am not seeing
> something obvious. After reviewing many blogs and howtos, it seems folks
> are able to do what I am trying to accomplish with the same (sometime
> more) data rates and much less hardware.
> 
> I have tried the following:
> - Increased min_num_slots to 65534 for PF_RING
> - Tinkered with TCP timeout settings
> - Adjusted reassembly memcap
> 
> Kindly take a look at the details I have listed below and let me know if
> there is anything you can suggest. I am curious if I am just plain at
> the limit of my hardware and need to consider upgrading and/or getting
> PF_RING with DNA. Or, perhaps there are a few more items I should
> consider within the application itself.
> 
> One final thing to consider, would tcp sequence randomization
> significantly impact things? I would need to get in touch with the folks
> responsible to see if we have this on but thought I would ask here as well!
> 
> Many thanks in advance for your time looking at this!
> 
> == Profile ==
> 
> CentOS 6.5 Linux
> Kernel 2.6.32-431.11.2.el6.x86_64
> 
> Installed Suricata 2.0 and PF_RING 6.0.1 from source.
> 
> Machine sees ~400MB/s at peek load.
> 
> == Tuning ==
> 
> I've tuned the ixgbe NIC with the following settings...
> 
> ethtool -K p4p2 tso off
> ethtool -K p4p2 gro off
> ethtool -K p4p2 lro off
> ethtool -K p4p2 gso off
> ethtool -K p4p2 rx off
> ethtool -K p4p2 tx off
> ethtool -K p4p2 sg off
> ethtool -K p4p2 rxvlan off
> ethtool -K p4p2 txvlan off
> ethtool -N p4p2 rx-flow-hash udp4 sdfn
> ethtool -N p4p2 rx-flow-hash udp6 sdfn
> ethtool -n p4p2 rx-flow-hash udp6
> ethtool -n p4p2 rx-flow-hash udp4
> ethtool -C p4p2 rx-usecs 1000
> ethtool -C p4p2 adaptive-rx off
> 
> It is also using the latest driver available. I have also tried to
> optimize things in the sysctl.conf
> 
> # -- 10gbe tuning from Intel ixgb driver README -- #
> 
> # turn off selective ACK and timestamps
> net.ipv4.tcp_sack = 0
> net.ipv4.tcp_timestamps = 0
> 
> # memory allocation min/pressure/max.
> # read buffer, write buffer, and buffer space
> net.ipv4.tcp_rmem = 10000000 10000000 10000000
> net.ipv4.tcp_wmem = 10000000 10000000 10000000
> net.ipv4.tcp_mem = 10000000 10000000 10000000
> 
> net.core.rmem_max = 524287
> net.core.wmem_max = 524287
> net.core.rmem_default = 524287
> net.core.wmem_default = 524287
> net.core.optmem_max = 524287
> net.core.netdev_max_backlog = 300000
> 
> == Hardware Specs ==
> CPU: Intel Xeon CPU @ 2.40Ghz x 32
> RAM: 48G
> NIC:
>   *-network:1
>        description: Ethernet interface
>        product: Ethernet 10G 2P X520 Adapter
>        vendor: Intel Corporation
>        physical id: 0.1
>        bus info: pci at 0000:42:00.1
>        logical name: p4p2
>        version: 01
>        serial: a0:36:9f:07:ec:02
>        capacity: 1GB/s
>        width: 64 bits
>        clock: 33MHz
>        capabilities: pm msi msix pciexpress vpd bus_master cap_list rom
> ethernet physical fibre 1000bt-fd autonegotiation
>        configuration: autonegotiation=on broadcast=yes driver=ixgbe
> driverversion=3.21.2 duplex=full firmware=0x8000030d latency=0 link=yes
> multicast=yes port=fibre promiscuous=yes
>        resources: irq:76 memory:d0f00000-d0f7ffff ioport:7ce0(size=32)
> memory:d0ffc000-d0ffffff memory:d1100000-d117ffff(prefetchable)
> memory:d1380000-d147ffff(prefetchable)
> memory:d1480000-d157ffff(prefetchable)
> 
> == Suricata Config ==
> Below are some details that may be relevant...
> 
> runmode: workers
> 
> host-mode: sniffer-only
> 
> default-packet-size: 9000
> 
> - file-store:
>     enabled: yes       # set to yes to enable
>     log-dir: files    # directory to store the files
>     force-magic: yes   # force logging magic on all stored files
>     force-md5: yes     # force logging of md5 checksums
>     waldo: file.waldo # waldo file to store the file_id across runs
> 
> defrag:
>   memcap: 512mb      
>   hash-size: 65536
>   trackers: 65535  # number of defragmented flows to follow
>   max-frags: 65535 # number of fragments to keep (higher than trackers)
>   prealloc: yes
>   timeout: 30         
> 
> flow:
>   memcap: 1gb          
>   hash-size: 1048576   
>   prealloc: 1048576    
>   emergency-recovery: 30
> 
> flow-timeouts:
>   default:
>     new: 1
>     established: 5
>     closed: 0
>     emergency-new: 1
>     emergency-established: 1
>     emergency-closed: 0
>   tcp:
>     new: 15
>     established: 100
>     closed: 5
>     emergency-new: 1
>     emergency-established: 1
>     emergency-closed: 0
>   udp:
>     new: 5
>     established: 10
>     emergency-new: 1
>     emergency-established: 1
>   icmp:
>     new: 1
>     established: 5
>     emergency-new: 1
>     emergency-established: 1
> 
> stream:
>   memcap: 10gb                  

This is excessive, although it won't hurt.

>   checksum-validation: no        # reject wrong csums
>   prealloc-sesions: 500000      
>   midstream: false             
>   asyn-oneside: false           
>   inline: no                     # auto will use inline mode in IPS
> mode, yes or no set it statically
>   reassembly:
>     memcap: 1.5gb                 

If you have 48G of mem, I think you can use a lot more here. Like 16G or
something.

>     depth: 5mb                 
>     toserver-chunk-size: 2560
>     toclient-chunk-size: 2560
>     randomize-chunk-size: yes
> 
> host:
>   hash-size: 4096
>   prealloc: 1000
>   memcap: 16777216
> 
> 
> pfring:
>   - interface: p4p2
>     threads: 16
>     cluster-id: 99
>  cluster-type: cluster_flow
>  checksum-checks: no
>  - interface: default
> 
> http:
>    enabled: yes
>    libhtp:
>       default-config:
>         personality: IDS
> 
>         # Can be specified in kb, mb, gb.  Just a number indicates
>         # it's in bytes.
>         request-body-limit: 12mb
>         response-body-limit: 12mb

As stream depth is 5mb, setting 12mb here doesn't really affect anything
I think. The stream depth cuts stream tracking at 5mb regardless of the
setting here.

>  
> == ~12 hours (above config) =
> 
> top - 14:58:59 up 18:23,  3 users,  load average: 6.44, 4.83, 4.32
> Tasks: 664 total,   1 running, 663 sleeping,   0 stopped,   0 zombie
> Cpu(s): 17.9%us,  0.1%sy,  0.0%ni, 80.3%id,  0.0%wa,  0.0%hi,  1.7%si, 
> 0.0%st
> Mem:  49376004k total, 29289768k used, 20086236k free,    68340k buffers
> Swap:  2621432k total,        0k used,  2621432k free,   820172k cached
> 
>   PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND
> 17616 root      20   0 27.0g  26g  16g S 621.4 55.8   3532:51 Suricata-Main
> 
> watch 'cat /proc/net/pf_ring/*p4p2* | egrep "Num Free Slots|Tot
> Packets|Tot Pkt Lost"'
> ; First three threads...
> Tot Packets        : 627370957
> Tot Pkt Lost       : 3582014
> Num Free Slots     : 118705
> Tot Packets        : 676753767
> Tot Pkt Lost       : 5292092
> Num Free Slots     : 118745
> Tot Packets        : 665348839
> Tot Pkt Lost       : 3841911
> Num Free Slots     : 118677
> ...
> 
> watch -n 10 'cat stats.log | egrep
> "reassembly_memuse|segment_memcap_drop" | tail -n 32'
> ; First three threads...
> tcp.segment_memcap_drop   | RxPFRp4p21                | 25782329
> tcp.reassembly_memuse     | RxPFRp4p21                | 1610612705
> tcp.segment_memcap_drop   | RxPFRp4p22                | 26161478
> tcp.reassembly_memuse     | RxPFRp4p22                | 1610612705
> tcp.segment_memcap_drop   | RxPFRp4p23                | 25813867
> tcp.reassembly_memuse     | RxPFRp4p23                | 1610612705
> 
> grep 'reassembly_gap' stats.log | tail -n 10
> tcp.reassembly_gap        | RxPFRp4p27                | 777366
> tcp.reassembly_gap        | RxPFRp4p28                | 774896
> tcp.reassembly_gap        | RxPFRp4p29                | 781761
> tcp.reassembly_gap        | RxPFRp4p210               | 776427
> tcp.reassembly_gap        | RxPFRp4p211               | 778734
> tcp.reassembly_gap        | RxPFRp4p212               | 773203
> tcp.reassembly_gap        | RxPFRp4p213               | 781125
> tcp.reassembly_gap        | RxPFRp4p214               | 776043
> tcp.reassembly_gap        | RxPFRp4p215               | 781790
> tcp.reassembly_gap        | RxPFRp4p216               | 783368
> 
> == PF RING ==
> 
> PF_RING Version          : 6.0.1 ($Revision: exported$)
> Total rings              : 16
> 
> Standard (non DNA) Options
> Ring slots               : 65534
> Slot version             : 15
> Capture TX               : No [RX only]
> IP Defragment            : No
> Socket Mode              : Standard
> Transparent mode         : Yes [mode 0]
> Total plugins            : 0
> Cluster Fragment Queue   : 9175
> Cluster Fragment Discard : 597999
> 
> == ~30 min (with changes) ==
> 
> FWIW, when I increase reassembly memcap and time outs to the following...
> 
> flow-timeouts:
>   default:
>     new: 5
>     established: 50
>     closed: 0
>     emergency-new: 1
>     emergency-established: 1
>     emergency-closed: 0
>   tcp:
>     new: 15
>     established: 100
>     closed: 10
>     emergency-new: 1
>     emergency-established: 1
>     emergency-closed: 0
>   udp:
>     new: 5
>     established: 50
>     emergency-new: 1
>     emergency-established: 1
>   icmp:
>     new: 1
>     established: 5
>     emergency-new: 1
>     emergency-established: 1
> 
> reassembly:
>   memcap: 3gb             
>   depth: 5mb
>  
> These are the results, note how there are no more free slots for
> PF_RING. I believe this results in increased packet loss... which is
> likely resulting in my truncated files that I receive when I pull a PDF.
> 
> watch 'cat /proc/net/pf_ring/*p4p2* | egrep "Num Free Slots|Tot
> Packets|Tot Pkt Lost"'
> ; First three threads...
> Tot Packets        : 80281541
> Tot Pkt Lost       : 44290194
> Num Free Slots     : 0
> Tot Packets        : 81926241
> Tot Pkt Lost       : 17412402
> Num Free Slots     : 0
> Tot Packets        : 80108557
> Tot Pkt Lost       : 14667061
> Num Free Slots     : 0
> 
> watch -n 10 'cat stats.log | egrep
> "reassembly_memuse|segment_memcap_drop" | tail -n 32'
> ; First three threads...
> tcp.segment_memcap_drop   | RxPFRp4p21                | 0
> tcp.reassembly_memuse     | RxPFRp4p21                | 1681598708
> tcp.segment_memcap_drop   | RxPFRp4p22                | 0
> tcp.reassembly_memuse     | RxPFRp4p22                | 1681626644
> tcp.segment_memcap_drop   | RxPFRp4p23                | 0
> tcp.reassembly_memuse     | RxPFRp4p23                | 1681597556
> tcp.segment_memcap_drop   | RxPFRp4p24                | 0
> *** Important to note here, the reassembly memuse seems to steadily
> increase overtime. After a few minutes of putting this in it has risen
> to 2022140776 across. Makes me think things are not offloading
> quickly... (timeout/depth issue?)
> 
> grep 'reassembly_gap' stats.log | tail -n 10
> tcp.reassembly_gap        | RxPFRp4p27                | 27603
> tcp.reassembly_gap        | RxPFRp4p28                | 26677
> tcp.reassembly_gap        | RxPFRp4p29                | 26869
> tcp.reassembly_gap        | RxPFRp4p210               | 25031
> tcp.reassembly_gap        | RxPFRp4p211               | 23988
> tcp.reassembly_gap        | RxPFRp4p212               | 23809
> tcp.reassembly_gap        | RxPFRp4p213               | 26420
> tcp.reassembly_gap        | RxPFRp4p214               | 25271
> tcp.reassembly_gap        | RxPFRp4p215               | 26285
> tcp.reassembly_gap        | RxPFRp4p216               | 26848

In 2.0.1 the stream engine should use less memory and clear memory
quicker. Could you try 2.0.1?

Also, if you have multiple vlans on the network, it may be worth trying
to disable:

vlan:
  use-for-tracking: true

I think you've probably checked all or most things on them, but perhaps
these diagrams here can be of some help here:
https://redmine.openinfosecfoundation.org/projects/suricata/wiki/Self_Help_Diagrams

-- 
---------------------------------------------
Victor Julien
http://www.inliniac.net/
PGP: http://www.inliniac.net/victorjulien.asc
---------------------------------------------