Memory mapped files, managed file pointer and offset

Question

I am a little confused by the Boost library terminology (on Windows). What I am trying to do is simple: create a large file on disk (> 50 GB) and map parts of it separately for write and read operations.

For example, first map a 1 GB portion for writing, then flush it to the hard drive, map a new portion, and so on, while reader applications map different parts of the file and read them without changing anything (no editing).

I am reading the Boost documentation (version 1.47.0, since that is the one we are allowed to use), and I do not understand exactly when to use the memory mapped file classes, such as file_mapping and mapped_region, and when to use the managed mapped file, basic_managed_mapped_file, and offset_ptr.

Can someone please tell me what the difference is between memory mapped files and a managed mapped file, and what each is used for?

Some code examples for these, and for offset_ptr as well, would be greatly appreciated if possible.

Thanks, really ...


c++ boost shared-memory memory-mapped-files interprocess

user2955554 Jan 08 '14 at 9:11


1 answer

sehe · Answer 1 · 2014-01-09T01:05:05+0000

You can use managed_mapped_file to transparently allocate from a memory mapped file.

This means that, for all practical purposes, you often do not need to subdivide your memory areas. It is all virtual memory anyway, so paging takes care of loading the right bits at the right time.

Obviously, if there is a lot of fragmentation, or if accesses jump around a lot, then paging can become a performance bottleneck. In that case, consider subdividing into pools and allocating from those.

Edit: I just noticed that Boost Interprocess supports this with its segregated storage node allocators and adaptive pool node allocators. There are also notes on the implementation of these storage pools here.

Here's a simple starting point that creates a 50 GB file and stuffs some data into it:

    #include <cstdint>
    #include <cstdlib>
    #include <iostream>
    #include <string>
    #include <vector>
    #include <iterator>
    #include <algorithm>
    #include <boost/container/flat_map.hpp>
    #include <boost/container/flat_set.hpp>
    #include <boost/interprocess/managed_mapped_file.hpp>
    #include <boost/container/scoped_allocator.hpp>
    #include <boost/interprocess/containers/string.hpp>
    #include <boost/interprocess/containers/vector.hpp>
    #include <boost/interprocess/sync/named_mutex.hpp>
    #include <boost/interprocess/sync/scoped_lock.hpp>

    namespace bip = boost::interprocess;

    using mutex_type = bip::named_mutex;

    struct X {
        char buf[100];
        double rate;
        uint32_t samples[1024];
    };

    // Allocator and container aliases that place their storage inside the mapped file
    template <typename T>
    using shared_alloc = bip::allocator<T, bip::managed_mapped_file::segment_manager>;
    template <typename T>
    using shared_vector = boost::container::vector<T, shared_alloc<T> >;
    template <typename K, typename V, typename P = std::pair<K, V>, typename Cmp = std::less<K> >
    using shared_map = boost::container::flat_map<K, V, Cmp, shared_alloc<P> >;

    using shared_string = bip::basic_string<char, std::char_traits<char>, shared_alloc<char> >;
    using dataset_t = shared_map<shared_string, shared_vector<X> >;

    // Removes a stale named mutex at startup and cleans it up again at exit
    struct mutex_remove {
        mutex_remove()  { mutex_type::remove("7FD6D7E8-320B-11DC-82CF-39598D556B0E"); }
        ~mutex_remove() { mutex_type::remove("7FD6D7E8-320B-11DC-82CF-39598D556B0E"); }
    } remover;

    static mutex_type mutex(bip::open_or_create, "7FD6D7E8-320B-11DC-82CF-39598D556B0E");

    static dataset_t& shared_instance() {
        bip::scoped_lock<mutex_type> lock(mutex);
        // 50ull rather than 50ul: unsigned long is 32 bits on Windows, so 50ul<<30 would overflow.
        // A 64-bit build is needed for a mapping of this size.
        static bip::managed_mapped_file seg(bip::open_or_create, "./demo.db", 50ull<<30); // "50Gb ought to be enough for anyone"

        static dataset_t* _instance = seg.find_or_construct<dataset_t>("DATA")(
            std::less<shared_string>(),
            dataset_t::allocator_type(seg.get_segment_manager()));

        static auto capacity = seg.get_free_memory();
        std::cerr << "Free space: " << (capacity>>30) << "g\n";

        return *_instance;
    }

    int main() {
        auto& db = shared_instance();

        bip::scoped_lock<mutex_type> lock(mutex);
        auto alloc = db.get_allocator().get_segment_manager();

        std::cout << db.size() << '\n';

        for (int i = 0; i < 1000; ++i) {
            std::string key_ = "item" + std::to_string(i);
            shared_string key(alloc);
            key.assign(key_.begin(), key_.end());

            // Each value is a vector of 0..511 X records stored in the mapped file
            auto value = shared_vector<X>(alloc);
            value.resize(size_t(rand()%(1ul<<9)));

            db.insert(std::make_pair(key, value));
        }
    }

Note that it writes a sparse 50G file. The size actually committed depends on the random vector lengths. My run resulted in roughly 1.1G:

    $ du -shc --apparent-size demo.db
    50G     demo.db
    $ du -shc demo.db
    1,1G    demo.db

Hope this helps
