On Wed 22-11-17 10:35:10, Michal Hocko wrote: [...]
Moreover I am not really sure this is really working properly. Just look at the split_huge_page. It moves all the tail pages to the LRU list while migrate_pages has a list of pages to migrate. So we will migrate the head page and all the rest will get back to the LRU list. What guarantees that they will get migrated as well.
OK, so this is as I've expected. It doesn't work! Some pfn walker based migration will just skip tail pages see madvise_inject_error. __alloc_contig_migrate_range will simply fail on THP page see isolate_migratepages_block so we even do not try to migrate it. do_move_page_to_node_array will simply migrate head and do not care about tail pages. do_mbind splits the page and then fall back to pte walk when thp migration is not supported but it doesn't handle tail pages if the THP migration path is not able to allocate a fresh THP AFAICS. Memory hotplug should be safe because it doesn't skip the whole THP when doing pfn walk.
Unless I am missing something here this looks like a huge mess to me.