// Header names were lost in extraction; these are the headers the code below needs.
#include <algorithm>
#include <array>
#include <chrono>
#include <climits>
#include <functional>
#include <iostream>
#include <iterator>
#include <map>
#include <numeric>
#include <queue>
#include <ranges>
#include <set>
#include <utility>
#include <vector>

// From here on, `int` means `ll` (long long); this is why main() is declared `signed`.
#define int ll
#define INT128_MAX (__int128)(((unsigned __int128) 1 << ((sizeof(__int128) * __CHAR_BIT__) - 1)) - 1)
#define INT128_MIN (-INT128_MAX - 1)
#define pb push_back
#define eb emplace_back
#define clock chrono::steady_clock::now().time_since_epoch().count()

using namespace std;

// Stream output for pairs and containers: elements separated by single spaces.
template<class T1, class T2>
ostream& operator<<(ostream& os, const pair<T1, T2> pr) {
    return os << pr.first << ' ' << pr.second;
}
template<class T, size_t N>
ostream& operator<<(ostream& os, const array<T, N> &arr) {
    for(size_t i = 0; T x : arr) { os << x; if (++i != N) os << ' '; }
    return os;
}
template<class T>
ostream& operator<<(ostream& os, const vector<T> &vec) {
    for(size_t i = 0; T x : vec) { os << x; if (++i != size(vec)) os << ' '; }
    return os;
}
template<class T>
ostream& operator<<(ostream& os, const set<T> &s) {
    for(size_t i = 0; T x : s) { os << x; if (++i != size(s)) os << ' '; }
    return os;
}
template<class T1, class T2>
ostream& operator<<(ostream& os, const map<T1, T2> &m) {
    for(size_t i = 0; pair<T1, T2> x : m) { os << x; if (++i != size(m)) os << ' '; }
    return os;
}

// dbg(a, b) prints "(a, b) = <a>, <b>" to cerr when built with -DDEBUG; otherwise it expands to nothing.
#ifdef DEBUG
#define dbg(...) cerr << '(', _do(#__VA_ARGS__), cerr << ") = ", _do2(__VA_ARGS__)
template<typename T> void _do(T &&x) { cerr << x; }
template<typename T, typename ...S> void _do(T &&x, S&&...y) { cerr << x << ", "; _do(y...); }
template<typename T> void _do2(T &&x) { cerr << x << endl; }
template<typename T, typename ...S> void _do2(T &&x, S&&...y) { cerr << x << ", "; _do2(y...); }
#else
#define dbg(...)
#endif

using ll = long long;
using ull = unsigned long long;
using ldb = long double;
using pii = pair<int, int>;
using pll = pair<ll, ll>;
//#define double ldb

template<typename T> using min_heap = priority_queue<T, vector<T>, greater<T>>;
template<typename T> using max_heap = priority_queue<T>;

// In-place prefix scan with OP (defaults to plus).
template<class rng, class T = ranges::range_value_t<rng>, class OP = plus<T>>
void pSum(rng &&v) {
    if (!v.empty())
        for(T p = v[0]; T &x : v | views::drop(1)) x = p = OP()(p, x);
}
// Same scan with a caller-supplied binary operation.
template<class rng, class T = ranges::range_value_t<rng>, class OP>
void pSum(rng &&v, OP op) {
    if (!v.empty())
        for(T p = v[0]; T &x : v | views::drop(1)) x = p = op(p, x);
}

// Sort and remove duplicates.
template<class rng>
void Unique(rng &v) {
    ranges::sort(v);
    v.resize(unique(v.begin(), v.end()) - v.begin());
}

// Inverse of a 0-based permutation.
template<class rng>
rng invPerm(rng p) {
    rng ret = p;
    for(int i = 0; i < ssize(p); i++) ret[p[i]] = i;
    return ret;
}

// Move v[i] to position p[i].
template<class rng, class rng2>
rng Permute(rng v, rng2 p) {
    rng ret = v;
    for(int i = 0; i < ssize(p); i++) ret[p[i]] = v[i];
    return ret;
}

// Read m edges from stdin into an adjacency list; `base` is the index of the first vertex in the input.
template<bool directed>
vector<vector<int>> readGraph(int n, int m, int base) {
    vector<vector<int>> g(n);
    for(int i = 0; i < m; i++) {
        int u, v;
        cin >> u >> v;
        u -= base, v -= base;
        g[u].emplace_back(v);
        if constexpr (!directed) g[v].emplace_back(u);
    }
    return g;
}

template<typename T> void setBit(T &msk, int bit, bool x) { msk = (msk & ~(T(1) << bit)) | (T(x) << bit); }
template<typename T> void flipBit(T &msk, int bit) { msk ^= T(1) << bit; }
template<typename T> bool getBit(T msk, int bit) { return msk >> bit & T(1); }

// Division rounded toward negative / positive infinity, for any sign of a and b.
template<typename T> T floorDiv(T a, T b) {
    if (b < 0) a *= -1, b *= -1;
    return a >= 0 ? a / b : (a - b + 1) / b;
}
template<typename T> T ceilDiv(T a, T b) {
    if (b < 0) a *= -1, b *= -1;
    return a >= 0 ? (a + b - 1) / b : a / b;
}

// Assign b to a if it improves a; return whether a changed.
template<typename T> bool chmin(T &a, T b) { return a > b ? a = b, 1 : 0; }
template<typename T> bool chmax(T &a, T b) { return a < b ? a = b, 1 : 0; }

int solve() {
    int n, k;
    cin >> n >> k;
    vector<int> a(n);
    for(int &x : a) cin >> x;

    // The total must be divisible by k, otherwise there is no answer.
    if (accumulate(a.begin(), a.end(), 0ll) % k != 0) return -1;

    // c starts as the sum of residues mod k and grows by k until
    // every residue is at most c / k.
    int c = 0;
    for(int x : a) c += x % k;
    {
        auto check = [&]() {
            const int limit = c / k;
            for(int &x : a)
                if (x % k > limit) return false;
            return true;
        };
        while(!check()) c += k;
    }
    dbg(c);

    ranges::sort(a);
    int sum = accumulate(a.begin(), a.end(), 0ll);
    dbg(c);

    // b[i] is a[i] rounded down to a multiple of k; take_0 is the sum of the removed residues.
    vector<int> b = a;
    int take_0 = 0;
    for(int &x : b) take_0 += x % k, x -= x % k;

    int ans = LLONG_MAX;
    // Try each candidate s = c, c + k, ... below c + k * k, then search over r in s + r * k * k.
    for(int s = c; s < c + k * k; s += k) {
        bool tmp = false; // set by the most recent pred call: true if take < sp
        auto pred = [&](int r) {
            int sp = s + r * k * k;
            const int limit = sp / k;
            int take = take_0;
            bool flag = false;
            for(int i = 0; i < n; i++) {
                int y = a[i] - b[i];
                int z = min((limit - y) / k, b[i] / k);
                take += z * k;
                dbg(z, b[i] / k, i, n - k - 1);
                if (k == n or (i == (n - k - 1) and b[i] == z * k)) flag = true;
                if (take >= sp) break;
            }
            tmp = take < sp;
            if (flag) return false;
            else return take < sp;
        };
        //cerr << "a\n";
        //pred(2);
        // Dereferencing the returned iterator yields the first r for which pred is false
        // (or the upper bound of the iota range if there is none).
        int x = *ranges::partition_point(views::iota(0ll, (sum - s) / (k * k)), pred);
        dbg(x);
        pred(x); // re-evaluate at x so that tmp reflects this r
        if (!tmp) chmin(ans, (s + x * k * k) / k);
        /*
        dbg(s, ans, x);
        if (s == 12) {
            cout << '\n';
            dbg(pred(1));
            cout << '\n';
        }
        */
    }
    return ans == LLONG_MAX ? -1 : ans;
}

signed main() {
    ios::sync_with_stdio(false), cin.tie(NULL);
    int t;
    cin >> t;
    while(t--) cout << solve() << '\n';
    return 0;
}