Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Delete index when corresponding collection is dropped #79

Closed
enrique-fernandez-polo opened this issue May 16, 2013 · 18 comments
Closed

Comments

@enrique-fernandez-polo
Copy link

We realize that when we drop a MongoDB collection through shell the index is not refreshed. We expect that if I remove a MongoDB collection from the shell then the index should be deleted.

@yvesx
Copy link

yvesx commented May 16, 2013

I had similar concerns. But apparently I can setup up rivers from different mongo collections to the same index.
So removing a mongo collection shouldn't entail deleting an index....

@enrique-fernandez-polo
Copy link
Author

The exact problem we are having is when we drop a collection, delete the index and create the same collection and the same index. The river reads the opslog collection from MongoDB and index unexisting documents that were inserted before the collection drop.

EDIT: Well this is related with this #47

@richardwilly98
Copy link
Owner

Hi,

I have posted the question to MongoDB user group [1].

[1] - https://groups.google.com/forum/?fromgroups#!topic/mongodb-user/-zj9ojkfN_g

Thanks,
Richard.

@enrique-fernandez-polo
Copy link
Author

I've made a fast fork with little changes for getting notified of the drop collection event. I had to get rid of the id field because it's not used in oplog.rs when reporting a drop collection event.

https://github.com/Quaiks/elasticsearch-river-mongodb-drop-collection-event

@richardwilly98
Copy link
Owner

Hi,

Thanks I will merge the changes.

Thanks,
Richard.

On Fri, May 17, 2013 at 10:43 AM, Enrique Fernández-Polo Puertas <
[email protected]> wrote:

I've made a fast fork with little changes for getting notified of the drop
collection event. I had to get rid of the id field because it's not used in
oplog.rs when reporting a drop collection event.

https://github.com/Quaiks/elasticsearch-river-mongodb-drop-collection-event


Reply to this email directly or view it on GitHubhttps://github.com//issues/79#issuecomment-18065513
.

richardwilly98 added a commit that referenced this issue May 17, 2013
- Enable drop collection feature using drop_collection: true in options
attribute (see test-simple-mongodb-river-drop-collection.json)
@missinglink
Copy link

Works for me. es-river-mongodb v1.6.9 mongo v2.4.4

  #!/bin/sh
  curl -XPUT "http://localhost:9200/_river/product-$type/_meta" -d '
  {
    "type": "mongodb",
    "mongodb": {
      "servers": [
        { "host": "127.0.0.1", "port": 27017 }
      ],
      "options": { "secondary_read_preference": true, "drop_collection": true },
      "db": "product",
      "collection": "'$type'"
    },
    "index": {
      "name": "product",
      "type": "'$type'"
    }
  }'

@missinglink
Copy link

Once I've dropped the database, then I re-create the collection, new records don't seem to sync any more.
If I remove the "drop_collection": true setting then all the old deleted records come back...

Is the patch starting sync operations again once the collection is re-created?

@richardwilly98
Copy link
Owner

The river always uses oplog.rs collection.
So I believe when you drop the collection Mongo does not delete the associated entries in oplog.rs but if you drop the database Mongo also drop the associated entries in oplog.rs

Can you please just clarify your scenario before I take a look?

Thanks,
Richard.

@missinglink
Copy link

Sure, I've noticed another issue with this feature too. Maybe I'm using it wrong, let me explain.

Scenario 1

  • Created a mongo db
  • Created a collection
  • Add records to collection
  • Drop database
  • Recreate mapping
  • Recreate river WITHOUT "drop_collection": true

Expected: Zero documents indexed in elastic
Actual: Many documents indexed in elastic

Scenario 2

  • Created a mongo db
  • Created a collection
  • Add records to collection
  • Drop database
  • Recreate mapping
  • Recreate river WITH "drop_collection": true

Expected: Zero documents indexed in elastic
Actual: Zero documents indexed in elastic

  • Add record to collection

Expected: A single document indexed in elastic
Actual: Zero documents indexed in elastic

Scenario 3

  • Created a mongo db
  • Created a collection
  • Add records to collection
  • Drop database
  • Recreate mapping
  • Recreate river WITH "drop_collection": true

Expected: Mapping remains as set when index was created
Actual: Mapping is replaced with a default mapping

@enrique-fernandez-polo
Copy link
Author

This feature only works when dropping collections and no databases.

@richardwilly98
Copy link
Owner

Hi,

@Quaiks is correct the river only monitor "drop collection" action from oplog.rs.

Thanks,
Richard.

@missinglink
Copy link

Cool, thanks I'll test that out.
What about scenario #3 resetting my mapping?

@missinglink
Copy link

I am still having issues with scenario #2, when I use "drop_collection": true then no documents are synced via the river. Can anyone reproduce this or is it just me?

Note: I am not dropping the db, just the collection with:

mongo> use foo
mongo> db.bar.drop()

richardwilly98 added a commit that referenced this issue Jul 15, 2013
@richardwilly98
Copy link
Owner

Hi,

I believe you found a bug. I have committed a fix.
I hope to be able to provide a new release this week.

Thanks,
Richard.

@richardwilly98
Copy link
Owner

Fix is available in release 1.6.11.

Thanks,
Richard.

@missinglink
Copy link

Sorry for the delay, have been enjoying the British heatwave in the Lake District.
Thanks for the bugfix @richardwilly98 I will put it to the test and report back.

@missinglink
Copy link

Great, upgraded to 1.6.11 and collection.drop() seems to work great now.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants